CDOT Wiki β

GPU621/Apache Spark

8 bytes removed, 13:38, 30 November 2020
Overview: Spark vs Hadoop
Spark MLlib is a distributed machine-learning framework built on top of Spark Core. It provides ML algorithms for regression, clustering, and classification, which can be applied to large datasets to extract meaningful insights.
== Overview: Spark vs Hadoop ==
=== Advantage and Disadvantages ===
=== Parallelism ===
=== Performance ===
== Spark vs Hadoop Wordcount Performance ==