Open main menu

CDOT Wiki β

Changes

GPU621/Apache Spark

No change in size, 09:07, 30 November 2020
|300px|alt="Hadoop" vs |200px|alt="Spark"
# Daniel Park
= [[File:Hadooplogo.png||300px]|alt="Hadoop"]] vs [[File:Apache Spark logo.svg.png||200px]|alt="Spark"]] =
'''MapReduce''' was famously used by Google to process massive data sets in parallel on a distributed cluster in order to index the web for accurate and efficient search results. '''Apache Hadoop''', the open-source platform inspired by Google’s early proprietary technology has been one of the most popular big data processing frameworks. However, in recent years its usage has been declining in favor of other increasingly popular technologies, namely '''Apache Spark'''.