Open main menu

CDOT Wiki β

Changes

GPU621/Apache Spark

107 bytes added, 14:48, 30 November 2020
Data Processing
Spark is easier to program and includes an interactive mode. It has various pre-built APIs for Java, Scala, and Python. Hadoop MapReduce is harder to program but there are some tools available to make it easier.
== Data Processing Cost ==According to benchmarks, Spark is more cost-effective as it requires less hardware to perform the same tasks faster. 
== Security ==