GPU621/Apache Spark

8 bytes removed, 00:47, 23 November 2020
== Apache Spark ==
== Spark ==
[https://spark.apache.org/ '''Apache Spark'''] is a unified analytics engine for large-scale data processing. It is an open-source, general-purpose cluster-computing framework that provides an interface for programming entire clusters with implicit data parallelism and fault tolerance. Since its inception, Spark has become one of the most widely used distributed processing frameworks for big data. It can be deployed in a variety of ways, provides high-level APIs in Java, Scala, Python, and R, and supports SQL, streaming data, machine learning, and graph processing.
=== Architecture ===
# Nov 9, 2020 - Added project description
# Nov 20, 2020 - Added outline and subsections
= References =