33
edits
Changes
→Finance and Stock trading Use Case
Spark is advanced data processing/analysis model which is replacing MapReduce <br />
Spark does not have its own file system so it run on the top of HDFS <br />
[[File:10a.PNG]]
=== Spark vs MapReduce ===
[[File:3.PNG]]
== Features ==
In memory computations <br />
Faster than MapReduce for complex application on disks <br />
[[File:2abc.png ]]
== Resilient Distributed Datasets (RDDs) ==
<b> Transformations </b> <br />
Create a new data set from existing one <br />
[[File:5bc.PNG ]]
<b> Actions </b> <br />
Return a value to the driver program after running computation on data set <br />
[[File:6.PNG]] These examples and more are found at https://spark.apache.org/docs/latest/rdd-programming-guide.html == Examples & == === Word Count === [[File:4.PNG]] Using transformations ( flatmap, map, reduceByKey ) to build a data set of string and int pairs. It is then saved into a file === Finance and Stock trading Use Case ===