Open main menu

CDOT Wiki β

Changes

GPU621/Apache Spark Fall 2022

336 bytes added, 20:37, 3 December 2022
Deploy Apache Spark Application On AWS
From here, I will assume you have an AWS service account and that you have basic knowledge about AWS services like how to use S3 bucket, or how to add role or policy to services.
Also, you will need to have basic knowledge about SSH and Linux commands.
 
===Create an EMR cluster===
Search and choose EMR on AWS service panel.
-IMAGE-
Click the Create Cluster button.
 
Enter as cluster name and choose a release version. Here I will choose the EMR-5.11.1 for the Release version. For the application, you can see that there are many options, we will choose Spark as this is our main topic.
92
edits