Amazon EMR cluster setup guide
This section provides instructions to connect an Amazon EMR cluster to Unravel SaaS.
On the AWS console, select the EMR service and click Create cluster.
In the Create Cluster - Quick Options screen, click Go to advanced options.
In Step 1: Software and Steps, select any release up to emr-5.29.0.
In Step 2: Hardware, enter a configuration for your EMR cluster and click Next.
In Step 3: General Cluster Settings, go to Add Bootstrap Action, specify the following settings, and click Add. The Amazon EMR cluster runs this bootstrap action when it starts.
Setting
Action
Name
Select Custom action.
Script location
Specify the following bootstrap location:
s3://unravel-saas-bootstrap/unravel_emr_bootstrap.py
Note
To monitor MapReduce (MR) jobs, you must also pass the optional argument --all.
Optional arguments
Enter the following:
--unravel-server ENTER-YOUR-UNRAVEL-INSTANCE-HOSTNAME-HERE
Click Configure and add.
In Step 4: Security, configure the security of the cluster as required. For example:
Choose the EC2 key pair.
Select the EC2 security groups. The Amazon EMR service automatically adds any additional rules that the EMR nodes require.
Click Create cluster. After the bootstrap process finishes, the new EMR cluster enters the Waiting state.
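The console steps above can also be scripted. The following is a minimal sketch in Python, assuming the boto3 library and the default EMR roles (EMR_DefaultRole and EMR_EC2_DefaultRole) are available in your account. The cluster name, bootstrap action label, instance types, instance counts, application list, and region are illustrative assumptions and do not come from this guide. Only the script location, the --unravel-server argument, the optional --all argument, and the emr-5.29.0 release limit are taken from the steps above. Placing --all after the hostname is also an assumption.

```python
from typing import Any, Dict, List

# Bootstrap script location given in Step 3 of this guide.
BOOTSTRAP_SCRIPT = "s3://unravel-saas-bootstrap/unravel_emr_bootstrap.py"


def bootstrap_action(unravel_host: str, monitor_mr: bool = False) -> Dict[str, Any]:
    """Build the bootstrap action described in Step 3."""
    args: List[str] = ["--unravel-server", unravel_host]
    if monitor_mr:
        # Needed only when MR jobs should be monitored; position is an assumption.
        args.append("--all")
    return {
        "Name": "unravel-bootstrap",  # hypothetical label, not from the guide
        "ScriptBootstrapAction": {"Path": BOOTSTRAP_SCRIPT, "Args": args},
    }


def cluster_request(unravel_host: str, key_pair: str,
                    monitor_mr: bool = False) -> Dict[str, Any]:
    """Assemble keyword arguments for the EMR RunJobFlow API."""
    return {
        "Name": "unravel-emr-demo",            # hypothetical cluster name
        "ReleaseLabel": "emr-5.29.0",          # Step 1: any release up to emr-5.29.0
        "Applications": [{"Name": "Spark"}],   # assumption: choose your own applications
        "Instances": {                         # Step 2: example hardware only
            "InstanceGroups": [
                {"Name": "master", "InstanceRole": "MASTER",
                 "InstanceType": "m5.xlarge", "InstanceCount": 1},
                {"Name": "core", "InstanceRole": "CORE",
                 "InstanceType": "m5.xlarge", "InstanceCount": 2},
            ],
            "Ec2KeyName": key_pair,            # Step 4: EC2 key pair
            # Keep the cluster alive so it settles in the Waiting state.
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        "BootstrapActions": [bootstrap_action(unravel_host, monitor_mr)],
        "JobFlowRole": "EMR_EC2_DefaultRole",  # assumption: default EMR roles exist
        "ServiceRole": "EMR_DefaultRole",
    }


def create_cluster(request: Dict[str, Any], region: str = "us-east-1") -> str:
    """Submit the request and block until the cluster is running or waiting."""
    import boto3  # imported here so the builders above work without boto3 installed

    emr = boto3.client("emr", region_name=region)
    cluster_id = emr.run_job_flow(**request)["JobFlowId"]
    emr.get_waiter("cluster_running").wait(ClusterId=cluster_id)
    return cluster_id
```

For example, create_cluster(cluster_request("ENTER-YOUR-UNRAVEL-INSTANCE-HOSTNAME-HERE", "my-key-pair")) would submit the request, where my-key-pair is a placeholder for your own EC2 key pair. Keeping the request builder separate from the API call lets you inspect the bootstrap arguments before any cluster is launched.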