Adding steps to a cluster with the Amazon EMR Management Console
Use the following procedures to add steps to a cluster with the AWS Management Console. For
detailed information about how to submit steps for specific big data applications, see
the Amazon EMR Release Guide.
Add steps during cluster creation
From the AWS Management Console, you can add steps when you create a cluster.
To add steps when you create a cluster with the console
- Sign in to the AWS Management Console, and open the Amazon EMR console at https://eusc-de-east-1.console.amazonaws-eusc.eu/emr.
- Under EMR on EC2 in the left navigation pane, choose Clusters, and then choose Create cluster.
- Under Steps, choose Add step. Enter appropriate values in the fields in the Add step dialog. For information on formatting your step arguments, see Add step arguments. Options differ depending on the step type. To add your step and exit the dialog, select Add step.
- Choose any other options that apply to your cluster.
- To launch your cluster, choose Create cluster.
Add steps to a running cluster
With the AWS Management Console, you can add steps to a running cluster that has the
auto-terminate option disabled.
To add steps to a running cluster with the console
- Sign in to the AWS Management Console, and open the Amazon EMR console at https://eusc-de-east-1.console.amazonaws-eusc.eu/emr.
- Under EMR on EC2 in the left navigation pane, choose Clusters, and select the cluster that you want to update.
- On the Steps tab on the cluster details page, select Add step. To clone an existing step, choose the Actions dropdown menu and select Clone step.
- Enter appropriate values in the fields in the Add step dialog. Options differ depending on the step type. To add your step and exit the dialog, choose Add step.
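For comparison, a step like the ones above can also be described in code. The following is a minimal sketch, assuming the boto3 SDK and its `add_job_flow_steps` call. The helper names `build_step` and `submit_step`, the step name, the `CONTINUE` failure action, the use of `command-runner.jar`, and the cluster ID placeholder are illustrative assumptions and do not come from this guide.

```python
from typing import Any, Dict, List


def build_step(name: str, args: List[str],
               action_on_failure: str = "CONTINUE") -> Dict[str, Any]:
    """Build one step definition in the shape boto3's EMR client accepts.

    Each element of `args` is a single argument, so an argument that
    contains whitespace stays one list element and needs no extra quoting.
    """
    return {
        "Name": name,
        "ActionOnFailure": action_on_failure,
        "HadoopJarStep": {
            # Assumption: the step runs a command through command-runner.jar.
            "Jar": "command-runner.jar",
            "Args": list(args),
        },
    }


def submit_step(cluster_id: str, step: Dict[str, Any]) -> str:
    """Submit the step to a running cluster and return the new step ID.

    Assumes boto3 is installed and AWS credentials are configured.
    """
    import boto3  # imported lazily so build_step works without the SDK

    emr = boto3.client("emr")
    response = emr.add_job_flow_steps(JobFlowId=cluster_id, Steps=[step])
    return response["StepIds"][0]


step = build_step(
    "Copy script",  # hypothetical step name
    ["bash", "-c", "aws s3 cp s3://amzn-s3-demo-bucket/my-script.sh ."],
)
# submit_step("j-XXXXXXXXXXXXX", step)  # placeholder cluster ID
```

Only `build_step` runs without network access. The `submit_step` call is left commented out because it needs a real cluster ID and credentials.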
Modify the step concurrency level in a running cluster
With the AWS Management Console, you can modify the step concurrency level in a running
cluster.
Running multiple steps in parallel requires Amazon EMR version 5.28.0 or later.
To modify step concurrency in a running cluster with the console
- Sign in to the AWS Management Console, and open the Amazon EMR console at https://eusc-de-east-1.console.amazonaws-eusc.eu/emr.
- Under EMR on EC2 in the left navigation pane, choose Clusters, and select the cluster that you want to update. The cluster must be running to change its concurrency attribute.
- On the Steps tab on the cluster details page, find the Attributes section. Select Edit to change the concurrency. Enter a value between 1 and 256.
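The two constraints in this section (a release of 5.28.0 or later, and a value between 1 and 256) can be checked before a change is attempted. The following is a minimal sketch, assuming release labels of the form `emr-5.28.0` and the boto3 `modify_cluster` call with its `StepConcurrencyLevel` parameter. The helper names `parse_release`, `check_concurrency`, and `set_concurrency` are hypothetical.

```python
from typing import Tuple


def parse_release(release_label: str) -> Tuple[int, ...]:
    """Turn a label such as 'emr-5.28.0' into a comparable tuple (5, 28, 0)."""
    prefix = "emr-"
    version = release_label[len(prefix):] if release_label.startswith(prefix) else release_label
    return tuple(int(part) for part in version.split("."))


def check_concurrency(level: int, release_label: str) -> int:
    """Return `level` if it satisfies both constraints, else raise ValueError."""
    if parse_release(release_label) < (5, 28, 0):
        raise ValueError("parallel steps require Amazon EMR 5.28.0 or later")
    if not 1 <= level <= 256:
        raise ValueError("step concurrency must be between 1 and 256")
    return level


def set_concurrency(cluster_id: str, level: int, release_label: str) -> int:
    """Apply the new level to a running cluster and return the level in effect.

    Assumes boto3 is installed and AWS credentials are configured.
    """
    import boto3  # imported lazily so the checks run without the SDK

    emr = boto3.client("emr")
    response = emr.modify_cluster(
        ClusterId=cluster_id,
        StepConcurrencyLevel=check_concurrency(level, release_label),
    )
    return response["StepConcurrencyLevel"]
```

For example, `check_concurrency(10, "emr-5.28.0")` returns `10`, while a level of `0` or a release label of `emr-5.27.0` raises `ValueError`.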
Add step arguments
When you use the AWS Management Console to add a step to your cluster, you can specify
arguments for that step in the Arguments field. Separate arguments with whitespace, and
surround any string argument that itself contains whitespace with quotation marks so
that it is treated as a single argument.
Example: Correct arguments
The following example arguments are formatted correctly for the AWS Management Console,
with quotation marks around the final string argument.
bash -c "aws s3 cp s3://amzn-s3-demo-bucket/my-script.sh ."
You can also put each argument on a separate line for readability as shown in
the following example.
bash
-c
"aws s3 cp s3://amzn-s3-demo-bucket/my-script.sh ."
Example: Incorrect arguments
The following example arguments are improperly formatted for the AWS Management Console.
Notice that the final string argument, aws s3 cp
s3://amzn-s3-demo-bucket/my-script.sh ., contains whitespace and is
not surrounded by quotation marks.
bash -c aws s3 cp s3://amzn-s3-demo-bucket/my-script.sh .
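To see why the quotation marks matter, the following sketch tokenizes the example strings with Python's `shlex.split`, which applies POSIX shell quoting rules. This is an approximation chosen for illustration. The console's own argument parser is not described here and may differ in details.

```python
import shlex

# The correctly quoted example, on one line and split across lines.
correct = 'bash -c "aws s3 cp s3://amzn-s3-demo-bucket/my-script.sh ."'
correct_multiline = 'bash\n-c\n"aws s3 cp s3://amzn-s3-demo-bucket/my-script.sh ."'

# The incorrect example, with no quotation marks around the command string.
incorrect = "bash -c aws s3 cp s3://amzn-s3-demo-bucket/my-script.sh ."

correct_args = shlex.split(correct)
multiline_args = shlex.split(correct_multiline)
incorrect_args = shlex.split(incorrect)

# Print the argument count and the arguments for each form.
print(len(correct_args), correct_args)
print(len(multiline_args), multiline_args)
print(len(incorrect_args), incorrect_args)
```

Under these rules, both correct forms yield three arguments, with the whole `aws s3 cp` command as the third. The unquoted form yields seven, so `bash -c` would receive only `aws` as its command string.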