Create Amazon EMR Cluster

Now that we finished setting up Identity Provider Trust, it's time to create an Amazon EMR cluster integrating with AWS Lake Formation. Upon successful completion of the CloudFormation template below, it will create the following IAM resources: Log out and log back into AWS console using TeamRole (if hashcode is given, use chapter AWS Event to login) or as Administrator, which you used for the Self Paced Labs.

This template is created for us-east-1 region (N.Virginia) and will not work in other regions.

Executing the CloudFormation template
Follow these steps to get to know more about this CloudFormation template and execute the template.
  • The CloudFormation template will take a few parameters as input. The first two parameters are very important - (1) Your SAML identity provider & (2) SAML identity provider metadata path.
  • The CloudFormation stack will roughly take 10-12 minutes to complete. Check the CloudFormation console and wait for the status CREATE_COMPLETE as shown below:
  • Once the stack creation is completed, your AWS account will have all the required resources to run this exercise. Take a note of the EMR Master node DNS and Notebooks bucket name from the output tab.
  • The CloudFormation template also shows the AWS console IAM login link. Use that link to switch between different users to run this exercise.