Allow Data Filtering
Lake Formation supports column-level permissions to restrict access to specific columns in a table. Integrated
analytics services like Amazon Athena, Amazon Redshift Spectrum, and Amazon EMR retrieve non-filtered table
metadata from the AWS Glue Data Catalog. The actual filtering of columns in query responses is the responsibility of the integrated service.
By default, this filter is opt out for Amazon EMR. Amazon EMR clusters will not be able to access data in Amazon S3 locations that are registered with Lake Formation.
In this step, we enable this filter as in this lab, you will use EMR Notebooks and Apache Zeppelin on
Amazon EMR to access data managed by Lake Formation.
- Continue in the Lake Formation console at https://console.aws.amazon.com/lakeformation/. Ensure
that you are signed in as the lf-admin user.
- Check the box for Allow Amazon EMR clusters to filter data managed by Lake Formation. and enter your
account id under AWS Account IDs box. Click on Save and proceed to the next chapter.
