Configuring Catalog Integration with Amazon S3

About this course

Instructor: Rebecca Golden, Senior Technical Trainer

Course Objectives:

  • Explain the AWS services Collibra uses for its DGC integration with S3
  • Configure the appropriate IAM roles and policies needed for integration
  • Register and synchronize an S3 File System within Catalog

In this course, we discuss the requirements for ingesting metadata about data held in Amazon Simple Storage Service, more commonly known as S3, into your DGC environment. By leveraging AWS services such as Glue, Identity and Access Management (IAM), and Athena, you can automate the provisioning of data access once a data analyst’s request for a data set has been approved.

Collibra’s DGC leverages AWS Glue, an ETL service, to create and expose metadata about the data stored in your S3 buckets, including the file system structure, and makes that metadata visible to your DGC users. Using IAM, DGC users can see that metadata and shop for data sets within the report catalog. If a user requests access to a data set, you can provision temporary credentials through IAM; the user is granted those permissions and can then use a service such as Amazon Athena to write reports in Tableau.
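To make the IAM piece more concrete, here is a minimal sketch of what a read-only policy document for this kind of integration might look like. It is an illustration only: the course description does not list the exact permissions Collibra requires, so the set of actions, the helper name build_read_only_policy, and the bucket name example-data-lake-bucket are all assumptions. The sketch only builds the JSON document; it does not call AWS.

```python
import json


def build_read_only_policy(bucket_name: str) -> dict:
    """Build an IAM policy document (as a dict) that allows listing one
    S3 bucket, reading its objects, and reading Glue Data Catalog metadata.

    The action list is an assumption for illustration, not Collibra's
    documented requirement.
    """
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Listing applies to the bucket itself.
                "Sid": "ListBucket",
                "Effect": "Allow",
                "Action": ["s3:ListBucket"],
                "Resource": [f"arn:aws:s3:::{bucket_name}"],
            },
            {
                # Reading applies to the objects inside the bucket.
                "Sid": "ReadObjects",
                "Effect": "Allow",
                "Action": ["s3:GetObject"],
                "Resource": [f"arn:aws:s3:::{bucket_name}/*"],
            },
            {
                # Read-only access to Glue Data Catalog metadata.
                "Sid": "ReadGlueCatalog",
                "Effect": "Allow",
                "Action": [
                    "glue:GetDatabase",
                    "glue:GetDatabases",
                    "glue:GetTable",
                    "glue:GetTables",
                ],
                "Resource": ["*"],
            },
        ],
    }


if __name__ == "__main__":
    # Hypothetical bucket name, used only for this example.
    policy = build_read_only_policy("example-data-lake-bucket")
    print(json.dumps(policy, indent=2))
```

In practice you would likely narrow the Glue resource from "*" to specific catalog, database, and table ARNs, and attach the resulting policy to the role used for the integration.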

Collibra makes it easy for data citizens to find, understand and trust the organizational data they need to make business decisions every day. Unlike traditional data governance solutions, Collibra is a cross-organizational platform that breaks down the traditional data silos, freeing the data so all users have access.

©2020 Collibra. All Rights Reserved.

