Collibra Catalog

Instructor:

Peter Princen, Product Manager

Objectives:

  • Explain approved data sets
  • Summarize ingestion of data
  • Demonstrate shopping for data

Learn everything about how to use the Collibra Catalog, which helps you find, understand, and trust the data you need. Collibra Catalog is a trusted, single source of intelligence for data experts and other data citizens who need quick access to enterprise data. By cataloging approved and trusted data sets and making them easily discoverable through semantic search, Collibra Catalog provides a new way for data users to find and access the data they need, evaluate its lineage, and even enrich its value.

Collibra Catalog 5.x: Rapid Data Discovery

Instructor:

Peter Princen, Product Manager

 Objectives:

  • Organize ingesting from data source
  • Experiment with Smart Catalog
  • Make use of the Data Dictionary

Description:

Learn everything about how to use the Collibra Catalog including registering a data source, such as CSV, Excel and JDBC support. Schedule a data refresh for the newly registered data sources with encrypted credentials. Take advantage of Smart Catalog, which will suggest recommended data sets and assets based on the browsing history of the logged in users and their peers. The Catalog also allows for shopping for data, with recommendations of data sets which may be of interest to you. Add data sets to your shopping basket and request access from the data owners. Access the Data Dictionary to review all the schemas you have registered within Collibra Platform and add them to existing data sets or create new ones. Finally, review the data profiling and data sampling available, to assess parameters of importance and review the corresponding graph.

Collibra Catalog and Tableau Integration

Instructor:

Yulia Prylypko, Inbound Product Manager

Course Objectives:

  • Explain Tableau integration process
  • Show lineage of Tableau assets
  • Certify Tableau reports

Description:

We will register the data source system as Tableau Server. By registering all physical data sources in Catalog, they become more easily discoverable by storing profiles and samples. Next, we will assign all Tableau Projects, Workbooks and Views to a community. Tableau sites will be synchronized to Catalog, allowing you to view the lineage of the assets using Traceability diagrams. As a result, data sets can be combined from different sources and Tableau reports can be certified. The integration allows for your Tableau data to be readily available in the Collibra Platform.

Google Cloud Platform Integration

Instructor:

Kai Manners, Presales Engineer

Objectives:

  • Assess the integration between the Collibra Platform and BigQuery
  • Select a schema for integration into BigQuery
  • Explain how to profile data from BigQuery

Description:

Collibra’s partnership with Google allows for easy integration between the Collibra Platform and Google Cloud Products. This course will specifically cover how to integrate the Collibra Platform with Google Cloud Product’s data warehouse BigQuery. We’ll cover how to ingest a schema into BigQuery, how to ingest a dataset from BigQuery into the Collibra Platform, how to profile the new data in Collibra, as well as how to exclude certain tables from ingestion. We’ll conclude with a demonstration of using Collibra for Desktop to then find the newly-ingested data.

Healthcare Data Sets with Collibra Catalog

Instructor:

Justin Washburn, Presales Engineer

Objectives:

  • Demonstrate ingesting data
  • Outline data profiling
  • Explain post data ingestion workflow

Description:

We will review Catalog features in 5.1, including data profiling, tagging and recommending business terms that may match ingested business assets. We will be using an ACO Inpatient file to demonstrate ingesting the data, creating a data profile and storing sample data sets. In addition to the data catalog ingestion demonstration, we will also review the post data ingestion workflow. And of course, the demonstrations will include the data profiling, linking business terms to imported schemas, and how tagging can be used to improve searchability and quick access to results.

Move Data to AWS for Tableau Analytics

Instructor:

Peter Princen, Sr. Product Manager


Course Objectives:

  • Explain onboarding ERP cloud data into data lake built on AWS S3
  • Utilize Amazon Athena to access data in AWS S3 data lake
  • Examine complete lineage of Tableau workbook and source systems

Description:

In this course, we will review a user journey of a business analyst that needs to make a report on sales forecasts in the domain of supply chain. The use case will show that not only Catalog will be used to find the correct data, but it will actually manage the whole process of looking for data, requesting new data to be onboarded into the data lake where our analyst can actually access it, and leading up to creating a report in a BI tool of choice. In our example the BI tool will Tableau, and the data lake will be built on AWS S3. At the end of our journey, the analyst can access the data that is stored on AWS S3.

Registering, Ingesting and Profiling JDBC Driver for Snowflake

Instructor:

Mahmoud Romeh, Backend Engineer

Course Objectives:

  • Inspect ingesting a Snowflake data source
  • Analyze required connection properties between Collibra Catalog and Snowflake
  • Examine data profiling

Description:

In this course, we will demonstrate ingesting and profiling a Snowflake data source through CData driver. We now have a new option in Catalog when you are registering a data source and would like to use CData Driver. Simply use the sample default wizard that now provides Register data source (use a Collibra provided driver) and from there you can upload your CData Driver. In this demo, we use the Snowflake CData Driver and will properly configure the connection parameters, the connection URL and the additional mandatory parameters needed for the configuration and successful connection to a Snowflake data source to ingest and profile.

Supply Chain Use Case: Tableau Integration

Instructor:

Yulia Prylypko, Product Manager

Course Objectives:

  • Understand the Tableau metamodel for integration with Collibra Catalog
  • Explain how to register Tableau within the Collibra Catalog
  • Examine Tableau assets, report attributes, logical layer, and workflow

Description:

To help you understand and trust your reports, Collibra provides direct integration between the Tableau server and Collibra Catalog. This course will cover the metadata model Collibra built for Tableau, how to register with a Tableau server either on-premises or online, and how to review the assets, report attributes, logical layer, and report workflow once synchronization is complete.

Collibra Catalog: Empowering Technical and Business User Collaboration

Instructor:

Vasiliki Nikolopoulou, Presales Engineer

Objectives:

  • Relate collaboration between technical and business
  • Explain ingestion through various data sources
  • Show data quality on assets

Description:

Learn everything about the Collibra Catalog, which empowers collaboration between business and technical people. While business and technical people often have different vocabularies, goals and outcomes; they now have a platform to collaborate and manage assets in productive and efficient manner. A company’s success greatly depends on efficient collaboration, because so much of the work relies on the technical aspects like applications and data.The Catalog facilitates communication and workflow through the structure and automation it provides. Users can define the processes that can bring business content to technical assets through automated procedures. Data sets can be ingested through various data sources, which triggers a workflow for validation and augmentation by the business users. The automated process eliminates the need for additional meetings and emails, allowing the user to track and monitor the assets at their own pace. All Data Governance team members can communicate, track assets, escalations and data requests through the Collibra Catalog, completing their assigned tasks and documenting all necessary steps in the process.

Configuring Catalog Integration with Amazon S3

Instructor:

Rebecca Golden, Senior Technical Trainer

Objectives:

  • Explain the AWS services Collibra uses for its DGC integration with S3
  • Configure the appropriate IAM roles and policies needed for integration
  • Register and synchronize an S3 File System within Catalog

Description:

The Collibra Platform leverages AWS Glue, which is an ETL service, to create and expose metadata about the data stored in your S3 buckets and provides visibility of that metadata to your Collibra Platform users, including the file system structure. Using IAM, Collibra users will have access to see that metadata and shop for data sets within the report catalog. If they request access to the data set, you can actually provision temporary credentials through IAM and the user can be granted those permissions and use a service like AWS Athena to write reports inside of Tableau.

>