Create an Atlas Data Lake Pipeline
On this page
Looking for documentation for what used to be called "Atlas Data Lake"? Atlas Data Lake is now called Atlas Data Federation. To learn more about the renamed federated query engine service, see Atlas Data Federation.
You can create Atlas Data Lake pipelines using the Atlas UI, Data Lake Pipelines API, and the Atlas CLI. This page guides you through the steps for creating an Atlas Data Lake pipeline.
Prerequisites
Before you begin, you must have the following:
Backup-enabled
M10
or higher Atlas cluster.Project Owner
role for the project for which you want to deploy a Data Lake.Sample data loaded on your cluster (if you wish to try the example in the following Procedure).
Procedure
Next steps
Now that you've created your Data Lake pipeline, proceed to Set Up a Federated Database Instance for Your Dataset.