Crawl Informatica CDI assets
Create a crawler workflow to automatically discover and catalog your Informatica Cloud Data Integration assets, including projects, workflows, and data lineage.
Prerequisites
Before you begin, verify you have:
- Completed the Set up Informatica CDI guide
- Access to your Informatica Cloud environment
- Parameter files downloaded from your Secure Agent machines
Create crawler workflow
Create a new workflow and select Informatica CDI as your connector source.
-
In the top-right corner of any screen, select New > New Workflow.
-
From the list of packages, select Informatica CDI Assets > Setup Workflow.
Configure authentication
Set up secure access to your Informatica Cloud environment by providing connection credentials.
-
In the Host field, enter your Informatica CDI domain without the protocol or sub-region.
Example
If your full URL is:
https://usw1.dmp-us.informaticacloud.com/
Enter only:
dmp-us.informaticacloud.com
-
Enter the Username and Password for the user you created in the Set up Informatica CDI guide.
-
Select Test Authentication to verify connectivity to Informatica CDI.
-
After successful authentication, select Next.
Configure connection
Set up connection management and define who can access and manage this connection.
- Enter a Connection Name that represents your source environment. For example, use values like production, development, gold, or analytics.
- To modify who can manage this connection, update the users or groups listed under Connection Admins. If you don't specify any user or group, no one can manage the connection, including admins.
- Select Next to continue.
Configure crawler
Set up what to crawl and configure advanced options for accurate lineage generation.
- Configure metadata filters using the Include Metadata and Exclude Metadata fields. If an asset appears in both fields, the exclude metadata field takes precedence.
- Include Metadata: Select the projects or folders you want to include in crawling. This defaults to all assets if none are specified.
- Exclude Metadata: Select the projects or folders you want to exclude from crawling. This defaults to no assets if none are specified.
- Configure advanced options for uploading parameter files:
- Upload parameter files used by the Informatica CDI projects or folders in a compressed format.
- MIME types: Windows ZIP or Linux Zip
Run crawler
Execute the crawler to discover and catalog your Informatica CDI assets.
-
To run the crawler immediately, select Run.
-
To schedule the crawler to run hourly, daily, weekly, or monthly, select Schedule & Run.
After the crawler completes, you can view the assets on Atlan's asset page.
See also
- What does Atlan crawl from Informatica CDI: Understand the metadata and assets discovered during crawling