
Crawl Informatica CDI assets

Create a crawler workflow to automatically discover and catalog your Informatica Cloud Data Integration assets, including projects, workflows, and data lineage.

Prerequisites

Before you begin, verify you have:

  • Completed the Set up Informatica CDI guide
  • Access to your Informatica Cloud environment
  • Parameter files downloaded from your Secure Agent machines

Create crawler workflow

Create a new workflow and select Informatica CDI as your connector source.

  1. In the top-right corner of any screen, select New > New Workflow.

  2. From the list of packages, select Informatica CDI Assets > Setup Workflow.

Configure authentication

Set up secure access to your Informatica Cloud environment by providing connection credentials.

  1. In the Host field, enter your Informatica CDI domain without the protocol (https://) or the sub-region prefix (usw1 in the example below).

    Example

    If your full URL is:

    https://usw1.dmp-us.informaticacloud.com/

    Enter only:

    dmp-us.informaticacloud.com

  2. Enter the Username and Password for the user you created in the Set up Informatica CDI guide.

  3. Select Test Authentication to verify connectivity to Informatica CDI.

  4. After successful authentication, select Next.
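
If you want to derive the Host value from a full URL rather than trimming it by hand, the rule in step 1 can be sketched in Python as follows. This is a minimal illustration, not part of Atlan or Informatica: the function name host_field_value is hypothetical, and it assumes the sub-region is always the first dot-separated label of the hostname, as in the example above.

```python
from urllib.parse import urlparse


def host_field_value(full_url: str) -> str:
    """Return the domain with the protocol and the leading sub-region removed.

    Assumption: the sub-region is the first dot-separated label of the
    hostname (for example "usw1"). Verify this for your own environment.
    """
    hostname = urlparse(full_url).hostname
    if hostname is None:
        # urlparse finds no hostname when the protocol is missing.
        raise ValueError(f"expected a full URL with a protocol, got {full_url!r}")
    _sub_region, _, domain = hostname.partition(".")
    if not domain:
        raise ValueError(f"hostname {hostname!r} has no sub-region to strip")
    return domain


if __name__ == "__main__":
    print(host_field_value("https://usw1.dmp-us.informaticacloud.com/"))
    # prints: dmp-us.informaticacloud.com
```

Check the result against your Informatica Cloud login URL before you paste it into the Host field.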

Configure connection

Set up connection management and define who can access and manage this connection.

  1. Enter a Connection Name that represents your source environment. For example, use values like production, development, gold, or analytics.
  2. To modify who can manage this connection, update the users or groups listed under Connection Admins. If you don't specify any user or group, no one can manage the connection, including admins.
  3. Select Next to continue.

Configure crawler

Set up what to crawl and configure advanced options for accurate lineage generation.

  1. Configure metadata filters using the Include Metadata and Exclude Metadata fields. If an asset appears in both fields, the Exclude Metadata field takes precedence.
    • Include Metadata: Select the projects or folders you want to include in crawling. If you select none, all assets are included.
    • Exclude Metadata: Select the projects or folders you want to exclude from crawling. If you select none, no assets are excluded.
  2. Under the advanced options, upload your parameter files:
    • Upload the parameter files used by the Informatica CDI projects or folders as a compressed archive.
    • Supported MIME types: Windows ZIP or Linux ZIP
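
The precedence rule for the Include Metadata and Exclude Metadata fields in step 1 can be reasoned about with a short Python sketch. It only models the documented behavior (exclude wins, an empty include list means everything, an empty exclude list means nothing); the function name is_crawled and the slash-separated project/folder paths are assumptions made for illustration, and the crawler's actual matching logic may differ.

```python
def is_crawled(asset_path: str, include: set[str], exclude: set[str]) -> bool:
    """Model the documented filter rules for a single asset.

    Assumption: projects and folders are identified by slash-separated
    paths, and selecting a folder also selects everything beneath it.
    """

    def matches(selection: set[str]) -> bool:
        return any(
            asset_path == entry or asset_path.startswith(entry + "/")
            for entry in selection
        )

    # Exclude Metadata takes precedence over Include Metadata.
    if matches(exclude):
        return False
    # An empty Include Metadata selection defaults to all assets.
    return not include or matches(include)


if __name__ == "__main__":
    include = {"Sales"}
    exclude = {"Sales/Archive"}
    for path in ("Sales/Orders", "Sales/Archive/2019", "Finance/Ledger"):
        print(path, is_crawled(path, include, exclude))
```

With these hypothetical selections, Sales/Orders is crawled, Sales/Archive/2019 is skipped because the exclusion wins, and Finance/Ledger is skipped because it is not in the include selection.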

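To prepare the compressed archive of parameter files mentioned in the advanced options, you can bundle the files you downloaded from your Secure Agent machines with Python's standard zipfile module. This is a minimal sketch under stated assumptions: the function name zip_parameter_files and the directory and archive names are hypothetical, and this guide doesn't specify a required folder layout inside the archive, so the sketch simply preserves the layout of the source directory.

```python
import zipfile
from pathlib import Path


def zip_parameter_files(source_dir: str, archive_path: str) -> list[str]:
    """Write every file under source_dir into a ZIP archive.

    Returns the archive member names, relative to source_dir.
    Assumption: keeping the source directory layout is acceptable.
    """
    source = Path(source_dir)
    archive = Path(archive_path)
    # Collect files first, skipping the archive itself if it already
    # exists inside the source directory.
    files = sorted(
        p for p in source.rglob("*")
        if p.is_file() and p.resolve() != archive.resolve()
    )
    with zipfile.ZipFile(archive, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for path in files:
            zf.write(path, arcname=path.relative_to(source).as_posix())
    return [p.relative_to(source).as_posix() for p in files]


if __name__ == "__main__":
    # Hypothetical paths; replace them with your own.
    members = zip_parameter_files("parameter_files", "parameter_files.zip")
    print(f"Archived {len(members)} file(s)")
```

Upload the resulting ZIP file in the advanced options of the crawler configuration.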
Run crawler

Execute the crawler to discover and catalog your Informatica CDI assets.

  1. To run the crawler immediately, select Run.

  2. To schedule the crawler to run hourly, daily, weekly, or monthly, select Schedule & Run.

After the crawler completes, you can view the assets on Atlan's asset page.
