Skip to main content

How can Atlan generate upstream lineage from the data warehouse layer?

Atlan has 2 API endpoints to generate lineage:

Atlan generates upstream lineage in the following ways:

  • Custom jobs:
    • Constraint - these jobs do not have an interface for Atlan to extract lineage. For example, custom Python jobs that move data from RDBMS to S3 to Snowflake.
    • Recommended path - use the lineage API to push source-to-target mappings to Atlan.
  • Data integration tools: 
    • Constraint - these tools have historically not had metadata APIs for Atlan to extract lineage.
    • Recommended path - Atlan has released support for data integration tools that have released metadata APIs, such as Fivetran.
  • Orchestration tools: 
    • Constraint - you need to run DAGs or jobs to catalog assets and lineage in Atlan, unlike other workflows that catalog assets after a workflow run.
    • Recommended path - use the lineage API to either create lineage directly or using OpenLineage.