Skip to main content

115 docs tagged with "lineage"

View all tags

Alert propagation

Propagate data-quality alerts downstream through asset lineage to surface impact across the catalog. Reference for workflow configuration, scope rules, and propagation-behavior settings.

Alteryx

Integrate, catalog, and govern Alteryx assets in Atlan using OpenLineage.

Build lineage from CSV

Import lineage relationships between assets from a CSV file using the Lineage Builder app. Create lineage connections that include transformation processes, ETL workflows, dbt models, and custom data pipelines that aren't automatically discovered by connectors.

Column Level Lineage

Data lineage shows the upstream and downstream dependencies of an asset. For a more granular view of these dependencies, you can view column-level lineage.

Crawl Dagster assets

Capture lineage from Dagster assets by creating a listener in Atlan. Extract data orchestration metadata and asset dependencies from your Dagster instance.

Crawl SageMaker

Crawl lineage from AWS SageMaker to catalog machine learning pipelines, jobs, models, and datasets. Extract ML workflow lineage after configuring AWS credentials.

Crawl Snowflake AI models

Discover, catalog, and build lineage for AI models registered in the Snowflake Model Registry using Atlan's Snowflake connector.

Create data products

Create data products to help data consumers discover and work with data assets. Setup requires the products module to be enabled and domain policy permissions for the specific domain or subdomain.

Dagster

Integrate, catalog, and visualize Dagster lineage in Atlan.

Data Pipelines

Learn how to connect your data pipelines to Atlan. Explore ETL tools, workflow orchestration, and lineage tracking to build a comprehensive view of your data movement.

DataStax Enterprise

Catalog and govern DataStax Enterprise assets in Atlan. Build asset- and column-level lineage for your distributed data.

Download and export lineage

View, download, and export lineage reports to analyze data flows and impact. Create CSV impact reports, export lineage graphs as images, and share lineage data to Google Sheets and Microsoft Excel for impact analysis and root cause analysis.

Download impacted assets in Google Sheets

Once you've [connected Atlan with Google Sheets](/product/integrations/collaboration/spreadsheets/how-tos/integrate-atlan-with-google-sheets), you can download impacted assets in Google Sheets. This can help you assess the downstream impact of any changes made to an upstream asset for [impact analysis](/product/capabilities/lineage/concepts/what-is-lineage#impact-analysis).

Dynamic mapping task lineage

How Atlan resolves source fields, target fields, and field-level lineage for Informatica CDI dynamic mapping tasks

Enrich Power BI OLS datasets

Learn how to create missing Power BI tables and columns blocked by Object Level Security and generate upstream lineage to your data warehouse using the PBI OLS Dataset Enricher app with BIM files from S3.

Generate lineage

Enable query logging and configure your ClickHouse instance so the ClickHouse Miner workflow can extract query history and generate lineage.

Generate lineage

Create an ingestion workflow in Atlan to extract Oracle query history and generate lineage for existing Oracle assets.

Generate lineage

Enable pg_stat_statements and configure your PostgreSQL instance so the PostgreSQL Miner workflow can extract query history and generate lineage.

Generate lineage between assets

Automatically create lineage between assets across two connections when their names match or follow a consistent pattern. The Lineage Generator (no transformations) app creates direct lineage relationships for data flows between databases, warehouses, or storage systems.

Generate lineage from S3 queries

Extract lineage from SQL query history stored in Amazon S3 using the Generic Miner app. Generate table-level and column-level lineage for existing connections when source systems don't expose complete query history or when query logs are retained externally.

Generate Power BI columns to dataset lineage

Learn how to generate column-to-dataset lineage for Microsoft Power BI by using the PowerBI Columns -> Dataset lineage app with report.json files stored in cloud object storage.

Integrate Astronomer/OpenLineage

Learn how to integrate Astronomer/OpenLineage with Atlan by configuring the connection, setting environment variables, and verifying connectivity.

Integrate Google Cloud Composer/OpenLineage

To integrate Google Cloud Composer/OpenLineage with Atlan, complete the following steps. To learn more about OpenLineage, refer to [OpenLineage configuration and facets](/product/connections/references/openlineage-configuration-and-facets).

Lineage

Frequently asked questions about Talend lineage in Atlan

Lineage

[Data lineage](/product/capabilities/lineage/how-tos/view-lineage) captures how data moves across your data landscape. This information is useful to:.

Lineage

Track and visualize data lineage across your data landscape to understand data flow and dependencies.

Lineage analysis

Use Lakehouse to run advanced lineage analysis across systems and connectors.

Lineage and asset loader

Complete configuration reference for the Lineage and asset loader app, including S3 authentication methods, CSV file schema, asset creation controls, and workflow settings.

Lineage Builder

Complete configuration reference for the Lineage Builder app properties and settings, including CSV file format and asset handling options.

Lineage full export

Export complete lineage information from your metadata lakehouse for reuse across impact, dashboard, and root cause analysis.

LINEAGE_ADJACENCY_LIST table

Reference documentation for the LINEAGE_ADJACENCY_LIST table containing directed lineage edges between assets and processes

Load lineage and assets from CSV

Create lineage and optionally generate assets in Atlan from a source-to-target CSV mapping file stored in Amazon S3. The Lineage and asset loader app reads CSV mappings, creates lineage relationships, and optionally creates missing source or target assets.

Metadata Propagator

Complete configuration reference for the Metadata Propagator app, including propagation direction, attribute selection, filters, and matching behavior.

Microsoft Dataverse crawler

Catalog Microsoft Dataverse entities and their relationships in Atlan as lineage. Reference for crawler configuration, entity selection, relationship mapping, and sync options.

Mine ClickHouse

Once you have [crawled assets from ClickHouse](/apps/connectors/database/clickhouse/how-tos/crawl-clickhouse), you can mine its query history to construct lineage.

Mine PostgreSQL

Once you have [crawled assets from PostgreSQL](/apps/connectors/database/postgresql/how-tos/crawl-postgresql), you can mine its query history to construct lineage.

Monitor data domains

Monitor data domains using the Statistics tab to track data products, enrichment metrics, creation trends, and domain usage. Setup requires the products module enabled and domain owner or admin permissions in Atlan.

Orchestration tasks attribute transfer

Complete configuration reference for the Orchestration tasks attribute transfer app, including connector selection, custom metadata mapping, transfer direction, and filter options.

PBI OLS Dataset Enricher

Complete configuration reference for the PBI OLS Dataset Enricher app, including S3 authentication, connection setup, and how it restores Power BI lineage broken by OLS.

Propagate metadata through lineage

Propagate metadata like owners, certificates, and tags from upstream assets to downstream dependents automatically using the Metadata Propagator app. The app follows existing lineage relationships to push governance metadata downstream without manual updates.

Routines and Process assets

Frequently asked questions about BigQuery routines and their relationships with Process assets for lineage generation

Set up Dagster

Configure Dagster integration with Atlan to enable asset and lineage capture from your Dagster assets

Set up lineage tables

Create helper lineage tables in your warehouse for advanced lineage queries that include direction (upstream/downstream), hop level, and asset names. These tables work with Lakehouse's native LINEAGE_ADJACENCY_LIST table.

Set up on-premises Databricks lineage extraction

The Docker-based databricks-extractor offline tool has been sunset. For on-premises or network-restricted Databricks lineage extraction, use Self-Deployed Runtime, Secure Agent, or direct connectivity via private link.

Set up SageMaker

Configure AWS credentials and permissions to connect Atlan to your SageMaker environment.

Source asset type

Override the asset type assigned to assets derived from source system metadata. Reference for supported asset types, qualified-name parsing rules, and configuration options.

Teradata

Catalog and govern Teradata assets in Atlan. Optionally mine query history to build lineage.

Transfer orchestration task attributes to assets

Propagate operational attributes from orchestration tasks and jobs to connected assets using the Orchestration tasks attribute transfer app. Transfer run status, schedules, run timestamps, and task links to upstream or downstream assets through lineage.

View event logs

Event logs help you track and debug events received from supported connectors, providing you with greater observability in Atlan. Event logs are currently stored in Atlan for 7 days.

View lineage

Explore data flows and transformations using the lineage graph. Navigate upstream and downstream assets, view column-level lineage, perform impact analysis, and visualize metadata context for dependencies and data quality signals.

What does Atlan crawl from Metabase?

During a Metabase crawl, Atlan extracts metadata for Collections, Dashboards, and Questions. Reference tables map each Metabase property to its Atlan asset.

What does Atlan crawl from Sigma?

During a Sigma crawl, Atlan extracts metadata for Workbooks, Pages, Data Elements, Data Element Fields, and Datasets. Reference tables map each Sigma property to its Atlan asset.