Skip to main content

310 docs tagged with "data"

View all tags

Access archived assets

Find and view archived assets in Atlan using the filter menu. Archived assets are soft-deleted and removed from search results by default, but can be recovered and accessed through discovery filters.

Add contract impact analysis in GitHub

Add contract impact analysis to GitHub pull requests using the Atlan GitHub Action. Setup requires an API token or OAuth client with persona and metadata policy permissions.

Add custom metadata

Add custom metadata fields to assets to store organization-specific information. Setup requires admin users to create custom metadata structures first, then users with edit permissions can populate values.

Add descriptions

Add descriptions to assets, columns, and tables in Atlan to provide contextual information. Descriptions support Markdown syntax and can be crawled from sources, viewed in asset sidebars, and searched using property filters.

Add options

:::warning Who can do this? You must be an admin user in Atlan to create options for custom metadata properties.

Atlan AI security

Security and compliance information for Atlan AI, including AI architecture, data handling, encryption, model management, and compliance frameworks.

Attach a tag

Attach tags to assets in Atlan to identify key characteristics and group assets for usage or data protection. Supports tags from imported sources and native tags.

Automate data profiling

Create profiling playbooks to scan data assets at scale, identify data quality issues, and improve asset quality. Setup requires admin user permissions in Atlan.

Can I connect to any source with an ODBC/JDBC driver?

A number of Atlan's [supported connectors](/product/connections/references/connectors-and-capabilities) use a JDBC- or REST API-based approach for metadata extraction. If you are attempting to connect to a source with no native integration, [contact Atlan support](/support/submit-request) to share more details about your use case.

Can I query any DW/DL?

You can query any data warehouse (DW) or data lake (DL) if the integration is supported via Atlan's [supported sources](/product/connections/references/supported-sources). Once integrated, you can query the underlying data using the [Insights](/product/capabilities/insights/how-tos/query-data) feature.

Can I turn off sample data preview for the entire organization?

Atlan recommends that you turn off sample data preview at a connection level. For example, you can configure the [Snowflake crawler](/apps/connectors/data-warehouses/snowflake/how-tos/crawl-snowflake) to prevent users from previewing any Snowflake data.

Connect on-premises databases to Kubernetes

You can configure and use [Atlan's metadata-extractor tool](/apps/connectors/database/on-premises-databases/how-tos/set-up-on-premises-database-access) to extract metadata from on-premises databases with Kubernetes deployment architecture, as an alternative to using Docker Compose.

Crawl Aiven Kafka

Extract metadata from Aiven Kafka to catalog topics, partitions, and consumer groups. Discover and govern your Kafka messaging infrastructure after configuring permissions.

Crawl AlloyDB for PostgreSQL

Extract metadata from your AlloyDB for PostgreSQL database and establish a connection between Atlan and your database

Crawl Amazon Athena

Extract metadata from Amazon Athena to catalog databases, tables, and views stored in S3. Discover and govern your SQL query data lake assets after configuring AWS permissions.

Crawl Amazon DynamoDB

Extract metadata from Amazon DynamoDB to catalog tables, items, and attributes. Extract asset information and column-level details after configuring AWS permissions.

Crawl Amazon MSK

Extract metadata from Amazon MSK to catalog topics, partitions, and consumer groups. Discover and govern your Kafka messaging infrastructure after configuring permissions.

Crawl Amazon QuickSight

Extract metadata from Amazon QuickSight to catalog dashboards, analyses, and datasets. Discover and govern your BI assets after configuring permissions.

Crawl Amazon Redshift

Crawl metadata from Amazon Redshift to catalog tables, schemas, views, and columns. Extract lineage and column-level asset details after configuring user access permissions.

Crawl AWS Glue

Crawl metadata from AWS Glue Data Catalog to catalog jobs, workflows, tables, and data transformations. Extract lineage information after configuring AWS access permissions.

Crawl BigID

Configure and run the Atlan BigID workflow to crawl metadata from BigID.

Crawl ClickHouse

Create a ClickHouse crawler workflow in Atlan to extract metadata from your database.

Crawl CrateDB

Configure and run the CrateDB crawler to extract metadata from your database

Crawl Cyera

Crawl classification metadata from Cyera to enrich assets with data sensitivity and privacy information. Extract data sensitivity classifications and security issues.

Crawl Databricks

Crawl metadata from Databricks to catalog tables, schemas, views, and volumes. Extract lineage and column information after configuring access permissions and authentication.

Crawl Databricks AI models

Discover and catalog AI models registered in the Databricks Unity Catalog Model Registry using Atlan's Databricks connector.

Crawl DataStax Enterprise

Extract metadata from DataStax Enterprise to catalog databases, tables, columns, and keyspaces. Discover and govern your NoSQL data assets after configuring user permissions.

Crawl dbt

Crawl metadata from dbt Cloud or dbt Core to catalog models, tests, and documentation. Extract transformation lineage and dbt-generated metadata after configuring project access.

Crawl Domo

Extract metadata from Domo to catalog dashboards, datasets, cards, and pages. Discover and govern your BI assets after configuring permissions.

Crawl Dremio

Crawl metadata from Dremio to catalog physical datasets, virtual datasets, folders, and spaces. Extract lineage and asset information from your data lakehouse.

Crawl Fivetran

Enrich Atlan with Fivetran metadata to track data pipeline lineage and transformations. Configure the Fivetran Platform Connector and connect a supported data warehouse destination.

Crawl Google BigQuery

Crawl metadata from Google BigQuery to catalog datasets, tables, views, and columns. Extract lineage and asset details after completing prerequisite setup.

Crawl Hightouch

Crawl metadata from Hightouch to catalog models and syncs. Extract reverse ETL lineage between data warehouse sources and operational business tools after configuring API credentials.

Crawl Hive

Extract metadata from Hive to catalog databases, tables, views, and columns. Extract lineage and asset information after configuring user permissions.

Crawl IBM Cognos Analytics

Extract metadata from IBM Cognos Analytics to catalog reports, queries, and dashboards. Discover and govern your BI assets after configuring user permissions.

Crawl Iceberg

Crawl metadata from Iceberg to catalog tables, namespaces, and columns. Extract table metadata and lineage from your Iceberg data lakehouse catalogs.

Crawl Looker

Crawl metadata from Looker to catalog dashboards, looks, explores, and data sources. Extract usage and lineage information after configuring user permissions.

Crawl Matillion

Crawl metadata from Matillion to catalog jobs, tasks, pipelines, and transformations. Extract lineage information after configuring user permissions and connectivity.

Crawl Metabase

Extract metadata from Metabase to catalog dashboards, questions, and databases. Discover and govern your BI assets after configuring user permissions.

Crawl Microsoft Azure Cosmos DB

Extract metadata from Microsoft Azure Cosmos DB to catalog databases, containers, and documents. Discover and govern your NoSQL data assets after configuring permissions.

Crawl Microsoft Azure Data Factory

Once you have [configured the Microsoft Azure Data Factory permissions](/apps/connectors/etl-tools/microsoft-azure-data-factory/how-tos/set-up-microsoft-.

Crawl Microsoft Azure Event Hubs

Extract metadata from Microsoft Azure Event Hubs to catalog topics, partitions, and consumer groups. Discover and govern your messaging infrastructure after configuring permissions.

Crawl Microsoft Azure Synapse Analytics

Extract metadata from Microsoft Azure Synapse Analytics to catalog databases, schemas, tables, and views. Discover and govern your cloud data warehouse assets after configuring permissions.

Crawl Microsoft Power BI

Crawl metadata from Microsoft Power BI to catalog workbooks, reports, dashboards, and data sources. Extract lineage and usage metrics after configuring service principal authentication.

Crawl Microsoft SQL Server

Crawl metadata from Microsoft SQL Server to catalog databases, schemas, tables, and views. Extract column details and enable data discovery, lineage tracking, and governance after configuring user permissions.

Crawl MicroStrategy

Extract metadata from MicroStrategy to catalog reports, dashboards, and metrics. Discover and govern your BI assets after configuring authentication.

Crawl Mode

Extract metadata from Mode to catalog reports, charts, and datasets. Discover and govern your BI assets after configuring user permissions.

Crawl MongoDB (self-managed)

Once you have [configured the MongoDB permissions](/apps/connectors/database/mongodb/onprem/how-tos/set-up-mongodb-onprem), you can establish a connection between Atlan and MongoDB.

Crawl MongoDB Atlas

Configure MongoDB Atlas connection and run the crawler to extract metadata from MongoDB into Atlan.

Crawl Monte Carlo

Crawl metadata from Monte Carlo to capture data quality anomalies and monitors. Extract data quality insights and incident tracking after configuring API credentials.

Crawl MySQL

Crawl metadata from MySQL to catalog databases, tables, schemas, and views. Extract column-level information and document your database structure after configuring user permissions.

Crawl on-premises databases

Extract metadata from on-premises databases to catalog tables, schemas, and views. Discover and govern your database assets after configuring the metadata-extractor tool.

Crawl on-premises Databricks

The Docker-based databricks-extractor offline tool has been sunset. For on-premises or network-restricted Databricks environments, use Self-Deployed Runtime, Secure Agent, or direct connectivity via private link.

Crawl on-premises IBM Cognos Analytics

Extract metadata from on-premises IBM Cognos Analytics to catalog reports, queries, and dashboards. Discover and govern your BI assets after configuring extractor tool.

Crawl on-premises Kafka

Extract metadata from on-premises Kafka to catalog topics, partitions, and consumer groups. Discover and govern your Kafka messaging infrastructure after configuring extractor tool.

Crawl on-premises Looker

Extract metadata from on-premises Looker to catalog dashboards, looks, explores, and data sources. Discover and govern your BI assets after configuring extractor tool.

Crawl on-premises Tableau

Extract metadata from on-premises Tableau to catalog workbooks, dashboards, and sheets. Discover and govern your BI assets after configuring extractor tool.

Crawl on-premises ThoughtSpot

Extract metadata from on-premises ThoughtSpot to catalog pinboards, answers, and liveboards. Discover and govern your BI assets after configuring extractor tool.

Crawl PrestoSQL

Extract metadata from PrestoSQL to catalog databases, tables, views, and columns. Extract query lineage and asset information after configuring user permissions.

Crawl Qlik Sense Cloud

Extract metadata from Qlik Sense Cloud to catalog applications, sheets, and visualizations. Discover and govern your BI assets after configuring permissions.

Crawl Qlik Sense Enterprise on Windows

Extract metadata from Qlik Sense Enterprise on Windows to catalog applications, sheets, and visualizations. Discover and govern your BI assets after configuring permissions.

Crawl Redash

Extract metadata from Redash to catalog queries, dashboards, and visualizations. Discover and govern your BI assets after configuring permissions.

Crawl Redpanda Kafka

Extract metadata from Redpanda Kafka to catalog topics, partitions, and consumer groups. Discover and govern your Kafka messaging infrastructure after configuring permissions.

Crawl Salesforce

Extract metadata from Salesforce to catalog objects, fields, and relationships. Discover and govern your CRM data assets after configuring user permissions.

Crawl SAP HANA

Extract metadata from SAP HANA to catalog databases, schemas, tables, views, and columns. Discover and govern your enterprise data assets after configuring user permissions.

Crawl Sigma

Extract metadata from Sigma to catalog workbooks, pages, and visualizations. Discover and govern your BI assets after configuring permissions.

Crawl Sisense

Extract metadata from Sisense to catalog dashboards, widgets, and datasets. Discover and govern your BI assets after configuring permissions.

Crawl Snowflake

Crawl metadata from Snowflake to catalog tables, schemas, views, and other objects. Extract lineage, column-level details, and documentation after configuring user permissions.

Crawl Snowflake AI models

Discover, catalog, and build lineage for AI models registered in the Snowflake Model Registry using Atlan's Snowflake connector.

Crawl Soda

Crawl metadata from Soda to capture data quality checks, measurements, and test results. Extract data quality insights after configuring API credentials.

Crawl Starburst Enterprise

Crawl metadata from Starburst Enterprise to catalog catalogs, schemas, tables, and data products. Extract domain definitions and lineage information after configuring user permissions.

Crawl Tableau

Crawl metadata from Tableau to catalog workbooks, dashboards, sheets, and data sources. Extract lineage and usage information after configuring user permissions.

Crawl Teradata

Extract metadata from Teradata to catalog databases, tables, views, and columns. Extract query lineage and asset information after configuring user permissions.

Crawl ThoughtSpot

Extract metadata from ThoughtSpot to catalog pinboards, answers, and liveboards. Discover and govern your BI assets after configuring permissions.

Crawl Trino

Crawl metadata from Trino to catalog schemas, tables, views, and columns. Extract distributed query lineage and asset information after configuring user permissions.

Create announcements

Adding an announcement to your data asset helps you call attention to an important feature or notify others about a change coming down the pipeline. Since announcements in Atlan display the time stamp and author information, you can easily identify whether an announcement is still relevant and who to ask for questions.

Data Models

Data models provide a framework to describe how data is structured, organized, and related within a system. It acts as a blueprint for organizations to design their business applications and processes. Data models can be of different types: relational, hierarchical, entity relationship, and network.

Data Pipelines

Learn how to connect your data pipelines to Atlan. Explore ETL tools, workflow orchestration, and lineage tracking to build a comprehensive view of your data movement.

Disable data access

:::warning Who can do this? You will need to be an admin user in Atlan to configure these options.

Discovery FAQs

Frequently asked questions about Atlan's Discovery capabilities.

Download impacted assets in Microsoft Excel

Once you've [connected Atlan with Microsoft Excel](/product/integrations/collaboration/spreadsheets/how-tos/integrate-atlan-with-microsoft-excel), you can download impacted assets in Microsoft Excel. This can help you assess the downstream impact of any changes made to an upstream asset for [impact analysis](/product/capabilities/lineage/concepts/what-is-lineage#impact-analysis).

Enable Snowflake OAuth

Atlan supports [Snowflake OAuth-based authentication](https://docs.snowflake.com/user-guide/oauth-snowflake-overview) for [Snowflake](/apps/connectors/data-ware.

Enable SSO for Amazon Redshift

You will need to [create a client application in Okta](https://help.okta.com/en-us/Content/Topics/Apps/Apps_App_Integration_Wizard_OIDC.htm) to use for [configuring the identity provider in AWS](/apps/connectors/data-warehouses/amazon-redshift/how-tos/enable-sso-for-amazon-redshift).

Enrich Atlan through dbt

Beyond the default mapped [dbt Cloud](/apps/connectors/etl-tools/dbt/references/what-does-atlan-crawl-from-dbt-cloud) or [dbt Core](/apps/connectors/etl-tools/dbt/references/what-does-atlan-crawl-from-dbt-core) properties, you can update any of Atlan's metadata attributes (except for `name`, `tenantId`, and `qualifiedName`) through your dbt model's `meta` property.

Extract lineage and usage from Databricks

Retrieve lineage from Unity Catalog and usage and popularity metrics from query history or system tables using REST API, offline, or system table extraction methods.

Extract on-premises Databricks lineage

The Docker-based databricks-extractor offline tool has been sunset. For on-premises or network-restricted Databricks lineage extraction, use Self-Deployed Runtime, Secure Agent, or direct connectivity via private link.

Find assets by usage

Data teams often lack clarity on which data assets can be considered trustworthy, whether these are frequently used, the freshness of the data itself, or how critical these are for enrichment and governance.

Generate Power BI columns to dataset lineage

Learn how to generate column-to-dataset lineage for Microsoft Power BI by using the PowerBI Columns -> Dataset lineage app with report.json files stored in cloud object storage.

Implement OpenLineage in Airflow operators

If you're using an Airflow operator supported by OpenLineage, the OpenLineage events will contain input and output details. This means that you do not have to modify your current DAG implementation and Atlan will be able to generate data lineage.

Integrate Amazon MWAA/OpenLineage

To learn more about OpenLineage, refer to [OpenLineage configuration and facets](/product/connections/references/openlineage-configuration-and-facets).

Integrate Apache Airflow/OpenLineage

To integrate Apache Airflow/OpenLineage with Atlan, complete the following steps. To learn more about OpenLineage, refer to [OpenLineage configuration and facets](/product/connections/references/openlineage-configuration-and-facets).

Integrate Apache Flink/OpenLineage

Atlan extracts job-level operational metadata from Apache Flink and generates job lineage through OpenLineage. To learn more about OpenLineage, refer to [OpenLineage configuration and facets](/product/connections/references/openlineage-configuration-and-facets).

Integrate Apache Spark/OpenLineage

Atlan extracts job-level operational metadata from Apache Spark and generates job lineage through OpenLineage. To learn more about OpenLineage, refer to [OpenLineage configuration and facets](/product/connections/references/openlineage-configuration-and-facets).

Integrate Atlan with Microsoft Excel

The Atlan add-in for Microsoft Excel makes it easy to enrich metadata in bulk for your data assets in Atlan. You can use the Atlan add-in for both the web and desktop versions of Microsoft Excel.

Integrate Jira Data Center

You will need to [configure an incoming link](https://confluence.atlassian.com/adminjiraserver/configure-an-incoming-link-1115659067.html) with an external application - in this case, Atlan. This will allow Atlan to access Jira data, which means that Jira will act as the OAuth provider.

Integrate ServiceNow

If your Atlan admin has [enabled the governance workflows and inbox module](/product/capabilities/governance/stewardship/how-tos/automate-data-governance) in your Atlan workspace, you can create a ServiceNow integration to allow your users to [grant or revoke data access](/product/capabilities/governance/stewardship/how-tos/automate-data-governance) for governed assets in Atlan or any other data source.

Link your account

To [export assets to and bulk enrich metadata from](/product/integrations/collaboration/spreadsheets/how-tos/export-assets) a supported spreadsheet tool,.

Link your ServiceNow account

To request or revoke data access through ServiceNow inside Atlan, you may first need to link your ServiceNow account. This is done automatically for the user that [set up the ServiceNow integration](/product/integrations/project-management/servicenow/how-tos/integrate-servicenow), but not for other users.

Manage Databricks tags

You must have a [Unity Catalog-enabled workspace](https://docs.databricks.com/en/data-governance/unity-catalog/get-started.html) and SQL warehouse configured to import Databricks tags in Atlan.

Manage Google BigQuery tags

Atlan imports your [Google BigQuery tags](https://docs.getdbt.com/references/resource-configs/tags) and lets you update your Google BigQuery assets with the imported tags. Note that object tagging in Google BigQuery currently requires [Enterprise edition or higher](https://cloud.google.com/bigquery/docs/editions-intro#editions_features).

Manage Snowflake tags

You can import your Snowflake tags to Atlan through one-way tag sync. The synced Snowflake tags will be matched to corresponding tags in Atlan through case-insensitive name match and your Snowflake assets will be enriched with their synced tags from Snowflake.

Migrate from dbt to Atlan action

The dbt-action is a custom action designed to perform impact analysis on changes to your dbt models in a [GitHub](/apps/connectors/etl-tools/dbt/how-tos/.

Mine Amazon Redshift

Once you have [crawled assets from Amazon Redshift](/apps/connectors/data-warehouses/amazon-redshift/how-tos/crawl-amazon-redshift), you can mine its query history to construct lineage and retrieve [usage and popularity metrics](/product/capabilities/usage-and-popularity/how-tos/interpret-usage-metrics).

Mine ClickHouse

Once you have [crawled assets from ClickHouse](/apps/connectors/database/clickhouse/how-tos/crawl-clickhouse), you can mine its query history to construct lineage.

Mine Google BigQuery

Once you have [crawled assets from Google BigQuery](/apps/connectors/data-warehouses/google-bigquery/how-tos/crawl-google-bigquery), you can mine its query history to construct lineage.

Mine PostgreSQL

Once you have [crawled assets from PostgreSQL](/apps/connectors/database/postgresql/how-tos/crawl-postgresql), you can mine its query history to construct lineage.

Mine query history

Mine query history from Microsoft SQL Server to construct lineage relationships between your database assets.

Mine Snowflake

Once you have [crawled assets from Snowflake](/apps/connectors/data-warehouses/snowflake/how-tos/crawl-snowflake), you can mine its query history to construct lineage.

Mine Teradata

Once you have [crawled assets from Teradata](/apps/connectors/database/teradata/how-tos/crawl-teradata), you can mine its query history to construct lineage.

Order workflows

The [order of operations](/product/connections/how-tos/order-workflows#order-of-operations) you run in Atlan is important. Follow the specific workflow sequence outlined below when crawling [data tools](/product/connections/references/supported-sources). The right order particularly ensures that lineage is constructed without needing to rerun crawlers.

Preflight checks for Amazon Redshift

Before [running the Amazon Redshift crawler](/apps/connectors/data-warehouses/amazon-redshift/how-tos/crawl-amazon-redshift), you can run [preflight chec.

Preflight checks for Anomalo

This check tests for the validity of the [host name URL and API key](/apps/connectors/observability/anomalo/how-tos/integrate-anomalo) you provided. If Atlan is unable to connect to your Anomalo instance, this may indicate that your credentials are either incorrect or invalid.

Preflight checks for Bigeye

This check tests for the validity of the [host name URL and API key](/apps/connectors/observability/bigeye/how-tos/ingest-bigeye-data) you provided. If Atlan is unable to connect to your Bigeye instance, this may indicate that your credentials are either incorrect or invalid.

Preflight checks for Databricks

Before [running the Databricks crawler](/apps/connectors/data-warehouses/databricks/how-tos/crawl-databricks), you can run [preflight checks](/product/co.

Preflight checks for Domo

Atlan uses the [DataSet API](https://developer.domo.com/portal/72ae9b3e80374-list-data-sets) to fetch dataset metadata from Domo.

Preflight checks for Google BigQuery

Each request requires an OAuth 2.0 access token generated via the [service account key](https://cloud.google.com/docs/authentication#service-accounts).

Preflight checks for Hive

Before [running the Hive crawler](/apps/connectors/database/hive/how-tos/crawl-hive), you can run [preflight checks](/product/connections/concepts/what-a.

Preflight checks for Metabase

Before [running the Metabase crawler](/apps/connectors/business-intelligence/metabase/how-tos/crawl-metabase), you can run [preflight checks](/product/co.

Preflight checks for Microsoft Azure Synapse Analytics

This check is performed for both [basic](/apps/connectors/data-warehouses/microsoft-azure-synapse-analytics/how-tos/set-up-microsoft-azure-synapse-analytics) and [service principal](/apps/connectors/data-warehouses/microsoft-azure-synapse-analytics/how-tos/set-up-microsoft-azure-synapse-analytics) authentication method.

Preflight checks for Mode

Before [running the Mode crawler](/apps/connectors/business-intelligence/mode/how-tos/crawl-mode), you can run [preflight checks](/product/connections/co.

Preflight checks for MongoDB Atlas

Before running the MongoDB Atlas crawler, you can run preflight checks to verify the connected user has the required MongoDB privileges for metadata extraction.

Preflight checks for MySQL

Before [running the MySQL crawler](/apps/connectors/database/mysql/how-tos/crawl-mysql), you can run [preflight checks](/product/connections/concepts/wha.

Preflight checks for Oracle

Before [running the Oracle crawler](/apps/connectors/database/oracle/how-tos/crawl-oracle), you can run [preflight checks](/product/connections/concepts/.

Preflight checks for PostgreSQL

Before [running the PostgreSQL crawler](/apps/connectors/database/postgresql/how-tos/crawl-postgresql), you can run [preflight checks](/product/connectio.

Preflight checks for PrestoSQL

Before [running the PrestoSQL crawler](/apps/connectors/database/prestosql/how-tos/crawl-prestosql), you can run [preflight checks](/product/connections/.

Preflight checks for Snowflake

Before [running the Snowflake crawler](/apps/connectors/data-warehouses/snowflake/how-tos/crawl-snowflake), you can run [preflight checks](/product/conne.

Preflight checks for Teradata

Before [running the Teradata crawler](/apps/connectors/database/teradata/how-tos/crawl-teradata), you can run [preflight checks](/product/connections/con.

Preflight checks for Trino

Before [running the Trino crawler](/apps/connectors/database/trino/how-tos/crawl-trino), you can run [preflight checks](/product/connections/concepts/wha.

Provide SSL certificates

SSL (Secure Sockets Layer) encryption helps establish a secure connection between your data source and Atlan. Atlan currently supports SSL certificates for [crawling Tableau](/apps/connectors/business-intelligence/tableau/how-tos/crawl-tableau) and [crawling Matillion](/apps/connectors/etl-tools/matillion/how-tos/crawl-matillion).

Security

The Secure Agent is designed with multiple security controls to protect metadata, credentials, and communication between systems. This document outlines its security mechanisms across authentication, encryption, container security, network security, and logging and monitoring.

Security and Compliance

Complete guide to Atlan's security features, compliance certifications, and data protection capabilities.

Set up Aiven Kafka

Atlan supports the [S3 extraction method](/apps/connectors/messaging/on-premises-event-buses/how-tos/set-up-on-premises-kafka-access) for fetching metadata from Aiven Kafka. This method uses Atlan's kafka-extractor tool to fetch metadata.

Set up Amazon S3

Create AWS IAM permissions and credentials for Atlan to access and catalog your S3 buckets and objects.

Set up an Azure private network link to Databricks

For all details, see [Databricks documentation](https://learn.microsoft.com/en-us/azure/databricks/administration-guide/cloud-configurations/azure/private-link-simplified?source=recommendations#create-the-workspace-and-private-endpoints-in-the-azure-portal-ui).

Set up Anomalo

Atlan supports the API authentication method for fetching metadata from [Anomalo](https://docs.anomalo.com/integrations/atlan-integration). This method uses an API key to fetch metadata.

Set up BigID

Create a BigID system user and API token for Atlan integration.

Set up ClickHouse

Configure a dedicated ClickHouse user with read-only access so Atlan can connect and extract metadata.

Set up Confluent Kafka

Atlan supports the API authentication method for fetching metadata from Confluent Kafka. This method uses an API key and API secret to fetch metadata.

Set up Cyera

Create a Cyera API token and obtain credentials for Atlan integration.

Set up Databricks

Atlan supports three authentication methods for fetching metadata from Databricks. You can set up any of the following authentication methods:.

Set up dbt Cloud

Configure authentication tokens in dbt Cloud to enable Atlan to fetch and enrich your assets with dbt metadata.

Set up Domo

:::warning Who can do this? You'll need your Domo administrator to complete these steps - you may not have access yourself.

Set up Dremio

Configure Dremio connection and authentication for Atlan integration.

Set up Fivetran

Configure the Fivetran Platform Connector and destination to enable Atlan to extract Fivetran metadata and logs.

Set up Google BigQuery

You must be a Google BigQuery administrator to run these commands. For more information, see [Google Cloud's Granting, changing, and revoking access to resources](https://cloud.google.com/iam/docs/granting-changing-revoking-access).

Set up Google Cloud Knowledge Catalog

Configure Google Cloud Knowledge Catalog connection and authentication by creating a service account with required permissions, or using Workload Identity Federation.

Set up Hightouch

Configure Hightouch roles, groups, and API keys to connect with Atlan.

Set up Hive

Configure permissions and authentication for Hive to enable metadata extraction in Atlan.

Set up IBM Cognos Analytics

:::warning Who can do this? You must be an IBM Cognos Analytics administrator to complete these steps - you may not have access yourself.

Set up Iceberg

Configure Generic REST Catalog and BigLake Metastore authentication for Iceberg integration in Atlan.

Set up Microsoft Azure Cosmos DB

If your Microsoft Azure Cosmos DB deployment includes a mix of vCore- and RU-based accounts, you must configure both to fetch metadata. You can then use the _vCore and RU_ deployment option to [crawl your Microsoft Azure Cosmos DB assets](/apps/connectors/database/microsoft-azure-cosmos-db/how-tos/crawl-microsoft-azure-cosmos-db).

Set up Microsoft Azure Data Factory

Atlan supports service principal authentication for fetching metadata from Microsoft Azure Data Factory. This method requires a client ID, client secret, and tenant ID to fetch metadata.

Set up Microsoft SQL Server

Configure authentication and permissions for Microsoft SQL Server to enable Atlan to crawl metadata from your database.

Set up MicroStrategy

Atlan supports basic authentication and API token authentication methods for fetching metadata from MicroStrategy.

Set up MongoDB (self-managed)

Atlan supports SCRAM authentication (SCRAM-SHA-1 and SCRAM-SHA-256) for fetching metadata from MongoDB. This method uses a [username and password](#create-database-user) to fetch metadata.

Set up MongoDB Atlas

Atlan supports the basic authentication method for fetching metadata from MongoDB. This method uses a username and password to fetch metadata

Set up Monte Carlo

:::warning Who can do this? You will probably need your Monte Carlo [account owner](https://docs.getmontecarlo.com/docs/authorizationmanaged-roles-and-groups).

Set up MySQL

:::warning Who can do this? You probably need your MySQL administrator to run these commands - you may not have access yourself.

Set up on-premises database access

In such cases you may want to decouple the extraction of metadata from its ingestion in Atlan. This approach gives you full control over your resources and metadata transfer to Atlan.

Set up on-premises Databricks access

The Docker-based databricks-extractor offline tool has been sunset. For on-premises or network-restricted Databricks environments, use Self-Deployed Runtime, Secure Agent, or direct connectivity via private link.

Set up on-premises Kafka access

In some cases you won't be able to expose your Kafka instance for Atlan to crawl and ingest metadata. For example, this may happen when security requirements restrict access to sensitive, mission-critical data.

Set up on-premises Looker access

In some cases you won't be able to expose your Looker instance for Atlan to crawl and ingest metadata. For example, this may happen when security requirements restrict access to sensitive, mission-critical data.

Set up on-premises Microsoft Azure Synapse Analytics miner access

In some cases you will not be able to expose your Microsoft Azure Synapse Analytics instance for Atlan to [mine query history from the Query Store](/apps/connectors/data-warehouses/microsoft-azure-synapse-analytics/how-tos/set-up-microsoft-azure-synapse-analytics). For example, this may happen when security requirements restrict access to sensitive, mission-critical data.

Set up on-premises Tableau access

In some cases you may not be able to expose your Tableau instance for Atlan to crawl and ingest metadata. For example, this may happen when security requirements restrict access to sensitive, mission-critical data.

Set up on-premises Teradata miner access

In some cases you will not be able to expose your Teradata instance for Atlan to mine query history. For example, this may happen when security requirements restrict access to sensitive, mission-critical data.

Set up on-premises ThoughtSpot access

In some cases you will not be able to expose your ThoughtSpot instance for Atlan to crawl and ingest metadata. For example, this may happen when security requirements restrict access to sensitive, mission-critical data.

Set up PostgreSQL

:::warning Who can do this? You will probably need your PostgreSQL administrator to run these commands - you may not have access yourself.

Set up Redash

:::warning Who can do this? You will probably need your Redash administrator to complete the following steps - you may not have access yourself.

Set up Redpanda Kafka

Atlan supports the [S3 extraction method](/apps/connectors/messaging/on-premises-event-buses/how-tos/set-up-on-premises-kafka-access) for fetching metadata from Redpanda Kafka. This method uses Atlan's kafka-extractor tool to fetch metadata.

Set up SAP HANA

:::warning Who can do this? You will probably need your SAP HANA administrator to run these commands - you may not have access yourself.

Set up Sisense

Atlan supports the basic authentication method for fetching metadata from Sisense. This method uses a username and password to fetch metadata.

Set up Snowflake

:::warning Who can do this? You need your Snowflake administrator to run these commands - you may not have access yourself. :::.

Set up Soda

:::warning Who can do this? You will need your [Soda Cloud administrator](https://docs.soda.io/soda-cloud/roles-and-rights.html) to complete these steps -.

Set up Starburst Enterprise

Configure authentication and permissions for Starburst Enterprise to enable Atlan to crawl metadata from your instance.

Set up Tableau

:::warning Who can do this? You probably need your Tableau administrator to run these commands - you may not have access yourself.

Set up Teradata

:::warning Who can do this? You need your Teradata administrator to run these commands - you may not have access yourself.

Set up Trino

Configure authentication and permissions for Trino to enable Atlan to crawl metadata from your database.

Star assets

Star your most-used assets to bookmark them for quick access. Starred assets appear in a personalized widget, can be sorted by star count, and can trigger notifications for metadata updates via Slack or Microsoft Teams.

Structured search

Use Atlan's structured search to find assets with precision using keywords, filters, and facets. Filter by source, certification, owner, tags, and save searches for sharing and quick access.

Tags and Metadata Management

Complete guide to managing tags, classifications, and metadata in Atlan for effective data governance and organization.

Update column metadata in Google Sheets

Once you've [connected Atlan with Google Sheets](/product/integrations/collaboration/spreadsheets/how-tos/integrate-atlan-with-google-sheets), you can import the column metadata for all your data assets in Atlan and make changes to them directly in Google Sheets.

Update column metadata in Microsoft Excel

Once you've [connected Atlan with Microsoft Excel](/product/integrations/collaboration/spreadsheets/how-tos/integrate-atlan-with-microsoft-excel), you can import the column metadata for all your data assets in Atlan and make changes to them directly in Microsoft Excel.

Use filters menu

Refine asset searches using filters by source, type, domain, certification status, ownership, and custom metadata. Filters help you discover relevant assets faster and can be bookmarked, shared, or used to export assets in bulk.

view data models

Once you have [ingested your ER model assets in Atlan](/product/capabilities/data-models/concepts/what-are-data-models), you can:.

View query logs

You can also view additional details and run status for each query and use filters to track specific queries. Query logs are persisted throughout the lifecycle of the Atlan instance for your organization.

What are Power BI processes on the lineage graph?

Note that process entities may not have a counterpart entity in Microsoft Power BI. Consider these to be nodes that you can enrich with metadata to describe the process or relationship between two Microsoft Power BI assets.

What does Atlan crawl from Amazon Athena?

During an Amazon Athena crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Columns. Reference tables map each Athena property to its Atlan asset.

What does Atlan crawl from Amazon DynamoDB?

Atlan crawls and maps the following assets and properties from Amazon DynamoDB. Atlan also currently supports lineage between Amazon DynamoDB as a source to supported data warehouses as destinations, as enriched by Fivetran.

What does Atlan crawl from Amazon Redshift?

During an Amazon Redshift crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Columns. Reference tables map each Redshift property to its Atlan asset.

What does Atlan crawl from Anomalo?

Once you have [integrated Anomalo](/apps/connectors/observability/anomalo/how-tos/integrate-anomalo), Atlan will receive webhook events when checks are executed in Anomalo. These checks will be cataloged in Atlan to create a relationship with existing assets using the association information from the check.

What does Atlan crawl from AWS Glue?

During an AWS Glue crawl, Atlan extracts metadata for Catalogs, Databases, Schemas, Tables, and Jobs. Reference tables map each Glue property to its Atlan asset.

What does Atlan crawl from ClickHouse?

During a ClickHouse crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Columns. Reference tables map each ClickHouse property to its Atlan asset.

What does Atlan crawl from Databricks?

During a Databricks crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, Materialized Views, Volumes, External Locations, and AI Models. Reference tables map each Databricks property to its Atlan asset.

What does Atlan crawl from Fivetran?

During a Fivetran crawl, Atlan creates lineage between source and destination data assets and extracts metadata for Fivetran Connectors. Reference tables map each Fivetran Connector property to its Atlan asset.

What does Atlan crawl from Google BigQuery?

During a Google BigQuery crawl, Atlan extracts metadata for Projects, Datasets, Tables, Views, and Materialized Views. Reference tables map each BigQuery property to its Atlan asset.

What does Atlan crawl from Hive?

During a Hive crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, Materialized Views, and Columns. Reference tables map each Hive property to its Atlan asset.

What does Atlan crawl from MySQL?

During a MySQL crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Stored Procedures. Reference tables map each MySQL property to its Atlan asset.

What does Atlan crawl from Oracle?

During an Oracle crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Procedures. Reference tables map each Oracle property to its Atlan asset.

What does Atlan crawl from PostgreSQL?

During a PostgreSQL crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Functions. Reference tables map each PostgreSQL property to its Atlan asset.

What does Atlan crawl from SAP BW/4HANA?

What does Atlan crawl from SAP BW/4HANA? <Badge variant="preview" text="Private Preview" link="/get-started/references/product-release-stages#private-preview" />

What does Atlan crawl from SAP ECC?

What does Atlan crawl from SAP ECC? <Badge variant="preview" text="Public Preview" link="/get-started/references/product-release-stages#public-preview" />

What does Atlan crawl from SAP S/4HANA?

What does Atlan crawl from SAP S/4HANA? <Badge variant="preview" text="Public Preview" link="/get-started/references/product-release-stages#public-preview" />

What does Atlan crawl from Sisense?

During a Sisense crawl, Atlan extracts metadata for Dashboards, Widgets, Data Models, Data Model Tables, and Folders. Reference tables map each Sisense property to its Atlan asset.

What does Atlan crawl from Snowflake?

During a Snowflake crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Procedures. Reference tables map each Snowflake property to its Atlan asset.

What does Atlan crawl from Soda?

Atlan crawls datasets and then filters out all the datasets without any checks. It then crawls the checks associated with each of the datasets with checks from Soda. These checks are cataloged in Atlan to create a relationship with existing assets using the association information from the dataset.

What does Atlan crawl from Tableau?

During a Tableau crawl, Atlan extracts metadata for Projects, Workbooks, Worksheets, Dashboards, Datasources, and Flows. Reference tables map each Tableau property to its Atlan asset.

What does Atlan crawl from Teradata?

During a Teradata crawl, Atlan extracts metadata for Schemas, Tables, Views, and Columns. Reference tables map each Teradata property to its Atlan asset.

What is the default permission for a glossary?

By default, users can search and discover [glossaries](/product/capabilities/governance/glossary/concepts/what-is-a-glossary) in Atlan, irrespective of their user role. The rationale being that glossaries are meant to be accessible to all users who want to understand business context. You can define a [glossary policy](/product/capabilities/governance/custom-metadata/how-tos/control-access-metadata-data#glossary-policies) to control what users can do with glossary metadata and [create a persona](/product/capabilities/governance/access-control/how-tos/create-a-persona) to curate edit access.

What lineage does Atlan extract from Matillion?

Atlan uses Matillion's metadata API to generate lineage associated with [Matillion connectors](https://www.matillion.com/connectors). This is particularly useful for creating lineage between different tools.

What lineage does Atlan extract from Microsoft Power BI?

This document helps you understand how Atlan generates lineage to upstream SQL sources for your Microsoft Power BI assets using a custom query parser, and the steps you can take while developing reports and dashboards in Microsoft Power BI to create seamless lineage generation.

When does Atlan become a personal data processor or subprocessor?

Atlan personnel do not have access to any customer instance unless specifically provided by the customer. Accordingly, in the event that a customer instance contains personal data and Atlan personnel are provided access to that instance, Atlan may act as a personal data processor. In addition, depending on whether the customer is a data controller or processor, Atlan may act as a data processor or subprocessor, respectively.

Why is lineage available for table level but not column level?

The home icon on top of any asset on the [lineage graph](/product/capabilities/lineage/how-tos/view-lineage) indicates the current asset in focus. The lineage view will be different based on the asset you're viewing. To view column-level lineage for [supported sources](/product/connections/references/supported-sources), click **view columns** and then select a column to view data flows for that particular asset.

Workflows and Data Processing

Everything about managing data workflows, understanding lineage generation, and optimizing data processing pipelines in Atlan.