279 docs tagged with "crawl"

Add descriptions

You can add descriptions to your assets in Atlan, including tables, views, and individual columns. You can even add a description in the form of a [README](/product/integrations). Doing so enriches your data asset with the relevant contextual information.

Add options

:::warning Who can do this? You must be an admin user in Atlan to create options for custom metadata properties.

Automate data profiling

Create profiling playbooks to scan data assets at scale, identify data quality issues, and improve asset quality. Setup requires admin user permissions in Atlan.

Can I connect to any source with an ODBC/JDBC driver?

A number of Atlan's [supported connectors](/product/connections/references/connectors-and-capabilities) use a JDBC- or REST API-based approach for metadata extraction. If you are attempting to connect to a source with no native integration, [contact Atlan support](/support/submit-request) to share more details about your use case.

Can I turn off sample data preview for the entire organization?

Atlan recommends that you turn off sample data preview at a connection level. For example, you can configure the [Snowflake crawler](/apps/connectors/data-warehouses/snowflake/how-tos/crawl-snowflake) to prevent users from previewing any Snowflake data.

Crawl Aiven Kafka

Extract metadata from Aiven Kafka to catalog topics, partitions, and consumer groups. Discover and govern your Kafka messaging infrastructure after configuring permissions.

Crawl AlloyDB for PostgreSQL

Extract metadata from your AlloyDB for PostgreSQL database and establish a connection between Atlan and your database.

Crawl Amazon Athena

To crawl metadata from Amazon Athena, review the [order of operations](/product/connections/how-tos/order-workflows) and then complete the following steps.

Crawl Amazon DynamoDB

Extract metadata from Amazon DynamoDB to catalog tables, items, and attributes. Extract asset information and column-level details after configuring AWS permissions.

Crawl Amazon MSK

To crawl metadata from Amazon MSK, review the [order of operations](/product/connections/how-tos/order-workflows) and then complete the following steps.

Crawl Amazon QuickSight

Extract metadata from Amazon QuickSight to catalog dashboards, analyses, and datasets. Discover and govern your BI assets after configuring permissions.

Crawl Amazon Redshift

Once you have configured the [Amazon Redshift access permissions](/apps/connectors/data-warehouses/amazon-redshift/how-tos/set-up-amazon-redshift), you can establish a connection between Atlan and Amazon Redshift.

Crawl Anaplan

Crawl metadata from Anaplan to catalog workspaces, models, dimensions, and planning modules. Extract model structure and planning asset information after configuring user access.

Crawl Apache Kafka

Crawl metadata from Apache Kafka to catalog topics, consumer groups, and clusters. Extract schema information from Confluent Schema Registry after configuring cluster permissions.

Crawl AWS Glue

Crawl metadata from AWS Glue Data Catalog to catalog jobs, workflows, tables, and data transformations. Extract lineage information after configuring AWS access permissions.

Crawl BigID

Configure and run the Atlan BigID workflow to crawl metadata from BigID.

Crawl ClickHouse

Create a ClickHouse crawler workflow in Atlan to extract metadata from your database.

Crawl CrateDB

Configure and run the CrateDB crawler to extract metadata from your database.

Crawl Cyera

Crawl classification metadata from Cyera to enrich assets with data sensitivity and privacy information. Extract data sensitivity classifications and security issues.

Crawl Databricks

Crawl metadata from Databricks to catalog tables, schemas, views, and volumes. Extract lineage and column information after configuring access permissions and authentication.

Crawl Databricks AI models

Discover and catalog AI models registered in the Databricks Unity Catalog Model Registry using Atlan's Databricks connector.

Crawl DataStax Enterprise

Extract metadata from DataStax Enterprise to catalog databases, tables, columns, and keyspaces. Discover and govern your NoSQL data assets after configuring user permissions.

Crawl dbt

Configure and run the crawler to extract metadata from your dbt Cloud or dbt Core projects.

Crawl Domo

Extract metadata from Domo to catalog dashboards, datasets, cards, and pages. Discover and govern your BI assets after configuring permissions.

Crawl Dremio

Crawl metadata from Dremio to catalog physical datasets, virtual datasets, folders, and spaces. Extract lineage and asset information from your data lakehouse.

Crawl Fivetran

Enrich Atlan with Fivetran metadata to track data pipeline lineage and transformations. Configure the Fivetran Platform Connector and connect a supported data warehouse destination.

Crawl GCS assets

Configure and run the GCS crawler to catalog your Google Cloud Storage (GCS) buckets and objects in Atlan.

Crawl Google BigQuery

Crawl metadata from Google BigQuery to catalog datasets, tables, views, and columns. Extract lineage and asset details after completing prerequisite setup.

Crawl Hightouch

Crawl metadata from Hightouch to catalog models and syncs. Extract reverse ETL lineage between data warehouse sources and operational business tools after configuring API credentials.

Crawl Hive

Extract metadata from Hive to catalog databases, tables, views, and columns. Extract lineage and asset information after configuring user permissions.

Crawl IBM Cognos Analytics

Extract metadata from IBM Cognos Analytics to catalog reports, queries, and dashboards. Discover and govern your BI assets after configuring user permissions.

Crawl Iceberg

Crawl metadata from Iceberg to catalog tables, namespaces, and columns. Extract table metadata and lineage from your Iceberg data lakehouse catalogs.

Crawl Informatica CDI assets

Crawl metadata from Informatica Cloud Data Integration to catalog projects, workflows, tasks, and mappings. Extract data transformation lineage after configuring cloud access.

Crawl Looker

Once you have configured the [Looker user permissions](/apps/connectors/business-intelligence/looker/how-tos/set-up-looker), you can establish a connection between Atlan and Looker.

Crawl Matillion

Crawl metadata from Matillion to catalog jobs, tasks, pipelines, and transformations. Extract lineage information after configuring user permissions and connectivity.

Crawl Metabase

Once you have [configured the Metabase user permissions](/apps/connectors/business-intelligence/metabase/how-tos/set-up-metabase), you can establish a connection between Atlan and Metabase.

Crawl Microsoft Azure Cosmos DB

Extract metadata from Microsoft Azure Cosmos DB to catalog databases, containers, and documents. Discover and govern your NoSQL data assets after configuring permissions.

Crawl Microsoft Azure Data Factory

Once you have [configured the Microsoft Azure Data Factory permissions](/apps/connectors/etl-tools/microsoft-azure-data-factory/how-tos/set-up-microsoft-.

Crawl Microsoft Azure Event Hubs

Extract metadata from Microsoft Azure Event Hubs to catalog topics, partitions, and consumer groups. Discover and govern your messaging infrastructure after configuring permissions.

Crawl Microsoft Azure Synapse Analytics

Extract metadata from Microsoft Azure Synapse Analytics to catalog databases, schemas, tables, and views. Discover and govern your cloud data warehouse assets after configuring permissions.

Crawl Microsoft Fabric

Crawl metadata from Microsoft Fabric to catalog workspaces, reports, dashboards, and datasets. Extract lineage and analytics information from your Fabric environment.

Crawl Microsoft Power BI

Once you have configured the [Microsoft Power BI user permissions](/apps/connectors/business-intelligence/microsoft-power-bi/how-tos/set-up-microsoft-power-bi), you can establish a connection between Atlan and Microsoft Power BI.

Crawl Microsoft SQL Server

Crawl metadata from Microsoft SQL Server to catalog databases, schemas, tables, and views. Extract column details and enable data discovery, lineage tracking, and governance after configuring user permissions.

Crawl Microsoft SSRS

Crawl metadata from SQL Server Reporting Services (SSRS) to catalog reports, datasets, and data sources. Extract report definitions, lineage, and data model information.

Crawl MicroStrategy

Extract metadata from MicroStrategy to catalog reports, dashboards, and metrics. Discover and govern your BI assets after configuring authentication.

Crawl Mode

Once you have [configured the Mode user permissions](/apps/connectors/business-intelligence/mode/how-tos/set-up-mode), you can establish a connection between Atlan and Mode.

Crawl MongoDB (self-managed)

Once you have [configured the MongoDB permissions](/apps/connectors/database/mongodb/onprem/how-tos/set-up-mongodb-onprem), you can establish a connection between Atlan and MongoDB.

Crawl MongoDB Atlas

Configure the MongoDB Atlas connection and run the crawler to extract metadata from MongoDB into Atlan.

Crawl Monte Carlo

Once you have [configured the Monte Carlo permissions](/apps/connectors/observability/monte-carlo/how-tos/set-up-monte-carlo), you can establish a connection between Atlan and Monte Carlo.

Crawl MySQL

To crawl metadata from MySQL, review the [order of operations](/product/connections/how-tos/order-workflows) and then complete the following steps.

Crawl NetSuite

Crawl metadata from NetSuite to catalog record types, record fields, and business processes. Extract enterprise data structure after configuring access credentials.

Crawl on-premises databases

Extract metadata from on-premises databases to catalog tables, schemas, and views. Discover and govern your database assets after configuring the metadata-extractor tool.

Crawl on-premises Databricks

The Docker-based databricks-extractor offline tool has been sunset. For on-premises or network-restricted Databricks environments, use Self-Deployed Runtime, Secure Agent, or direct connectivity via private link.

Crawl on-premises IBM Cognos Analytics

Extract metadata from on-premises IBM Cognos Analytics to catalog reports, queries, and dashboards. Discover and govern your BI assets after configuring the extractor tool.

Crawl on-premises Kafka

Extract metadata from on-premises Kafka to catalog topics, partitions, and consumer groups. Discover and govern your Kafka messaging infrastructure after configuring the extractor tool.

Crawl on-premises Looker

Extract metadata from on-premises Looker to catalog dashboards, looks, explores, and data sources. Discover and govern your BI assets after configuring the extractor tool.

Crawl on-premises Tableau

Extract metadata from on-premises Tableau to catalog workbooks, dashboards, and sheets. Discover and govern your BI assets after configuring the extractor tool.

Crawl on-premises ThoughtSpot

Extract metadata from on-premises ThoughtSpot to catalog pinboards, answers, and liveboards. Discover and govern your BI assets after configuring the extractor tool.

Crawl Oracle

Crawl metadata from Oracle Database to catalog schemas, tables, views, and columns. Extract lineage and asset information after configuring user permissions and database connectivity.

Crawl PostgreSQL

Configure and run PostgreSQL metadata extraction workflows to catalog databases, schemas, tables, views, and columns in Atlan.

Crawl PrestoSQL

Once you have configured the [PrestoSQL user permissions](/apps/connectors/database/prestosql/how-tos/set-up-prestosql), you can establish a connection between Atlan and PrestoSQL.

Crawl Qlik Sense Cloud

Extract metadata from Qlik Sense Cloud to catalog applications, sheets, and visualizations. Discover and govern your BI assets after configuring permissions.

Crawl Qlik Sense Enterprise on Windows

Extract metadata from Qlik Sense Enterprise on Windows to catalog applications, sheets, and visualizations. Discover and govern your BI assets after configuring permissions.

Crawl Qualytics

Crawl metadata from Qualytics to capture data quality checks, anomalies, and scores. Enrich your assets with comprehensive quality insights from quality rules and validations.

Crawl Redash

Once you have [configured the Redash permissions](/apps/connectors/business-intelligence/redash/how-tos/set-up-redash), you can establish a connection between Atlan and Redash.

Crawl Redpanda Kafka

Extract metadata from Redpanda Kafka to catalog topics, partitions, and consumer groups. Discover and govern your Kafka messaging infrastructure after configuring permissions.

Crawl S3 assets

Configure and run the S3 crawler to catalog your Amazon S3 buckets and objects in Atlan.

Crawl SageMaker

Crawl lineage from AWS SageMaker to catalog machine learning pipelines, jobs, models, and datasets. Extract ML workflow lineage after configuring AWS credentials.

Crawl Salesforce

Extract metadata from Salesforce to catalog objects, fields, and relationships. Discover and govern your CRM data assets after configuring user permissions.

Crawl SAP Datasphere

Configure and run the crawler to extract metadata from SAP Datasphere spaces, views, analytical models, and columns into Atlan.

Crawl SAP ECC

Extract and catalog metadata from your SAP ECC system including tables, fields, and modules in Atlan.

Crawl SAP HANA

Extract metadata from SAP HANA to catalog databases, schemas, tables, views, and columns. Discover and govern your enterprise data assets after configuring user permissions.

Crawl SAP S/4HANA

Extract and catalog metadata from your SAP S/4HANA system including tables, fields, CDS views, and modules in Atlan.

Crawl Sigma

Once you have [configured the Sigma permissions](/apps/connectors/business-intelligence/sigma/how-tos/set-up-sigma), you can establish a connection between Atlan and Sigma.

Crawl Sisense

Extract metadata from Sisense to catalog dashboards, widgets, and datasets. Discover and govern your BI assets after configuring permissions.

Crawl Snowflake

To crawl metadata from Snowflake, review the [order of operations](/product/connections/how-tos/order-workflows) and then complete the following steps.

Crawl Snowflake AI models

Discover, catalog, and build lineage for AI models registered in the Snowflake Model Registry using Atlan's Snowflake connector.

Crawl Soda

Once you have [configured the Soda permissions](/apps/connectors/observability/soda/how-tos/set-up-soda), you can establish a connection between Atlan and Soda.

Crawl Starburst Enterprise

Crawl metadata from Starburst Enterprise to catalog catalogs, schemas, tables, and data products. Extract domain definitions and lineage information after configuring user permissions.

Crawl Tableau

Crawl metadata from Tableau to catalog workbooks, dashboards, sheets, and data sources. Extract lineage and usage information after configuring user permissions.

Crawl Talend assets

Crawl metadata from Talend to catalog jobs, components, and transformations. Extract transformation lineage from project files in GitHub or Atlassian Stash repositories.

Crawl Teradata

Extract metadata from Teradata to catalog databases, tables, views, and columns. Extract query lineage and asset information after configuring user permissions.

Crawl ThoughtSpot

Extract metadata from ThoughtSpot to catalog pinboards, answers, and liveboards. Discover and govern your BI assets after configuring permissions.

Crawl Trino

Crawl metadata from Trino to catalog schemas, tables, views, and columns. Extract distributed query lineage and asset information after configuring user permissions.

Disable data access

:::warning Who can do this? You will need to be an admin user in Atlan to configure these options.

Enrich Atlan through dbt

Beyond the default mapped [dbt Cloud](/apps/connectors/etl-tools/dbt/references/what-does-atlan-crawl-from-dbt-cloud) or [dbt Core](/apps/connectors/etl-tools/dbt/references/what-does-atlan-crawl-from-dbt-core) properties, you can update any of Atlan's metadata attributes (except for `name`, `tenantId`, and `qualifiedName`) through your dbt model's `meta` property.

Extract lineage and usage from Databricks

Retrieve lineage from Unity Catalog and usage and popularity metrics from query history or system tables using REST API, offline, or system table extraction methods.

Manage Databricks tags

You must have a [Unity Catalog-enabled workspace](https://docs.databricks.com/en/data-governance/unity-catalog/get-started.html) and SQL warehouse configured to import Databricks tags in Atlan.

Manage dbt tags

Atlan imports your [dbt tags](https://docs.getdbt.com/references/resource-configs/tags) and lets you update your dbt assets with the imported tags.

Manage Google BigQuery tags

Atlan imports your [Google BigQuery tags](https://docs.getdbt.com/references/resource-configs/tags) and lets you update your Google BigQuery assets with the imported tags. Note that object tagging in Google BigQuery currently requires [Enterprise edition or higher](https://cloud.google.com/bigquery/docs/editions-intro#editions_features).

Manage Snowflake tags

You can import your Snowflake tags to Atlan through one-way tag sync. The synced Snowflake tags will be matched to corresponding tags in Atlan through case-insensitive name match and your Snowflake assets will be enriched with their synced tags from Snowflake.

Mine Amazon Redshift

Once you have [crawled assets from Amazon Redshift](/apps/connectors/data-warehouses/amazon-redshift/how-tos/crawl-amazon-redshift), you can mine its query history to construct lineage and retrieve [usage and popularity metrics](/product/capabilities/usage-and-popularity/how-tos/interpret-usage-metrics).

Mine ClickHouse

Once you have [crawled assets from ClickHouse](/apps/connectors/database/clickhouse/how-tos/crawl-clickhouse), you can mine its query history to construct lineage.

Mine Google BigQuery

Once you have [crawled assets from Google BigQuery](/apps/connectors/data-warehouses/google-bigquery/how-tos/crawl-google-bigquery), you can mine its query history to construct lineage.

Mine Microsoft Power BI

Once you have crawled assets from Microsoft Power BI, you can mine its activity events to generate usage metrics.

Mine PostgreSQL

Once you have [crawled assets from PostgreSQL](/apps/connectors/database/postgresql/how-tos/crawl-postgresql), you can mine its query history to construct lineage.

Mine Snowflake

Once you have [crawled assets from Snowflake](/apps/connectors/data-warehouses/snowflake/how-tos/crawl-snowflake), you can mine its query history to construct lineage.

Mine Teradata

Once you have [crawled assets from Teradata](/apps/connectors/database/teradata/how-tos/crawl-teradata), you can mine its query history to construct lineage.

Order workflows

The [order of operations](/product/connections/how-tos/order-workflows#order-of-operations) you run in Atlan is important. Follow the specific workflow sequence outlined below when crawling [data tools](/product/connections/references/supported-sources). In particular, running workflows in the right order ensures that lineage is constructed without needing to rerun crawlers.

Preflight checks for Aiven Kafka

Before [running the Aiven Kafka crawler](/apps/connectors/messaging/aiven-kafka/how-tos/crawl-aiven-kafka), you can run [preflight checks](/product/conne.

Preflight checks for Amazon MSK

Before [running the Amazon MSK crawler](/apps/connectors/messaging/amazon-msk/how-tos/crawl-amazon-msk), you can run [preflight checks](/product/connecti.

Preflight checks for Amazon QuickSight

The [ListAnalyses](https://docs.aws.amazon.com/quicksight/latest/APIReference/API_ListAnalyses.html) REST API is used to fetch the actual list of analyses for which the user has view permission.

Preflight checks for Amazon Redshift

Before [running the Amazon Redshift crawler](/apps/connectors/data-warehouses/amazon-redshift/how-tos/crawl-amazon-redshift), you can run [preflight chec.

Preflight checks for Anaplan

The Anaplan REST API is used to fetch the actual list of workspaces and apps for which the user has view permission.

Preflight checks for Apache Kafka

Before [running the Apache Kafka crawler](/apps/connectors/messaging/apache-kafka/how-tos/crawl-apache-kafka), run [preflight checks](/product/connection.

Preflight checks for Databricks

Before [running the Databricks crawler](/apps/connectors/data-warehouses/databricks/how-tos/crawl-databricks), you can run [preflight checks](/product/co.

Preflight checks for Domo

Atlan uses the [DataSet API](https://developer.domo.com/portal/72ae9b3e80374-list-data-sets) to fetch dataset metadata from Domo.

Preflight checks for Google BigQuery

Each request requires an OAuth 2.0 access token generated via the [service account key](https://cloud.google.com/docs/authentication#service-accounts).

Preflight checks for Hive

Before [running the Hive crawler](/apps/connectors/database/hive/how-tos/crawl-hive), you can run [preflight checks](/product/connections/concepts/what-a.

Preflight checks for Looker

First, the list of projects in the _Include Projects_ and _Exclude Projects_ fields is determined. Next, the [Query Projects](https://developers.looker.com/api/explorer/3.1/methods/Project#get_all_projects) REST API is used to fetch the actual list of projects for which the user has [view capability](https://cloud.google.com/looker/docs/access-control-and-permission-management).

Preflight checks for Metabase

Before [running the Metabase crawler](/apps/connectors/business-intelligence/metabase/how-tos/crawl-metabase), you can run [preflight checks](/product/co.

Preflight checks for Microsoft Azure Synapse Analytics

This check is performed for both the [basic](/apps/connectors/data-warehouses/microsoft-azure-synapse-analytics/how-tos/set-up-microsoft-azure-synapse-analytics) and [service principal](/apps/connectors/data-warehouses/microsoft-azure-synapse-analytics/how-tos/set-up-microsoft-azure-synapse-analytics) authentication methods.

Preflight checks for MicroStrategy

First, the list of projects in the _Include Projects_ and _Exclude Projects_ fields is determined. Next, the [Get Projects REST API](https://demo.microstrategy.com/MicroStrategyLibrary/api-docs/index.html#/Projects/getProjects_1) is used to fetch the actual list of projects for which the user has permissions.

Preflight checks for Mode

Before [running the Mode crawler](/apps/connectors/business-intelligence/mode/how-tos/crawl-mode), you can run [preflight checks](/product/connections/co.

Preflight checks for MongoDB Atlas

Before running the MongoDB Atlas crawler, you can run preflight checks to verify the connected user has the required MongoDB privileges for metadata extraction.

Preflight checks for Monte Carlo

Before [running the Monte Carlo crawler](/apps/connectors/observability/monte-carlo/how-tos/crawl-monte-carlo), you can run [preflight checks](/product/c.

Preflight checks for MySQL

Before [running the MySQL crawler](/apps/connectors/database/mysql/how-tos/crawl-mysql), you can run [preflight checks](/product/connections/concepts/wha.

Preflight checks for Oracle

Before [running the Oracle crawler](/apps/connectors/database/oracle/how-tos/crawl-oracle), you can run [preflight checks](/product/connections/concepts/.

Preflight checks for PostgreSQL

Before [running the PostgreSQL crawler](/apps/connectors/database/postgresql/how-tos/crawl-postgresql), you can run [preflight checks](/product/connectio.

Preflight checks for PrestoSQL

Before [running the PrestoSQL crawler](/apps/connectors/database/prestosql/how-tos/crawl-prestosql), you can run [preflight checks](/product/connections/.

Preflight checks for Redash

Before [running the Redash crawler](/apps/connectors/business-intelligence/redash/how-tos/crawl-redash), you can run [preflight checks](/product/connecti.

Preflight checks for Redpanda Kafka

Before [running the Redpanda Kafka crawler](/apps/connectors/messaging/redpanda-kafka/how-tos/crawl-redpanda-kafka), you can run [preflight checks](/prod.

Preflight checks for Salesforce

Before [running the Salesforce crawler](/apps/connectors/crm/salesforce/how-tos/crawl-salesforce), you can run [preflight checks](/product/connections/co.

Preflight checks for Sigma

First, the list of workbooks in the _Include Workbooks_ and _Exclude Workbooks_ fields is determined. Next, the [List Workbooks](https://help.sigmacomputing.com/hc/en-us/articles/4408555666323) REST API is used to fetch the actual list of workbooks for which the user credentials have view permission.

Preflight checks for Sisense

Atlan uses the [Folders API](https://sisense.dev/guides/restApi/v1/?platform=linux&spec=L2023.6#/folders) to check whether it responds with a status code of 200.

Preflight checks for Snowflake

Before [running the Snowflake crawler](/apps/connectors/data-warehouses/snowflake/how-tos/crawl-snowflake), you can run [preflight checks](/product/conne.

Preflight checks for Tableau

The [Server Info](https://help.tableau.com/current/api/rest_api/en-us/REST/rest_api_ref_server.htm#server_info) REST API is used to fetch the `restApiVersion` value.

Preflight checks for Teradata

Before [running the Teradata crawler](/apps/connectors/database/teradata/how-tos/crawl-teradata), you can run [preflight checks](/product/connections/con.

Preflight checks for Trino

Before [running the Trino crawler](/apps/connectors/database/trino/how-tos/crawl-trino), you can run [preflight checks](/product/connections/concepts/wha.

Provide SSL certificates

SSL (Secure Sockets Layer) encryption helps establish a secure connection between your data source and Atlan. Atlan currently supports SSL certificates for [crawling Tableau](/apps/connectors/business-intelligence/tableau/how-tos/crawl-tableau) and [crawling Matillion](/apps/connectors/etl-tools/matillion/how-tos/crawl-matillion).

Set up Amazon S3

Create AWS IAM permissions and credentials for Atlan to access and catalog your S3 buckets and objects.

Set up BigID

Create a BigID system user and API token for Atlan integration.

Set up Cyera

Create a Cyera API token and obtain credentials for Atlan integration.

Set up dbt Cloud

Configure authentication tokens in dbt Cloud to enable Atlan to fetch and enrich your assets with dbt metadata.

Set up Domo

:::warning Who can do this? You'll need your Domo administrator to complete these steps - you may not have access yourself.

Set up Fivetran

Configure the Fivetran Platform Connector and destination to enable Atlan to extract Fivetran metadata and logs.

Set up Google BigQuery

You must be a Google BigQuery administrator to run these commands. For more information, see [Google Cloud's Granting, changing, and revoking access to resources](https://cloud.google.com/iam/docs/granting-changing-revoking-access).

Set up Google Cloud Knowledge Catalog

Configure Google Cloud Knowledge Catalog connection and authentication by creating a service account with required permissions, or using Workload Identity Federation.

Set up Hightouch

Configure Hightouch roles, groups, and API keys to connect with Atlan.

Set up Hive

Configure permissions and authentication for Hive to enable metadata extraction in Atlan.

Set up IBM Cognos Analytics

:::warning Who can do this? You must be an IBM Cognos Analytics administrator to complete these steps - you may not have access yourself.

Set up Looker

:::warning Who can do this? You probably need your Looker administrator to run these commands - you may not have access yourself.

Set up Microsoft Azure Cosmos DB

If your Microsoft Azure Cosmos DB deployment includes a mix of vCore- and RU-based accounts, you must configure both to fetch metadata. You can then use the _vCore and RU_ deployment option to [crawl your Microsoft Azure Cosmos DB assets](/apps/connectors/database/microsoft-azure-cosmos-db/how-tos/crawl-microsoft-azure-cosmos-db).

Set up Microsoft SQL Server

Configure authentication and permissions for Microsoft SQL Server to enable Atlan to crawl metadata from your database.

Set up Mode

If you do not see the prompts to enter details for the user above, you are probably already signed in to Mode. Sign out of Mode first, and then accept the invite in the service account email.

Set up MongoDB (self-managed)

Atlan supports SCRAM authentication (SCRAM-SHA-1 and SCRAM-SHA-256) for fetching metadata from MongoDB. This method uses a [username and password](#create-database-user) to fetch metadata.

Set up MongoDB Atlas

Atlan supports the basic authentication method for fetching metadata from MongoDB. This method uses a username and password to fetch metadata.

Set up Monte Carlo

:::warning Who can do this? You will probably need your Monte Carlo [account owner](https://docs.getmontecarlo.com/docs/authorizationmanaged-roles-and-groups).

Set up on-premises database access

In such cases you may want to decouple the extraction of metadata from its ingestion in Atlan. This approach gives you full control over your resources and metadata transfer to Atlan.

Set up on-premises Databricks access

The Docker-based databricks-extractor offline tool has been sunset. For on-premises or network-restricted Databricks environments, use Self-Deployed Runtime, Secure Agent, or direct connectivity via private link.

Set up on-premises Kafka access

In some cases you won't be able to expose your Kafka instance for Atlan to crawl and ingest metadata. For example, this may happen when security requirements restrict access to sensitive, mission-critical data.

Set up on-premises Looker access

In some cases you won't be able to expose your Looker instance for Atlan to crawl and ingest metadata. For example, this may happen when security requirements restrict access to sensitive, mission-critical data.

Set up on-premises Tableau access

In some cases you may not be able to expose your Tableau instance for Atlan to crawl and ingest metadata. For example, this may happen when security requirements restrict access to sensitive, mission-critical data.

Set up on-premises ThoughtSpot access

In some cases you will not be able to expose your ThoughtSpot instance for Atlan to crawl and ingest metadata. For example, this may happen when security requirements restrict access to sensitive, mission-critical data.

Set up PostgreSQL

:::warning Who can do this? You will probably need your PostgreSQL administrator to run these commands - you may not have access yourself.

Set up SAP HANA

:::warning Who can do this? You will probably need your SAP HANA administrator to run these commands - you may not have access yourself.

Set up Sisense

Atlan supports the basic authentication method for fetching metadata from Sisense. This method uses a username and password to fetch metadata.

Set up Snowflake

:::warning Who can do this? You need your Snowflake administrator to run these commands - you may not have access yourself.

Set up Tableau

:::warning Who can do this? You probably need your Tableau administrator to run these commands - you may not have access yourself.

Set up Teradata

:::warning Who can do this? You need your Teradata administrator to run these commands - you may not have access yourself.

Set up ThoughtSpot

:::warning Who can do this? You will probably need your ThoughtSpot instance administrator to complete these steps - you may not have access yourself.

Update column metadata in Google Sheets

Once you've [connected Atlan with Google Sheets](/product/integrations/collaboration/spreadsheets/how-tos/integrate-atlan-with-google-sheets), you can import the column metadata for all your data assets in Atlan and make changes to them directly in Google Sheets.

Update column metadata in Microsoft Excel

Once you've [connected Atlan with Microsoft Excel](/product/integrations/collaboration/spreadsheets/how-tos/integrate-atlan-with-microsoft-excel), you can import the column metadata for all your data assets in Atlan and make changes to them directly in Microsoft Excel.

View data models

Once you have [ingested your ER model assets in Atlan](/product/capabilities/data-models/concepts/what-are-data-models), you can:

What does Atlan crawl from Amazon Athena?

During an Amazon Athena crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Columns. Reference tables map each Athena property to its Atlan asset.

What does Atlan crawl from Amazon DynamoDB?

Atlan crawls and maps the following assets and properties from Amazon DynamoDB. Atlan also currently supports lineage from Amazon DynamoDB as a source to supported data warehouses as destinations, as enriched by Fivetran.

What does Atlan crawl from Amazon Redshift?

During an Amazon Redshift crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Columns. Reference tables map each Redshift property to its Atlan asset.

What does Atlan crawl from Anomalo?

Once you have [integrated Anomalo](/apps/connectors/observability/anomalo/how-tos/integrate-anomalo), Atlan will receive webhook events when checks are executed in Anomalo. These checks will be cataloged in Atlan to create a relationship with existing assets using the association information from the check.

What does Atlan crawl from Apache Kafka?

During an Apache Kafka crawl, Atlan extracts metadata for Clusters, Topics, Consumer Groups, Schema Subjects, Schema Versions, and Schema Fields. Reference tables map each Kafka property to its Atlan asset.

What does Atlan crawl from Astronomer/OpenLineage?

Atlan maps the following assets and properties from Astronomer/OpenLineage. Asset lineage support depends on the [list of operators supported by OpenLineage](https://airflow.apache.org/docs/apache-airflow-providers-openlineage/1.6.0/supported_classes.html).

What does Atlan crawl from AWS Glue?

During an AWS Glue crawl, Atlan extracts metadata for Catalogs, Databases, Schemas, Tables, and Jobs. Reference tables map each Glue property to its Atlan asset.

What does Atlan crawl from ClickHouse?

During a ClickHouse crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Columns. Reference tables map each ClickHouse property to its Atlan asset.

What does Atlan crawl from Confluent Kafka?

During a Confluent Kafka crawl, Atlan extracts metadata for Clusters, Topics, Consumer Groups, Schema Subjects, Schema Versions, and Schema Fields. Reference tables map each Confluent Kafka property to its Atlan asset.

What does Atlan crawl from Databricks?

During a Databricks crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, Materialized Views, Volumes, External Locations, and AI Models. Reference tables map each Databricks property to its Atlan asset.

What does Atlan crawl from Fivetran?

During a Fivetran crawl, Atlan creates lineage between source and destination data assets and extracts metadata for Fivetran Connectors. Reference tables map each Fivetran Connector property to its Atlan asset.

What does Atlan crawl from Google BigQuery?

Atlan doesn't run any table scans. Atlan leverages the table preview options from [Google BigQuery](https://cloud.google.com/bigquery/docs/best-practices-costs#preview-data) that enable you to view data for free and without affecting any quotas using the `tabledata.list` API. Hence, [table](/apps/connectors/data-warehouses/google-bigquery/references/what-does-atlan-crawl-from-google-bigquery#tables) asset previews in Atlan are already cost-optimized. However, this doesn't apply to [views](/apps/connectors/data-warehouses/google-bigquery/references/what-does-atlan-crawl-from-google-bigquery#views) and [materialized views](/apps/connectors/data-warehouses/google-bigquery/references/what-does-atlan-crawl-from-google-bigquery#materialized-views).

What does Atlan crawl from Google Cloud Composer/OpenLineage?

Atlan maps the following assets and properties from Google Cloud Composer/OpenLineage. Asset lineage support depends on the [list of operators supported by OpenLineage](https://airflow.apache.org/docs/apache-airflow-providers-openlineage/1.6.0/supported_classes.html).

What does Atlan crawl from Hive?

During a Hive crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, Materialized Views, and Columns. Reference tables map each Hive property to its Atlan asset.

What does Atlan crawl from Looker?

During a Looker crawl, Atlan extracts metadata for Connections, Projects, Views, Models, Folders, Looks, Dashboards, Tiles, and Explores. Reference tables map each Looker property to its Atlan asset.

What does Atlan crawl from Monte Carlo?

<Badge variant="preview" text="Private Preview" link="/get-started/references/product-release-stages#private-preview" />

What does Atlan crawl from MySQL?

During a MySQL crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Stored Procedures. Reference tables map each MySQL property to its Atlan asset.

What does Atlan crawl from Oracle?

During an Oracle crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Procedures. Reference tables map each Oracle property to its Atlan asset.

What does Atlan crawl from PostgreSQL?

During a PostgreSQL crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Functions. Reference tables map each PostgreSQL property to its Atlan asset.

What does Atlan crawl from Salesforce?

During a Salesforce crawl, Atlan extracts metadata for Organizations, Objects, Fields, Reports, and Dashboards. Reference tables map each Salesforce property to its Atlan asset.

What does Atlan crawl from SAP BW/4HANA?

<Badge variant="preview" text="Private Preview" link="/get-started/references/product-release-stages#private-preview" />

What does Atlan crawl from SAP ECC?

<Badge variant="preview" text="Public Preview" link="/get-started/references/product-release-stages#public-preview" />

What does Atlan crawl from SAP S/4HANA?

<Badge variant="preview" text="Public Preview" link="/get-started/references/product-release-stages#public-preview" />

What does Atlan crawl from Sisense?

During a Sisense crawl, Atlan extracts metadata for Dashboards, Widgets, Data Models, Data Model Tables, and Folders. Reference tables map each Sisense property to its Atlan asset.

What does Atlan crawl from Snowflake?

During a Snowflake crawl, Atlan extracts metadata for Databases, Schemas, Tables, Views, and Procedures. Reference tables map each Snowflake property to its Atlan asset.

What does Atlan crawl from Soda?

Atlan crawls datasets from Soda and filters out any dataset that has no checks. It then crawls the checks associated with each remaining dataset. These checks are cataloged in Atlan to create a relationship with existing assets using the association information from the dataset.

What does Atlan crawl from Tableau?

During a Tableau crawl, Atlan extracts metadata for Projects, Workbooks, Worksheets, Dashboards, Datasources, and Flows. Reference tables map each Tableau property to its Atlan asset.

What does Atlan crawl from Teradata?

During a Teradata crawl, Atlan extracts metadata for Schemas, Tables, Views, and Columns. Reference tables map each Teradata property to its Atlan asset.

What lineage does Atlan extract from Matillion?

Atlan uses Matillion's metadata API to generate lineage associated with [Matillion connectors](https://www.matillion.com/connectors). This is particularly useful for creating lineage between different tools.

What lineage does Atlan extract from Microsoft Power BI?

This document explains how Atlan uses a custom query parser to generate lineage from your Microsoft Power BI assets to upstream SQL sources, and the steps you can take while developing reports and dashboards in Microsoft Power BI to enable seamless lineage generation.

When does Atlan become a personal data processor or subprocessor?

Atlan personnel do not have access to any customer instance unless specifically provided by the customer. Accordingly, in the event that a customer instance contains personal data and Atlan personnel are provided access to that instance, Atlan may act as a personal data processor. In addition, depending on whether the customer is a data controller or processor, Atlan may act as a data processor or subprocessor, respectively.