
Databricks

Discovery
Lineage
Popularity
Browser Ext

Overview: Catalog Databricks workspaces, databases, schemas, and tables in Atlan. Gain visibility into lineage, usage, and governance for your Databricks assets.


Get started

Start by setting up the Databricks connector in Atlan. This involves configuring authentication and connection settings so Atlan can access your Databricks workspace.

  • Set up the connector: Configure the Databricks connector with authentication credentials and connection settings.
    • Additional configuration, depending on your environment:
      • Enable SSO for Databricks: Set up SSO authentication if your organization uses single sign-on for Databricks access.
      • Set up cross-workspace extraction: Configure a single service principal to crawl metadata from all workspaces within a Databricks metastore. This is useful when multiple workspaces share the same metastore.
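
Before you start the connector setup, it can help to collect the connection values in one place and check them for obvious mistakes. The following is a minimal sketch, not part of Atlan or Databricks: the class name, field names, and checks are hypothetical and only illustrate the kind of settings (workspace host, HTTP path, token) a Databricks connection typically needs. The exact fields Atlan asks for may differ.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class DatabricksConnectionSettings:
    """Hypothetical container for values gathered before connector setup."""

    host: str        # workspace hostname, without scheme (assumed convention)
    http_path: str   # HTTP path of the SQL warehouse or cluster
    token: str       # access token or service principal secret
    port: int = 443  # Databricks endpoints are served over HTTPS

    def problems(self) -> list[str]:
        """Return human-readable issues; an empty list means nothing obvious is wrong."""
        issues = []
        if "://" in self.host or "/" in self.host:
            issues.append("host should be a bare hostname, without scheme or path")
        if not self.http_path.strip():
            issues.append("http_path is empty")
        if not self.token.strip():
            issues.append("token is empty")
        if not 0 < self.port < 65536:
            issues.append("port is out of range")
        return issues


# Placeholder values for illustration only; substitute your own workspace details.
settings = DatabricksConnectionSettings(
    host="example.cloud.databricks.com",
    http_path="/sql/1.0/warehouses/abc123",
    token="replace-me",
)
for issue in settings.problems():
    print(issue)
```

A check like this only catches formatting slips. It does not confirm that the credentials are valid or that the workspace is reachable.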

Crawl assets

After setting up the connector, crawl your Databricks assets to discover and catalog them in Atlan.

Advanced setup

Use these guides for specialized deployment scenarios or additional configuration options. They apply when your Databricks environment requires specific network configurations or runs on-premises, or when you need to extract lineage and usage metrics or manage tags.

Lineage and usage

Extract lineage and usage metrics to understand how data flows through your Databricks assets and which assets are most frequently accessed.

On-premises

Private network

Set up private network connections to Databricks when you need secure, private connectivity without exposing your Databricks workspace to the public internet.

Tag management

Configure and manage tags in Databricks to enhance metadata governance.


Concepts


References


Troubleshooting


FAQ