Skip to main content

Enable data quality on connection

Enable data quality on your Databricks connection in Atlan to start monitoring data quality.

Prerequisites

Before you enable Data Quality for Databricks, confirm that you have:

Enable data quality

Once the Atlan team grants the access, follow these steps to enable data quality on your Databricks connection.

  1. Turn on the data quality configuration toggle to configure:

    Enable Toggle for Databricks

    • Navigate to Admin in Atlan
    • Find the Labs section
    • Turn on the Databricks toggle under Data Quality section
  2. Select your connection and configure credentials:

    1. Navigate to Admin > Labs > Data Quality
    2. Click Connections for Databricks and select your connection from the list
    3. Click Configure for your selected connection
    4. Turn on Databricks data quality toggle
    5. Enter the following credential details:
      • Client ID: The service principal client ID created in Databricks setup
      • Client Secret: The service principal client secret
      • Tenant ID: The tenant ID (Azure only)
      • Workspace URL: Your Databricks workspace URL
      • SQL Warehouse: Your preferred SQL warehouse for DQ operations
    6. Enter your DQ catalog name: Atlan uses this catalog to store DQ related metadata for this connection in your Databricks environment.
    7. Click Run permissions check to verify:
      • Credentials have necessary permissions in Databricks
      • Databricks setup completed correctly
    8. Click Update to save the credentials

Next steps

After completing these steps:

  • Atlan takes approximately 15 minutes to complete the setup in the background
  • Once finished, you'll see data quality options available on your Databricks assets
  • You can start creating data quality rules on tables and views

Need help

If you have questions or need assistance with enabling data quality on your connection, reach out to Atlan Support by submitting a support request.

See also