Skip to main content

Microsoft Dataverse crawler
App

The Microsoft Dataverse Crawler app creates or reuses a Dataverse connection in Atlan, fetches entities and their attributes, and publishes them for discovery. It also fetches Dataverse relationships to publish lineage between entities and attributes, and archives assets in Atlan when they're deleted or archived in Dataverse.

This reference provides complete configuration details for the Microsoft Dataverse Crawler app. Use this page to look up field definitions, valid inputs, and behavior.

Access

The Microsoft Dataverse Crawler app isn't enabled by default. To use this app, contact Atlan support and request it be added to your tenant. Once enabled, admins or users with workflow permissions can configure and run the crawler.

Client ID

Use the application (client) ID from your Azure app registration. This value identifies the app that the crawler uses for OAuth. Find it in Azure Portal under App registrations > Overview.

Example:

123456ab-987c-12a1-1234-ab12345a1234

Client secret

Use the client secret value created for the Azure app registration. The crawler uses this value to request tokens for the app. Use the secret value itself, not the secret ID.

Example:

Abc1Z~abCD._a1CD2EF~A1-AbcDEfgHIj1kLmnop

Tenant ID

Use the tenant (directory) ID for the Microsoft Entra tenant that owns the Dataverse environment. The crawler uses it to target the correct identity provider. Find it in Azure Portal under Microsoft Entra ID > Overview.

Example:

123abc1a-1ab2-1234-1234-123456a12ab1

Dataverse URI

Provide the full Dynamics 365 URL for the Dataverse environment. Use the base environment URL that ends with .crm.dynamics.com. The crawler uses this to reach Dataverse APIs.

Example:

https://bag-mpd365-crm-sandbox.crm.dynamics.com

Connection

Choose the name of the Atlan connection created or reused by this app. This name appears in Atlan and groups all assets from the Dataverse source. The name must be unique across Dataverse connections.

Example:

dataverse-prod

Connection admins

Controls who has admin permissions on the Atlan connection. The default value adds all workspace admins. Update this to scope admin access to the right owners.

Example:

All_admins

Include entities by regex

Use a regex filter to ingest only entities that match the pattern. This filter limits the crawl scope so only matching entities and their attributes are processed.

Example:

^(Account|Contact|Lead)$

Exclude entities by regex

Use a regex filter to exclude entities that match the pattern. This filter removes matching entities and their attributes from the crawl scope.

Example:

^Test_.*

See also