Skip to main content

Crawl Microsoft Power BI

Once you have configured the Microsoft Power BI user permissions, you can establish a connection between Atlan and Microsoft Power BI.

To crawl metadata from Microsoft Power BI, review

the order of operations and then complete the following steps.

Select the source

To select Microsoft Power BI as your source:

  1. In the top right of any screen, navigate to New and then click New Workflow.
  2. From the list of packages, select Power BI Assets and click on Setup Workflow.

Provide credentials

To enter your Microsoft Power BI credentials:

  1. For Authentication, choose the method you want to use to access Microsoft Power BI:
    • For Delegated User authentication, enter the UsernamePasswordTenant IdClient Id, and Client Secret you configured when setting up Microsoft Power BI.
    • For Service Principal authentication, enter the Tenant IdClient Id, and Client Secret you configured when setting up Microsoft Power BI.
  2. At the bottom of the form, click the Test Authentication button to confirm connectivity to Microsoft Power BI using these details.
  3. Once successful, at the bottom of the screen click the Next button.

Configure the connection

To complete the Microsoft Power BI connection configuration:

  1. Provide a Connection Name that represents your source environment. For example, you might want to use values like production, development, gold, or analytics.

  2. (Optional) To change the users able to manage this connection, change the users or groups listed under Connection Admins.

    danger

    If you do not specify any user or group, nobody will be able to manage the connection - not even admins.

  3. At the bottom of the screen, click the Next button to proceed.

Configure the crawler

Before running the Microsoft Power BI crawler, configure metadata extraction and advanced options. You can override the default settings for the following fields.

Configure metadata

  • Include Workspaces: Select Microsoft Power BI workspaces to include. Defaults to all workspaces when left blank.
  • Exclude Workspaces: Select workspaces to exclude. No workspaces are excluded by default.
  • Include Dashboard and Reports Regex: Use a regular expression to include dashboards and reports based on naming patterns. Includes all by default.
  • Exclude Dashboard and Reports Regex: Use a regular expression to exclude dashboards and reports based on naming patterns. Excludes none by default.
  • Attach Endorsements from Power BI: Automatically certify assets endorsed in Power BI. To manually review before applying, change this setting to Send a Request. For more details, see What does Atlan crawl from Microsoft Power BI?

Configure advanced settings

  • Source Connections: When your tenant has multiple connections available for the same source system that share the similar metadata, confirm the advanced options and choose the correct connections from the Source Connections list drop down to avoid creating duplicate lineage to such connections.
  • Enable ODBC DSN Connectivity Mapping: Power BI provides multiple ways of connecting to a SQL source, including ODBC connectivity for building Reports and Dashboards. When datasets are populated using ODBC, provide a mapping of the DSN ( Data Source Name ) names to their appropriate database qualified names after enabling this toggle.
Did you know?

If a workspace appears in both the include and exclude filters, the exclude filter takes precedence.

Run the crawler

To run the Microsoft Power BI crawler, after completing the steps above:

  1. To check for any permissions or other configuration issues before running the crawler, click Preflight checks.
  2. You can either:
    • To run the crawler once immediately, at the bottom of the screen, click the Run button.
    • To schedule the crawler to run hourly, daily, weekly, or monthly, at the bottom of the screen, click the Schedule Run button.

Once the crawler has completed running, you will see the assets in Atlan's asset page! 🎉