Generate Power BI columns to dataset lineage
You can generate lineage between Microsoft Power BI columns and datasets by using the PowerBI Columns -> Dataset lineage app. The lineage shows which columns feed each dataset and report, which makes it easier to troubleshoot issues and understand the impact of changes.
This guide walks you through preparing your report.json files and configuring the app to read them from an AWS S3 bucket by using an IAM role. You can use similar steps when reading from AWS S3 with access keys or from Azure Blob Storage.
Prerequisites
Before you begin, make sure you have:
- Access to the PowerBI Columns -> Dataset lineage app in your Atlan workspace. If you don't have access, contact Atlan support or your Atlan customer team.
- A Microsoft Power BI connection set up and crawled for the reports you want to include in lineage.
- The ability to enable Developer mode in Power BI Desktop. For detailed steps, see Deep dive into Power BI Desktop Developer Mode (Preview).
- Permissions to store and read files in the AWS S3 bucket or Azure Blob Storage container that holds your Power BI report.json files. For storage configuration details, see Object storage for apps.
Prepare Power BI report metadata
Before you can generate lineage, you first need to extract the internal Power BI metadata that the app uses. This metadata is stored in each report’s report.json file, which can only be accessed by enabling Developer Mode and saving the report as a Power BI project. Follow the steps below to prepare these files.
- Open each Power BI report in Power BI Desktop and enable Developer Mode by going to File → Options and settings → Options → Preview features and selecting the Power BI Project (.pbip) save option.
- Save the report as a Power BI Project (.pbip) file. On your machine, open the project folder and locate the .Report directory, which contains the report.json file for that report.
- Upload the report.json files to your chosen object storage location (AWS S3 or Azure Blob Storage).
  - For AWS S3, create a folder for each report using the Power BI report ID as the folder name, and upload the report.json inside that folder.
  - For Azure Blob Storage, upload each file using the format:
    <Power BI report ID>@@@<file name>.json
If you prefer to use AWS access keys or Azure object storage, review authentication examples in Object storage for apps before continuing.
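The naming rules above can be sketched in a few lines of Python. This is a minimal illustration, not part of the app: the helper names (find_report_json, s3_object_key, azure_blob_name) and the optional folder_path argument are hypothetical, and the sketch assumes that report.json sits directly inside a directory whose name ends in .Report. It only computes local paths and destination names; the upload itself is left to whichever tool you already use.

```python
from pathlib import Path
from typing import List


def find_report_json(project_root: Path) -> List[Path]:
    """Return each report.json that sits directly inside a *.Report directory."""
    return sorted(
        p for p in project_root.rglob("report.json")
        if p.parent.name.endswith(".Report")
    )


def s3_object_key(report_id: str, folder_path: str = "") -> str:
    """Build an S3 key: one folder per report ID, holding report.json."""
    prefix = folder_path.strip("/")  # optional parent folder (an assumption)
    key = f"{report_id}/report.json"
    return f"{prefix}/{key}" if prefix else key


def azure_blob_name(report_id: str, file_name: str = "report") -> str:
    """Build a blob name in the <Power BI report ID>@@@<file name>.json format."""
    return f"{report_id}@@@{file_name}.json"
```

For a report whose ID is stored in a variable report_id, s3_object_key(report_id, "powerbi") yields powerbi/<report ID>/report.json, and azure_blob_name(report_id) yields <report ID>@@@report.json.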
Set up workflow
- Go to the homepage and click New workflow in the top navigation bar.
- Search for PowerBI Columns -> Dataset lineage, and then select Set up workflow.
- In Workflow Name, enter a descriptive name that clearly identifies the scope of the lineage, such as Power BI columns to dataset lineage - finance workspaces.
- In Output type, select Generate Lineage to create lineage from the input files. If you want to delete lineage previously created by this app for the same scope, select Delete Lineage.
- In Connection, click Select a connection and choose the Atlan Microsoft Power BI connection that's associated with the reports whose report.json files you uploaded.
- In Input, select the storage option that matches where you uploaded your report.json files, such as AWS S3 IAM Role, AWS S3 IAM User, or Azure Blob.
- Based on your selected storage option, configure the required fields such as the bucket or container name, folder path, and region (or the equivalent settings for Azure). For role configuration examples, see the Amazon S3 section in Object storage for apps.
- Review the configuration and save the workflow.
- Run the workflow once to generate lineage between Power BI columns and datasets based on the uploaded report.json files.
Verify lineage
After the workflow finishes, validate that lineage has been created as expected.
- From the left navigation, click Assets.
- Search for a Power BI dataset that was in scope for the workflow and open its asset profile.
- Click the Lineage tab and zoom in on the dataset.
- Check that:
  - Power BI columns appear as upstream nodes connected to the dataset.
  - The lineage graph reflects the column usage in your Power BI reports.
- Repeat the check for a few additional datasets or reports to make sure lineage coverage matches your expectations.
If you need to adjust which assets are included in lineage, update the workflow configuration or the set of report.json files in your bucket and rerun the workflow.
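Before rerunning, it can help to compare the reports you expect against the files actually present in the bucket. The following is a rough sketch under stated assumptions: it takes a plain list of object keys (for example, copied from an S3 listing), assumes the AWS S3 layout of one folder per report ID containing report.json, and uses hypothetical helper names (report_ids_in_listing, missing_reports). It doesn't call Atlan or AWS.

```python
from typing import Iterable, Set


def report_ids_in_listing(keys: Iterable[str], folder_path: str = "") -> Set[str]:
    """Collect report IDs from keys shaped like <folder path>/<report ID>/report.json."""
    prefix = folder_path.strip("/")
    prefix = f"{prefix}/" if prefix else ""
    found = set()
    for key in keys:
        if not key.startswith(prefix):
            continue  # outside the configured folder path
        parts = key[len(prefix):].split("/")
        # Expect exactly <report ID>/report.json below the prefix.
        if len(parts) == 2 and parts[0] and parts[1] == "report.json":
            found.add(parts[0])
    return found


def missing_reports(
    expected_ids: Iterable[str], keys: Iterable[str], folder_path: str = ""
) -> Set[str]:
    """Return the expected report IDs that have no report.json in the listing."""
    return set(expected_ids) - report_ids_in_listing(keys, folder_path)
```

Any ID returned by missing_reports points to a report whose report.json is absent or stored under an unexpected path, which is one possible reason for a gap in lineage coverage.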
Need help?
If you have any issues while configuring or running the app, contact Atlan support.