Crawl Talend assets
Create a workflow in Atlan to discover and catalog Talend assets from project files stored in a GitHub or Atlassian Stash repository. Each workflow connects to one repository. To catalog assets from multiple repositories, create a separate workflow for each.
Prerequisites
Before you begin, make sure you have:
- Completed Talend connector setup
- Repository URL and personal access token ready
- Branch name containing your Talend project files
Create crawler workflow
- In Atlan, navigate to New > New Workflow.
- In the Marketplace tab, search for or select Talend Connector.
- Click Set up workflow.
- Select GitHub Cloud Store as the cloud store type. Both GitHub and Atlassian Stash (Bitbucket Server) are supported through this connection type.
Configure connection
Enter the repository URL, access token, and connection settings for this workflow.
- Enter your repository details:
- Repository URL: Enter the repository path ending with
.git, for examplegithub.com/organization/repository.git. Thehttps://prefix is added automatically. - Git access token: Enter your personal access token with read permissions.
- Branch name: Enter the branch to crawl, for example
mainormaster. - Talend project path: If your Talend projects are in a subdirectory, enter the path, for example
talend_projects/. Leave this blank to crawl from the repository root.
- Repository URL: Enter the repository path ending with
- Enter a Connection name to identify this Talend connection in Atlan, for example
production-talend. - To change who manages this connection, update the users or groups listed under Connection admins.
Run crawler
Choose to run the crawler once immediately or schedule it to run at a recurring interval.
- Click Run to run once, or Schedule & Run to run hourly, daily, weekly, or monthly.
- Monitor progress in the workflow execution details.
After the crawler completes, assets appear in Atlan. Navigate to Assets and filter by Talend to browse projects, jobs, components, and component fields.
See also
What does Atlan crawl from Talend: Review the Talend asset types and metadata Atlan extracts
Need help
If you encounter issues while crawling Talend assets, see Connection and authentication issues. For additional assistance, contact Atlan support by submitting a request.