Skip to main content

Crawl MongoDB Atlas

Create a crawler workflow to automatically discover and catalog your MongoDB Atlas assets, including databases, collections, and schema metadata.

Prerequisites

Before you begin, make sure you have:

Create crawler workflow

Create a new MongoDB Atlas crawler workflow in Atlan by selecting the connector package, configuring your extraction method and connection details, and running the crawler to extract metadata.


  1. In the top right of any screen, navigate to New and then click New Workflow.
  2. From the list of packages, select MongoDB Atlas Assets and click Setup Workflow.

Configure extraction

Select your extraction method and provide the connection details.

In Direct extraction, Atlan connects to your database and crawls metadata directly.

Configure connection

Set up connection details including a descriptive name and admin access.

  1. Provide a Connection Name that represents your source environment. For example, you might use values like production, development, gold, or analytics.
  2. To change the users able to manage this connection, update the users or groups listed under Connection Admins. If you don't specify any user or group, nobody can manage the connection, including admins.
  3. Click Next at the bottom of the screen.

Configure crawler

Configure crawler settings to control which assets to include or exclude. If an asset appears in both filters, the exclude filter takes precedence.

On the Metadata Filters page, you can override the defaults. The options are the same for Direct and Agent extraction; when you use Agent extraction, filtering and document sampling run on your Self-Deployed Runtime.

  1. To select the assets you want to include in crawling, click Include Metadata. By default, all assets are included if none are specified.
  2. To select the assets you want to exclude from crawling, click Exclude Metadata. By default, no assets are excluded if none are specified.
  3. To have the crawler ignore collections by naming convention, enter a regular expression in the Exclude regex for collections field (for example .*_TMP|.*_TEMP|TMP.*|TEMP.*).

Run crawler

Run preflight checks to validate your configuration, then execute the crawler immediately or schedule it to run on a recurring basis.

  1. To verify permissions and configuration before running, click Preflight checks. This option is available for Direct extraction only.
  2. Choose your run option:
    • To run the crawler once immediately, click Run at the bottom of the screen.
    • To schedule the crawler to run hourly, daily, weekly, or monthly, click Schedule & Run at the bottom of the screen.

Once the crawler completes, you can view the assets in Atlan's asset page.

Need help

If you encounter issues, refer to the Troubleshooting MongoDB Atlas connectivityto resolve common connection errors. You can also contact the Atlan support team by submitting a support request.

See also