Skip to main content

Crawl Confluent Schema Registry

Configure and run the standalone Schema Registry crawler to extract subjects and schema versions into Atlan. To crawl Schema Registry alongside Confluent Kafka in a single workflow, see Crawl Confluent Kafka instead.

Prerequisites

Before you begin, make sure you have:

Create crawler workflow

  1. In Atlan, select New > New Workflow.

  2. Select Confluent Schema Registry Assets and click Setup Workflow.

  3. Choose your extraction method:

    • Direct -- Atlan connects to your Confluent Schema Registry and crawls metadata directly over the network.
    • Agent -- Self-Deployed Runtime executes metadata extraction within your organization's environment, keeping all connections inside your network perimeter.

Configure extraction

  1. For Host, enter your schema registry endpoint.
  2. For API Key, enter the API key you copied.
  3. For API Secret, enter the API secret you copied.
  4. Click Test Authentication to confirm connectivity, then click Next.

Configure connection

  1. Provide a Connection Name that represents your source environment -- for example, production, development, gold, or analytics.

  2. Under Connection Admins, add the users or groups that can manage this connection.

    warning

    If you don't specify any user or group, no one can manage the connection -- not even admins.

  3. At the bottom of the screen, click Next.

Configure crawling options

On the Metadata page, you can override the defaults for any of these options:

  • Click Exclude subjects to exclude specific subjects from crawling. Defaults to no exclusions if none are specified.
  • Click Include subjects to limit crawling to specific subjects. Defaults to all subjects if none are specified.

If an asset appears in both include and exclude filters, the exclude filter takes precedence.

Run crawler

  1. Click Preflight checks to validate permissions and configuration before running.
  2. After preflight checks pass, either:
    • Click Run to run the crawler once immediately.
    • Click Schedule & Run to schedule the crawler to run hourly, daily, weekly, or monthly.

Once the crawler completes, the assets appear on Atlan's asset page.

See also