Sigma assets package

The Sigma assets package crawls Sigma assets and publishes them to Atlan for discovery.

Will create a new connection

Use this method only to create the workflow the first time. Each time you run it, it creates a new connection and new assets within that connection, which could lead to duplicate assets if you run the workflow this way multiple times with the same settings.

Instead, when you want to re-crawl assets, re-run the existing workflow (see Re-run existing workflow below).

To crawl assets directly from Sigma:

Java

Direct extraction from Sigma
Workflow crawler = SigmaCrawler.creator( // (1)
      client, // (2)
      "production", // (3)
      List.of(client.getRoleCache().getIdForName("$admin")), // (4)
      null,
      null,
      true, // (5)
      true, // (6)
      10000L // (7)
    )
    .direct( // (8)
      "aws-api.sigmacomputing.com"
    )
    .apiToken(
      "client-id", // (9)
      "api-token" // (10)
    )
    .include( // (11)
      List.of("995aecc2-fecf-497a-b169-5b3f96073618")
    )
    .exclude(List.of()) // (12)
    .build()  // (13)
    .toWorkflow();  // (14)

WorkflowResponse response = crawler.run(client);  // (15)

(1) The SigmaCrawler package will create a workflow to crawl assets from Sigma.
(2) You must provide an Atlan client.
(3) You must provide a name for the connection that the Sigma assets will exist within.
(4) You must specify at least one connection admin, either:
    - everyone in a role (in this example, all $admin users).
    - a list of groups (names) that will be connection admins.
    - a list of users (names) that will be connection admins.
(5) You can specify whether you want to allow queries to this connection (true, as in this example) or deny all query access to the connection (false).
(6) You can specify whether you want to allow data previews on this connection (true, as in this example) or deny all sample data previews to the connection (false).
(7) You can specify a maximum number of rows that can be accessed for any asset in the connection.
(8) When crawling assets directly from Sigma, you are required to provide the following information:
    - hostname of the Sigma host, for example aws-api.sigmacomputing.com
(9) You must provide the client ID through which to access Sigma.
(10) You must provide the API token through which to access Sigma.
(11) You can optionally specify the list of workbooks to include in crawling. For Sigma assets, this should be specified as a list of workbook GUIDs. If set to null, all workbooks will be crawled.
(12) You can optionally specify the list of workbooks to exclude from crawling. For Sigma assets, this should be specified as a list of workbook GUIDs. If set to null, no workbooks will be excluded.
(13) Build the minimal package object.
(14) Now, you can convert the package into a Workflow object.
(15) You can then run the workflow using the run() method on the object you've created. Because this operation will execute work in Atlan, you must provide it an AtlanClient through which to connect to the tenant.
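
To make the interaction between the workbook include and exclude lists concrete, here is a minimal, self-contained sketch in plain Java. It does not call the Atlan SDK: the class WorkbookFilterSketch, the method select(), and the "wb-..." identifiers are hypothetical names used only for illustration. The documentation states only what null means for each list; treating an empty list like null, and letting exclude win when a GUID appears in both lists, are assumptions of this sketch and may not match how the crawler actually behaves.

```java
import java.util.List;
import java.util.stream.Collectors;

// Illustrative only: WorkbookFilterSketch and select() are hypothetical names,
// not part of the Atlan SDK.
class WorkbookFilterSketch {

    // Returns the workbook GUIDs that would be crawled under this sketch's rules.
    // Documented: a null include list means all workbooks are crawled.
    // Documented: a null exclude list means no workbooks are excluded.
    // Assumption: an empty list is treated the same as null.
    // Assumption: exclude takes precedence when a GUID is in both lists.
    static List<String> select(List<String> all, List<String> include, List<String> exclude) {
        boolean includeAll = include == null || include.isEmpty();
        boolean excludeNone = exclude == null || exclude.isEmpty();
        return all.stream()
            .filter(id -> includeAll || include.contains(id))
            .filter(id -> excludeNone || !exclude.contains(id))
            .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Placeholder identifiers standing in for real workbook GUIDs.
        List<String> all = List.of("wb-1", "wb-2", "wb-3");

        // No filters: every workbook is selected.
        System.out.println(select(all, null, null));
        // Include two workbooks, exclude none (as in the example's empty exclude list).
        System.out.println(select(all, List.of("wb-1", "wb-2"), List.of()));
        // Same include list, but one of the included workbooks is also excluded.
        System.out.println(select(all, List.of("wb-1", "wb-2"), List.of("wb-2")));
    }
}
```

In the crawler example itself, the include list holds a single workbook GUID and the exclude list is empty, which corresponds to the second call in this sketch.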

Workflows run asynchronously