Skip to main content

SDR Orchestrator FAQ

Find answers to common questions about troubleshooting and monitoring the SDR Orchestrator.

Did you know?

Some capabilities shown here may require additional enablement or licensing. Contact your Atlan representative for details.

General

Minimum requirements

Docker / Podman (VM): The SDR Orchestrator itself needs 0.5 vCPU, 256 MB RAM, and ~500 MB disk. Each connector app adds 0.5–1 vCPU, 512 MB–1 GB RAM, and ~1–2 GB disk on top of that. For example, a VM running the SDR Orchestrator plus 3 apps needs at least 2 vCPUs, 4 GB RAM, and 20 GB disk.

Kubernetes: The SDR pod requests 100m–500m CPU and 256Mi–512Mi memory. Each app pod requires approximately 0.5–1 vCPU and 512Mi–1Gi memory. The Kubernetes scheduler distributes pods across available nodes—no single-host sizing constraint.

Size your VM or cluster based on the total number of apps you plan to run. See System requirements for the full breakdown.

SDR state storage

Docker / Podman:

PathContentsBackup needed?
/mnt/config/config.yamlYour infrastructure config (read-only mount)Yes—this is your only config file
/mnt/deployments/Generated per-app deployment directories (docker-compose, .env, Dapr components)No—the SDR regenerates these on deploy

There's no database or persistent state beyond these two paths. If the VM is lost, re-run the SDR with the same config.yaml and Atlan re-deploys the apps automatically.

Kubernetes: State is stored in Kubernetes resources (ConfigMap and Secret) created by Helm. These are regenerated from values.yaml during install. If the namespace is deleted, re-run helm install with the same values.yaml and Atlan re-deploys the apps automatically.

Running multiple SDR instances

Each SDR instance manages apps on one VM (or one Kubernetes namespace). You can't run two SDR instances with the same deployment_name.

For multiple VMs or clusters, use a unique deployment_name per environment (for example, mycompany-ec2-prod, mycompany-ec2-staging). Each appears as a separate SDR in the Atlan UI under Settings > Self-Deployed Runtimes.

Troubleshooting

Permission denied on socket

If you see permission denied when the SDR tries to manage containers:

  • Docker: Make sure you included --group-add $(getent group docker | cut -d: -f3) in the docker run command. This gives the SDR permission to use the Docker socket.
  • Podman: Verify the socket exists at /run/user/$(id -u)/podman/podman.sock and that you started the socket with systemctl --user enable --now podman.socket.
  • Kubernetes: Check the SDR ServiceAccount has the correct RBAC permissions: kubectl get rolebinding -n atlan -l app=atlan-sdr. Verify the Role and RoleBinding exist and reference the correct ServiceAccount.

SDR shows as Disconnected in Atlan UI

  • Check that outbound HTTPS and gRPC (port 443) aren't blocked by your firewall or security group.

  • Verify the TEMPORAL_HOST environment variable points to your Atlan Temporal endpoint.

  • Check SDR logs:

    • Docker: docker logs atlan-sdr-{deployment_name}
    • Podman: podman logs atlan-sdr-{deployment_name}
    • Kubernetes: kubectl logs -n atlan -l app=atlan-sdr

    Look for connection errors or authentication failures.

Image pull fails

  • Docker / Podman: Verify you ran docker login -u atlanhq (or podman login) with the correct PAT before starting the SDR.
  • Kubernetes: Verify the atlan-registry-creds secret exists and is referenced in imagePullSecrets:
    kubectl get secret atlan-registry-creds -n atlan
    If the secret is missing, create it:
    kubectl create secret docker-registry atlan-registry-creds \
    --namespace atlan \
    --docker-server=docker.io \
    --docker-username=YOUR_USERNAME \
    --docker-password=YOUR_PAT_TOKEN
  • If using a private registry, check that container_registry.base, username, and password are correct in config.yaml (Docker/Podman), or sdr.config.containerRegistryBase is set correctly in values.yaml (Kubernetes).

Monitoring

Monitoring SDR

Docker / Podman:

  • Container status: docker ps | grep atlan-sdr or podman ps | grep atlan-sdr

  • Resource usage:

    # Docker:
    docker stats atlan-sdr-{deployment_name}

    # Podman:
    podman stats atlan-sdr-{deployment_name}
  • Logs:

    # Docker:
    docker logs --tail=100 -f atlan-sdr-{deployment_name}

    # Podman:
    podman logs --tail=100 -f atlan-sdr-{deployment_name}

Kubernetes:

  • Pod status: kubectl get pods -n atlan -l app=atlan-sdr
  • Resource usage: kubectl top pods -n atlan -l app=atlan-sdr
  • Logs: kubectl logs -n atlan -l app=atlan-sdr --tail=100 -f
  • All app pods: kubectl get pods -n atlan

The SDR doesn't expose a metrics endpoint. Monitor it externally via container/pod health status and log aggregation.

Log lines to watch for

Log messageMeaning
Connected to TemporalSDR successfully connected to Atlan—healthy
Worker started / K8s SDR worker listening on task_queueSDR is polling for tasks—healthy
connection refusedCan't reach Temporal endpoint—check firewall/DNS
permission deniedSocket access issue (Docker/Podman) or RBAC issue (Kubernetes)
pull access deniedRegistry auth failed—check docker login or imagePullSecrets

See also