What does Atlan crawl from SageMaker
Atlan crawls comprehensive metadata from your Amazon SageMaker environment, covering models, jobs, deployments, and associated assets.
Asset
Atlan crawls the following SageMaker assets, each with specific metadata fields.
Model
SageMaker models represent trained machine learning models that can be deployed for inference. Atlan maps model packages from Amazon SageMaker to its SageMakerModel asset type.
| Source field | Atlan field | Description |
|---|---|---|
ModelPackageName | name | Model package name (format: GroupName/Version) |
ModelPackageArn | cloudUniformResourceName | AWS ARN of the model package |
CreationTime | sourceCreatedAt | When the model package was created |
ContainerImage | sageMakerModelContainerImage | Docker container image for the model |
S3Uri | sageMakerS3Uri | S3 URI of model artifacts |
ModelPackageVersion | sageMakerModelVersion | Version of the model package |
ModelPackageStatus | sageMakerModelStatus | Status of the model package (ACTIVE or INACTIVE) |
ModelPackageGroupName | sageMakerModelModelGroupName | Name of the parent Model Group |
ModelPackageGroupArn | sageMakerModelModelGroupQualifiedName | Qualified name of the parent Model Group |
ModelPackageGroupArn | sageMakerModelGroup | Reference to the parent Model Group |
ExternalUrl | sourceURL | Link to AWS SageMaker console |
Model Group
SageMaker model groups represent collections of versioned model packages that can be organized and managed together. Atlan maps model package groups from Amazon SageMaker to its SageMakerModelGroup asset type.
| Source field | Atlan field | Description |
|---|---|---|
ModelPackageGroupName | name | Model package group name |
ModelPackageGroupArn | cloudUniformResourceName | AWS ARN of the model package group |
ModelPackageGroupDescription | description | Description of the model package group |
CreationTime | sourceCreatedAt | When the model package group was created |
CreationTime | sourceUpdatedAt | Last update time (defaults to creation time) |
ModelPackageGroupStatus | sageMakerModelGroupStatus | Current status of the model package group |
ModelPackageGroupStatus | aiModelStatus | Normalized AI model status (ACTIVE/INACTIVE) |
| (Derived) | aiModelVersion | Highest version number from model packages in the group |
Job
SageMaker jobs represent various types of ML job executions including training, processing, and transform jobs. Atlan maps jobs from Amazon SageMaker to its FlowControlOperation asset type.
| Source field | Atlan field | Description |
|---|---|---|
TrainingJobName | name | Job name |
TrainingJobArn | flowId | AWS ARN of the job |
JobType | assetUserDefinedType | Type of job (Training Job, Processing Job, Transform Job) |
TrainingJobStatus | flowStatus | Current status of the job |
CreationTime | sourceCreatedAt | When the job was created |
LastModifiedTime | sourceUpdatedAt | When the job was last updated |
TrainingStartTime | flowStartedAt | When the job started |
TrainingEndTime | flowFinishedAt | When the job completed |
ExternalUrl | sourceURL | Link to AWS SageMaker console |
Feature Group
SageMaker Feature Groups represent collections of related features for machine learning training and inference. Atlan maps feature groups from Amazon SageMaker to its SageMakerFeatureGroup asset type.
| Source field | Atlan field | Description |
|---|---|---|
FeatureGroupName | name | Feature group name |
FeatureGroupArn | cloudUniformResourceName | AWS ARN of the feature group |
Description | description | Feature group description |
CreationTime | sourceCreatedAt | When the feature group was created |
LastModifiedTime | sourceUpdatedAt | When the feature group was last updated |
FeatureGroupStatus | sageMakerFeatureGroupStatus | Current status of the feature group |
FeatureCount | sageMakerFeatureGroupFeatureCount | Number of features in the feature group |
RecordIdName | sageMakerFeatureGroupRecordIdName | Name of the record identifier feature |
OfflineStoreS3Uri | sageMakerS3Uri | S3 URI of offline store data |
GlueDatabase | sageMakerFeatureGroupGlueDatabaseName | AWS Glue database name |
GlueTable | sageMakerFeatureGroupGlueTableName | AWS Glue table name |
ExternalUrl | sourceURL | Link to AWS SageMaker console |
Model Deployment
SageMaker endpoints represent deployed models that serve real-time inference requests. Atlan maps model deployments from Amazon SageMaker to its SageMakerModelDeployment asset type.
| Source field | Atlan field | Description |
|---|---|---|
EndpointName | name | Endpoint name |
EndpointArn | cloudUniformResourceName | AWS ARN of the endpoint |
CreatedAt | sourceCreatedAt | When the endpoint was created |
LastModifiedTime | sourceUpdatedAt | When the deployment was last updated |
ModelDeploymentStatus | sageMakerModelDeploymentStatus | Current status of the endpoint |
EndpointConfigName | sageMakerModelDeploymentEndpointConfigName | Associated endpoint configuration |
ModelName | sageMakerModelDeploymentModelName | Name of the parent Model |
ModelArn | sageMakerModelDeploymentModelQualifiedName | Qualified name of the parent Model |
ModelArn | sageMakerModel | Reference to the parent Model |
ExternalUrl | sourceURL | Link to AWS SageMaker console |
Feature
SageMaker features represent individual features within Feature Groups, including their data type and metadata. Atlan maps features from Amazon SageMaker to its SageMakerFeature asset type.
| Source field | Atlan field | Description |
|---|---|---|
FeatureName | name | Feature name |
FeatureGroupArn | cloudUniformResourceName | AWS ARN of the feature group |
FeatureGroupArn | sageMakerFeatureGroup | Reference to the containing feature group |
CreationTime | sourceCreatedAt | When the feature was created |
FeatureGroupName | sageMakerFeatureGroupName | Name of the containing feature group |
FeatureGroupQualifiedName | sageMakerFeatureGroupQualifiedName | Qualified name of the containing feature group |
DataType | sageMakerFeatureDataType | Data type of the feature |
IsRecordIdentifier | sageMakerFeatureIsRecordIdentifier | Whether this feature serves as record identifier |
LastModifiedTime | sourceUpdatedAt | When the feature was last updated |