Skip to main content

What does Atlan crawl from SageMaker

Atlan crawls comprehensive metadata from your Amazon SageMaker environment, covering models, jobs, deployments, and associated assets.

Asset

Atlan crawls the following SageMaker assets, each with specific metadata fields.

Model

SageMaker models represent trained machine learning models that can be deployed for inference. Atlan maps model packages from Amazon SageMaker to its SageMakerModel asset type.

Source fieldAtlan fieldDescription
ModelPackageNamenameModel package name (format: GroupName/Version)
ModelPackageArncloudUniformResourceNameAWS ARN of the model package
CreationTimesourceCreatedAtWhen the model package was created
ContainerImagesageMakerModelContainerImageDocker container image for the model
S3UrisageMakerS3UriS3 URI of model artifacts
ModelPackageVersionsageMakerModelVersionVersion of the model package
ModelPackageStatussageMakerModelStatusStatus of the model package (ACTIVE or INACTIVE)
ModelPackageGroupNamesageMakerModelModelGroupNameName of the parent Model Group
ModelPackageGroupArnsageMakerModelModelGroupQualifiedNameQualified name of the parent Model Group
ModelPackageGroupArnsageMakerModelGroupReference to the parent Model Group
ExternalUrlsourceURLLink to AWS SageMaker console

Model Group

SageMaker model groups represent collections of versioned model packages that can be organized and managed together. Atlan maps model package groups from Amazon SageMaker to its SageMakerModelGroup asset type.

Source fieldAtlan fieldDescription
ModelPackageGroupNamenameModel package group name
ModelPackageGroupArncloudUniformResourceNameAWS ARN of the model package group
ModelPackageGroupDescriptiondescriptionDescription of the model package group
CreationTimesourceCreatedAtWhen the model package group was created
CreationTimesourceUpdatedAtLast update time (defaults to creation time)
ModelPackageGroupStatussageMakerModelGroupStatusCurrent status of the model package group
ModelPackageGroupStatusaiModelStatusNormalized AI model status (ACTIVE/INACTIVE)
(Derived)aiModelVersionHighest version number from model packages in the group

Job

SageMaker jobs represent various types of ML job executions including training, processing, and transform jobs. Atlan maps jobs from Amazon SageMaker to its FlowControlOperation asset type.

Source fieldAtlan fieldDescription
TrainingJobNamenameJob name
TrainingJobArnflowIdAWS ARN of the job
JobTypeassetUserDefinedTypeType of job (Training Job, Processing Job, Transform Job)
TrainingJobStatusflowStatusCurrent status of the job
CreationTimesourceCreatedAtWhen the job was created
LastModifiedTimesourceUpdatedAtWhen the job was last updated
TrainingStartTimeflowStartedAtWhen the job started
TrainingEndTimeflowFinishedAtWhen the job completed
ExternalUrlsourceURLLink to AWS SageMaker console

Feature Group

SageMaker Feature Groups represent collections of related features for machine learning training and inference. Atlan maps feature groups from Amazon SageMaker to its SageMakerFeatureGroup asset type.

Source fieldAtlan fieldDescription
FeatureGroupNamenameFeature group name
FeatureGroupArncloudUniformResourceNameAWS ARN of the feature group
DescriptiondescriptionFeature group description
CreationTimesourceCreatedAtWhen the feature group was created
LastModifiedTimesourceUpdatedAtWhen the feature group was last updated
FeatureGroupStatussageMakerFeatureGroupStatusCurrent status of the feature group
FeatureCountsageMakerFeatureGroupFeatureCountNumber of features in the feature group
RecordIdNamesageMakerFeatureGroupRecordIdNameName of the record identifier feature
OfflineStoreS3UrisageMakerS3UriS3 URI of offline store data
GlueDatabasesageMakerFeatureGroupGlueDatabaseNameAWS Glue database name
GlueTablesageMakerFeatureGroupGlueTableNameAWS Glue table name
ExternalUrlsourceURLLink to AWS SageMaker console

Model Deployment

SageMaker endpoints represent deployed models that serve real-time inference requests. Atlan maps model deployments from Amazon SageMaker to its SageMakerModelDeployment asset type.

Source fieldAtlan fieldDescription
EndpointNamenameEndpoint name
EndpointArncloudUniformResourceNameAWS ARN of the endpoint
CreatedAtsourceCreatedAtWhen the endpoint was created
LastModifiedTimesourceUpdatedAtWhen the deployment was last updated
ModelDeploymentStatussageMakerModelDeploymentStatusCurrent status of the endpoint
EndpointConfigNamesageMakerModelDeploymentEndpointConfigNameAssociated endpoint configuration
ModelNamesageMakerModelDeploymentModelNameName of the parent Model
ModelArnsageMakerModelDeploymentModelQualifiedNameQualified name of the parent Model
ModelArnsageMakerModelReference to the parent Model
ExternalUrlsourceURLLink to AWS SageMaker console

Feature

SageMaker features represent individual features within Feature Groups, including their data type and metadata. Atlan maps features from Amazon SageMaker to its SageMakerFeature asset type.

Source fieldAtlan fieldDescription
FeatureNamenameFeature name
FeatureGroupArncloudUniformResourceNameAWS ARN of the feature group
FeatureGroupArnsageMakerFeatureGroupReference to the containing feature group
CreationTimesourceCreatedAtWhen the feature was created
FeatureGroupNamesageMakerFeatureGroupNameName of the containing feature group
FeatureGroupQualifiedNamesageMakerFeatureGroupQualifiedNameQualified name of the containing feature group
DataTypesageMakerFeatureDataTypeData type of the feature
IsRecordIdentifiersageMakerFeatureIsRecordIdentifierWhether this feature serves as record identifier
LastModifiedTimesourceUpdatedAtWhen the feature was last updated

See also