Commit f3fc657

Author: AWS
Amazon SageMaker Service Update: Release IPv6 support with dualstack in SageMaker Notebooks, Tiered Storage Checkpointing Support in SageMaker HyperPod and P5.4xlarge instance type for SageMaker Hosting.
1 parent e35e270 commit f3fc657

2 files changed: 72 additions and 6 deletions

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+{
+    "type": "feature",
+    "category": "Amazon SageMaker Service",
+    "contributor": "",
+    "description": "Release IPv6 support with dualstack in SageMaker Notebooks, Tiered Storage Checkpointing Support in SageMaker HyperPod and P5.4xlarge instance type for SageMaker Hosting."
+}

services/sagemaker/src/main/resources/codegen-resources/service-2.json

Lines changed: 66 additions & 6 deletions
@@ -8382,6 +8382,13 @@
       "type":"string",
       "pattern":"[a-z]{3}\\d-az\\d"
     },
+    "ClusterConfigMode":{
+      "type":"string",
+      "enum":[
+        "Enable",
+        "Disable"
+      ]
+    },
     "ClusterEbsVolumeConfig":{
       "type":"structure",
       "members":{
@@ -8680,6 +8687,12 @@
       "max":100,
       "min":0
     },
+    "ClusterInstanceMemoryAllocationPercentage":{
+      "type":"integer",
+      "box":true,
+      "max":100,
+      "min":0
+    },
     "ClusterInstancePlacement":{
       "type":"structure",
       "members":{
@@ -9354,6 +9367,21 @@
       "max":2,
       "min":1
     },
+    "ClusterTieredStorageConfig":{
+      "type":"structure",
+      "required":["Mode"],
+      "members":{
+        "Mode":{
+          "shape":"ClusterConfigMode",
+          "documentation":"<p>Specifies whether managed tier checkpointing is enabled or disabled for the HyperPod cluster. When set to <code>Enable</code>, the system installs a memory management daemon that provides disaggregated memory as a service for checkpoint storage. When set to <code>Disable</code>, the feature is turned off and the memory management daemon is removed from the cluster.</p>"
+        },
+        "InstanceMemoryAllocationPercentage":{
+          "shape":"ClusterInstanceMemoryAllocationPercentage",
+          "documentation":"<p>The percentage (int) of cluster memory to allocate for checkpointing.</p>"
+        }
+      },
+      "documentation":"<p>Defines the configuration for managed tier checkpointing in a HyperPod cluster. Managed tier checkpointing uses multiple storage tiers, including cluster CPU memory, to provide faster checkpoint operations and improved fault tolerance for large-scale model training. The system automatically saves checkpoints at high frequency to memory and periodically persists them to durable storage, like Amazon S3.</p>"
+    },
     "CodeEditorAppImageConfig":{
       "type":"structure",
       "members":{
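
The three shapes added so far (`ClusterConfigMode`, `ClusterInstanceMemoryAllocationPercentage`, `ClusterTieredStorageConfig`) imply a small set of client-side constraints. The following is a minimal Python sketch of those constraints only. The helper name `build_tiered_storage_config` is hypothetical and is not part of any SDK, and the service may enforce additional rules that this diff does not show.

```python
from typing import Optional

# Enum values of the ClusterConfigMode shape, as listed in the diff.
CLUSTER_CONFIG_MODES = ("Enable", "Disable")


def build_tiered_storage_config(mode: str,
                                memory_percentage: Optional[int] = None) -> dict:
    """Hypothetical helper: build a TieredStorageConfig dict and check it
    against the constraints visible in the model (Mode is required and must be
    an enum value; the percentage is an optional integer from 0 to 100)."""
    if mode not in CLUSTER_CONFIG_MODES:
        raise ValueError(f"Mode must be one of {CLUSTER_CONFIG_MODES}, got {mode!r}")

    config = {"Mode": mode}
    if memory_percentage is not None:
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(memory_percentage, bool) or not isinstance(memory_percentage, int):
            raise TypeError("InstanceMemoryAllocationPercentage must be an integer")
        if not 0 <= memory_percentage <= 100:
            raise ValueError("InstanceMemoryAllocationPercentage must be between 0 and 100")
        config["InstanceMemoryAllocationPercentage"] = memory_percentage
    return config


if __name__ == "__main__":
    print(build_tiered_storage_config("Enable", 20))
    print(build_tiered_storage_config("Disable"))
```

The resulting dictionary has the member names of `ClusterTieredStorageConfig`, so it could plausibly be passed as the `TieredStorageConfig` member of a cluster create or update request in an SDK generated from this model.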
@@ -10541,6 +10569,10 @@
           "shape":"ClusterNodeRecovery",
           "documentation":"<p>The node recovery mode for the SageMaker HyperPod cluster. When set to <code>Automatic</code>, SageMaker HyperPod will automatically reboot or replace faulty nodes when issues are detected. When set to <code>None</code>, cluster administrators will need to manually manage any faulty cluster instances.</p>"
         },
+        "TieredStorageConfig":{
+          "shape":"ClusterTieredStorageConfig",
+          "documentation":"<p>The configuration for managed tier checkpointing on the HyperPod cluster. When enabled, this feature uses a multi-tier storage approach for storing model checkpoints, providing faster checkpoint operations and improved fault tolerance across cluster nodes.</p>"
+        },
         "NodeProvisioningMode":{
           "shape":"ClusterNodeProvisioningMode",
           "documentation":"<p>The mode for provisioning nodes in the cluster. You can specify the following modes:</p> <ul> <li> <p> <b>Continuous</b>: Scaling behavior that enables 1) concurrent operation execution within instance groups, 2) continuous retry mechanisms for failed operations, 3) enhanced customer visibility into cluster events through detailed event streams, 4) partial provisioning capabilities. Your clusters and instance groups remain <code>InService</code> while scaling. This mode is only supported for EKS orchestrated clusters.</p> </li> </ul>"
@@ -11792,7 +11824,7 @@
         },
         "LabelAttributeName":{
           "shape":"LabelAttributeName",
-          "documentation":"<p>The attribute name to use for the label in the output manifest file. This is the key for the key/value pair formed with the label that a worker assigns to the object. The <code>LabelAttributeName</code> must meet the following requirements.</p> <ul> <li> <p>The name can't end with \"-metadata\". </p> </li> <li> <p>If you are using one of the following <a href=\"https://docs.aws.amazon.com/sagemaker/latest/dg/sms-task-types.html\">built-in task types</a>, the attribute name <i>must</i> end with \"-ref\". If the task type you are using is not listed below, the attribute name <i>must not</i> end with \"-ref\".</p> <ul> <li> <p>Verification (<code>VerificationSemanticSegmentation</code>) labeling jobs for this task type.</p> </li> <li> <p>Video frame object detection (<code>VideoObjectDetection</code>), and adjustment and verification (<code>AdjustmentVideoObjectDetection</code>) labeling jobs for this task type.</p> </li> <li> <p>Video frame object tracking (<code>VideoObjectTracking</code>), and adjustment and verification (<code>AdjustmentVideoObjectTracking</code>) labeling jobs for this task type.</p> </li> <li> <p>3D point cloud semantic segmentation (<code>3DPointCloudSemanticSegmentation</code>), and adjustment and verification (<code>Adjustment3DPointCloudSemanticSegmentation</code>) labeling jobs for this task type. </p> </li> <li> <p>3D point cloud object tracking (<code>3DPointCloudObjectTracking</code>), and adjustment and verification (<code>Adjustment3DPointCloudObjectTracking</code>) labeling jobs for this task type. </p> </li> </ul> </li> </ul> <p/> <important> <p>If you are creating an adjustment or verification labeling job, you must use a <i>different</i> <code>LabelAttributeName</code> than the one used in the original labeling job. The original labeling job is the Ground Truth labeling job that produced the labels that you want verified or adjusted. To learn more about adjustment and verification labeling jobs, see <a href=\"https://docs.aws.amazon.com/sagemaker/latest/dg/sms-verification-data.html\">Verify and Adjust Labels</a>.</p> </important>"
+          "documentation":"<p>The attribute name to use for the label in the output manifest file. This is the key for the key/value pair formed with the label that a worker assigns to the object. The <code>LabelAttributeName</code> must meet the following requirements.</p> <ul> <li> <p>The name can't end with \"-metadata\". </p> </li> <li> <p>If you are using one of the <a href=\"https://docs.aws.amazon.com/sagemaker/latest/dg/sms-task-types.html\">built-in task types</a> or one of the following, the attribute name <i>must</i> end with \"-ref\".</p> <ul> <li> <p>Image semantic segmentation (<code>SemanticSegmentation)</code> and adjustment (<code>AdjustmentSemanticSegmentation</code>) labeling jobs for this task type. One exception is that verification (<code>VerificationSemanticSegmentation</code>) <i>must not</i> end with -\"ref\".</p> </li> <li> <p>Video frame object detection (<code>VideoObjectDetection</code>), and adjustment and verification (<code>AdjustmentVideoObjectDetection</code>) labeling jobs for this task type.</p> </li> <li> <p>Video frame object tracking (<code>VideoObjectTracking</code>), and adjustment and verification (<code>AdjustmentVideoObjectTracking</code>) labeling jobs for this task type.</p> </li> <li> <p>3D point cloud semantic segmentation (<code>3DPointCloudSemanticSegmentation</code>), and adjustment and verification (<code>Adjustment3DPointCloudSemanticSegmentation</code>) labeling jobs for this task type. </p> </li> <li> <p>3D point cloud object tracking (<code>3DPointCloudObjectTracking</code>), and adjustment and verification (<code>Adjustment3DPointCloudObjectTracking</code>) labeling jobs for this task type. </p> </li> </ul> </li> </ul> <p/> <important> <p>If you are creating an adjustment or verification labeling job, you must use a <i>different</i> <code>LabelAttributeName</code> than the one used in the original labeling job. The original labeling job is the Ground Truth labeling job that produced the labels that you want verified or adjusted. To learn more about adjustment and verification labeling jobs, see <a href=\"https://docs.aws.amazon.com/sagemaker/latest/dg/sms-verification-data.html\">Verify and Adjust Labels</a>.</p> </important>"
         },
         "InputConfig":{
           "shape":"LabelingJobInputConfig",
@@ -12367,6 +12399,10 @@
           "shape":"SecurityGroupIds",
           "documentation":"<p>The VPC security group IDs, in the form sg-xxxxxxxx. The security groups must be for the same VPC as specified in the subnet. </p>"
         },
+        "IpAddressType":{
+          "shape":"IPAddressType",
+          "documentation":"<p>The IP address type for the notebook instance. Specify <code>ipv4</code> for IPv4-only connectivity or <code>dualstack</code> for both IPv4 and IPv6 connectivity. When you specify <code>dualstack</code>, the subnet must support IPv6 CIDR blocks. If not specified, defaults to <code>ipv4</code>.</p>"
+        },
         "RoleArn":{
           "shape":"RoleArn",
           "documentation":"<p> When you send any requests to Amazon Web Services resources from the notebook instance, SageMaker AI assumes this role to perform tasks on your behalf. You must grant this role necessary permissions so SageMaker AI can perform these tasks. The policy must allow the SageMaker AI service principal (sagemaker.amazonaws.com) permissions to assume this role. For more information, see <a href=\"https://docs.aws.amazon.com/sagemaker/latest/dg/sagemaker-roles.html\">SageMaker AI Roles</a>. </p> <note> <p>To be able to pass this role to SageMaker AI, the caller of this API must have the <code>iam:PassRole</code> permission.</p> </note>"
@@ -15613,6 +15649,10 @@
           "shape":"ClusterOrchestrator",
           "documentation":"<p>The type of orchestrator used for the SageMaker HyperPod cluster. </p>"
         },
+        "TieredStorageConfig":{
+          "shape":"ClusterTieredStorageConfig",
+          "documentation":"<p>The current configuration for managed tier checkpointing on the HyperPod cluster. For example, this shows whether the feature is enabled and the percentage of cluster memory allocated for checkpoint storage.</p>"
+        },
         "NodeRecovery":{
           "shape":"ClusterNodeRecovery",
           "documentation":"<p>The node recovery mode configured for the SageMaker HyperPod cluster.</p>"
@@ -18453,6 +18493,10 @@
           "shape":"InstanceType",
           "documentation":"<p>The type of ML compute instance running on the notebook instance.</p>"
         },
+        "IpAddressType":{
+          "shape":"IPAddressType",
+          "documentation":"<p>The IP address type configured for the notebook instance. Returns <code>ipv4</code> for IPv4-only connectivity or <code>dualstack</code> for both IPv4 and IPv6 connectivity.</p>"
+        },
         "SubnetId":{
           "shape":"SubnetId",
           "documentation":"<p>The ID of the VPC subnet.</p>"
@@ -18639,7 +18683,7 @@
         },
         "Status":{
           "shape":"PartnerAppStatus",
-          "documentation":"<p>The status of the SageMaker Partner AI App.</p>"
+          "documentation":"<p>The status of the SageMaker Partner AI App.</p> <ul> <li> <p>Creating: SageMaker AI is creating the partner AI app. The partner AI app is not available during creation.</p> </li> <li> <p>Updating: SageMaker AI is updating the partner AI app. The partner AI app is not available when updating.</p> </li> <li> <p>Deleting: SageMaker AI is deleting the partner AI app. The partner AI app is not available during deletion.</p> </li> <li> <p>Available: The partner AI app is provisioned and accessible.</p> </li> <li> <p>Failed: The partner AI app is in a failed state and isn't available. SageMaker AI is investigating the issue. For further guidance, contact Amazon Web Services Support.</p> </li> <li> <p>UpdateFailed: The partner AI app couldn't be updated but is available.</p> </li> <li> <p>Deleted: The partner AI app is permanently deleted and not available.</p> </li> </ul>"
         },
         "CreationTime":{
           "shape":"Timestamp",
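
The expanded `Status` documentation above states, for each of the seven `PartnerAppStatus` values, whether the partner AI app is usable. The Python sketch below transcribes that list into a lookup table. The names `PARTNER_APP_AVAILABILITY` and `is_partner_app_available` are hypothetical and serve only to illustrate the documented semantics.

```python
# Availability per status, transcribed from the updated documentation string.
PARTNER_APP_AVAILABILITY = {
    "Creating": False,      # not available during creation
    "Updating": False,      # not available when updating
    "Deleting": False,      # not available during deletion
    "Available": True,      # provisioned and accessible
    "Failed": False,        # failed state, not available
    "UpdateFailed": True,   # update failed, but the app is still available
    "Deleted": False,       # permanently deleted
}


def is_partner_app_available(status: str) -> bool:
    """Hypothetical helper: report whether an app in `status` is usable."""
    try:
        return PARTNER_APP_AVAILABILITY[status]
    except KeyError:
        raise ValueError(f"unknown PartnerAppStatus: {status!r}") from None


if __name__ == "__main__":
    for name in PARTNER_APP_AVAILABILITY:
        print(name, is_partner_app_available(name))
```

The case worth noting is `UpdateFailed`: according to the documentation, the app remains available even though the update did not succeed, so callers should not treat it like `Failed`.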
@@ -20316,7 +20360,7 @@
         },
         "RootlessDocker":{
           "shape":"FeatureStatus",
-          "documentation":"<p>Indicates whether to use rootless Docker. Default value is <code>DISABLED</code>.</p>"
+          "documentation":"<p>Indicates whether to use rootless Docker.</p>"
         }
       },
       "documentation":"<p>A collection of settings that configure the domain's Docker interaction.</p>"
@@ -24114,6 +24158,13 @@
       "box":true,
       "min":1
     },
+    "IPAddressType":{
+      "type":"string",
+      "enum":[
+        "ipv4",
+        "dualstack"
+      ]
+    },
     "IamIdentity":{
       "type":"structure",
       "members":{
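
The `IPAddressType` enum has two lowercase values, and the `IpAddressType` documentation on the notebook creation input earlier in this diff says that an omitted value defaults to `ipv4`. The Python sketch below shows how a caller might resolve the effective value before building a request. The function name `resolve_ip_address_type` is hypothetical, and the default is taken from the creation documentation only; the update documentation does not state one.

```python
from typing import Optional

# Enum values of the IPAddressType shape; they are lowercase in the model.
IP_ADDRESS_TYPES = ("ipv4", "dualstack")


def resolve_ip_address_type(requested: Optional[str] = None) -> str:
    """Hypothetical helper: return the effective IpAddressType for a create
    request, applying the documented default of "ipv4" when none is given."""
    if requested is None:
        return "ipv4"
    if requested not in IP_ADDRESS_TYPES:
        raise ValueError(
            f"IpAddressType must be one of {IP_ADDRESS_TYPES}, got {requested!r}"
        )
    return requested


if __name__ == "__main__":
    print(resolve_ip_address_type())
    print(resolve_ip_address_type("dualstack"))
```

This check covers only the enum itself. The documented requirement that the subnet support IPv6 when `dualstack` is chosen depends on the VPC configuration and cannot be verified from the request value alone.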
@@ -35887,7 +35938,7 @@
       "type":"string",
       "max":256,
       "min":0,
-      "pattern":"arn:aws[a-z\\-]*:sagemaker:[a-z0-9\\-]*:[0-9]{12}:processing-job/.*"
+      "pattern":"arn:aws[a-z\\-]*:sagemaker:[a-z0-9\\-]*:[0-9]{12}:processing-job/[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}"
     },
     "ProcessingJobName":{
       "type":"string",
@@ -36577,7 +36628,8 @@
         "ml.c6in.24xlarge",
         "ml.c6in.32xlarge",
         "ml.p6-b200.48xlarge",
-        "ml.p6e-gb200.36xlarge"
+        "ml.p6e-gb200.36xlarge",
+        "ml.p5.4xlarge"
       ]
     },
     "ProductionVariantList":{
@@ -41847,7 +41899,7 @@
       "type":"string",
       "max":256,
       "min":0,
-      "pattern":"arn:aws[a-z\\-]*:sagemaker:[a-z0-9\\-]*:[0-9]{12}:training-job/.*"
+      "pattern":"arn:aws[a-z\\-]*:sagemaker:[a-z0-9\\-]*:[0-9]{12}:training-job/[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}"
     },
     "TrainingJobDefinition":{
       "type":"structure",
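
Both ARN pattern changes above (processing-job and training-job) replace a trailing `.*` with an explicit name rule: one alphanumeric character followed by up to 62 more, with hyphens allowed only between them. The Python sketch below compares the old and new patterns. It assumes the pattern must match the whole string, which this diff does not state, and the helper name `arn_patterns` is hypothetical.

```python
import re


def arn_patterns(resource: str):
    """Hypothetical helper: return (old, new) compiled patterns for a SageMaker
    ARN of the given resource type, e.g. "processing-job" or "training-job".
    The regex text is the JSON pattern from the diff with JSON escaping removed."""
    prefix = r"arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:" + resource + "/"
    old = re.compile(prefix + r".*")
    new = re.compile(prefix + r"[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}")
    return old, new


if __name__ == "__main__":
    old, new = arn_patterns("training-job")
    base = "arn:aws:sagemaker:us-east-1:123456789012:training-job/"
    for name in ("my-job-1", "my_job", "job-", "-job", "a" * 63, "a" * 64):
        arn = base + name
        # fullmatch encodes the whole-string assumption stated above.
        print(f"{name[:12]!r:16} old={bool(old.fullmatch(arn))} "
              f"new={bool(new.fullmatch(arn))}")
```

Under that assumption, the new pattern rejects names that the old one accepted: names containing characters other than letters, digits, and hyphens, names that begin or end with a hyphen, and names with more than 63 alphanumeric characters.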
@@ -43630,6 +43682,10 @@
           "shape":"ClusterRestrictedInstanceGroupSpecifications",
           "documentation":"<p>The specialized instance groups for training models like Amazon Nova to be created in the SageMaker HyperPod cluster.</p>"
         },
+        "TieredStorageConfig":{
+          "shape":"ClusterTieredStorageConfig",
+          "documentation":"<p>Updates the configuration for managed tier checkpointing on the HyperPod cluster. For example, you can enable or disable the feature and modify the percentage of cluster memory allocated for checkpoint storage.</p>"
+        },
         "NodeRecovery":{
           "shape":"ClusterNodeRecovery",
           "documentation":"<p>The node recovery mode to be applied to the SageMaker HyperPod cluster.</p>"
@@ -44649,6 +44705,10 @@
           "shape":"InstanceType",
           "documentation":"<p>The Amazon ML compute instance type.</p>"
         },
+        "IpAddressType":{
+          "shape":"IPAddressType",
+          "documentation":"<p>The IP address type for the notebook instance. Specify <code>ipv4</code> for IPv4-only connectivity or <code>dualstack</code> for both IPv4 and IPv6 connectivity. The notebook instance must be stopped before updating this setting. When you specify <code>dualstack</code>, the subnet must support IPv6 addressing.</p>"
+        },
         "RoleArn":{
           "shape":"RoleArn",
           "documentation":"<p>The Amazon Resource Name (ARN) of the IAM role that SageMaker AI can assume to access the notebook instance. For more information, see <a href=\"https://docs.aws.amazon.com/sagemaker/latest/dg/sagemaker-roles.html\">SageMaker AI Roles</a>. </p> <note> <p>To be able to pass this role to SageMaker AI, the caller of this API must have the <code>iam:PassRole</code> permission.</p> </note>"
