AWS Neuron Deep Learning Containers (DLCs) are a set of Docker images for training and serving models on AWS Trainium and Inferentia instances using AWS Neuron SDK. For more documentation, please refer to Neuron Containers Overview.
To build the Dockerfiles successfully, ensure the files under the common/
directory is in the same parent folder as the Dockerfile. The common directory contains shared scripts and configuration files required by all container builds.
To build a specific container, navigate to the repository root and use Docker build with the appropriate context:
# Example: Build PyTorch 2.7.0 inference container
docker build -f docker/pytorch/inference/2.7.0/Dockerfile.neuronx -t my-neuron-container .
You can customize the containers by:
- Modifying existing Dockerfiles: Edit the Dockerfile to add additional packages or configurations
- Creating derived images: Use the pre-built containers as base images for your custom requirements
- Adding custom scripts: Place additional scripts and reference them in your Dockerfile
For detailed customization examples and best practices, refer to the Neuron DLC Customization Guide.
Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL | Other Packages |
---|---|---|---|---|---|---|
PyTorch 1.13.1 | aws-neuronx-tools, torch-neuron | Neuron 2.20.2 | inf1 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuron:1.13.1-neuron-py310-sdk2.20.2-ubuntu20.04 | torchserve 0.11.0 |
Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL | Other Packages |
---|---|---|---|---|---|---|
PyTorch 2.7.0 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, transformers-neuronx | Neuron 2.25.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.7.0-neuronx-py310-sdk2.25.0-ubuntu22.04 | torchserve 0.11.0 |
PyTorch 2.6.0 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, transformers-neuronx | Neuron 2.23.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.6.0-neuronx-py310-sdk2.23.0-ubuntu22.04 | torchserve 0.11.0 |
PyTorch 2.5.1 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, transformers-neuronx | Neuron 2.22.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.5.1-neuronx-py310-sdk2.22.0-ubuntu22.04 | torchserve 0.11.0 |
PyTorch 2.1.2 | aws-neuronx-tools, neuronx_distributed, torch-neuronx, transformers-neuronx | Neuron 2.20.2 | trn1,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuronx:2.1.2-neuronx-py310-sdk2.20.2-ubuntu20.04 | torchserve 0.11.0 |
PyTorch 1.13.1 | aws-neuronx-tools, neuronx_distributed, torch-neuronx, transformers-neuronx | Neuron 2.20.2 | trn1,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-neuronx:1.13.1-neuronx-py310-sdk2.20.2-ubuntu20.04 | torchserve 0.11.0 |
Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL |
---|---|---|---|---|---|
PyTorch 2.7.0 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.25.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-training-neuronx:2.7.0-neuronx-py310-sdk2.25.0-ubuntu22.04 |
PyTorch 2.6.0 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.23.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-training-neuronx:2.6.0-neuronx-py310-sdk2.23.0-ubuntu22.04 |
PyTorch 2.5.1 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.22.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-training-neuronx:2.5.1-neuronx-py310-sdk2.22.0-ubuntu22.04 |
PyTorch 2.1.2 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.20.2 | trn1,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-training-neuronx:2.1.2-neuronx-py310-sdk2.20.2-ubuntu20.04 |
PyTorch 1.13.1 | aws-neuronx-tools, neuronx_distributed, neuronx_distributed_training, torch-neuronx | Neuron 2.20.2 | trn1,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-training-neuronx:1.13.1-neuronx-py310-sdk2.20.2-ubuntu20.04 |
Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL | Other Packages |
---|---|---|---|---|---|---|
JAX 0.6 | jax-neuronx, libneuronxla | Neuron 2.25.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/jax-training-neuronx:0.6-neuronx-py310-sdk2.25.0-ubuntu22.04 | jaxlib 0.6 |
JAX 0.5 | jax-neuronx, libneuronxla | Neuron 2.23.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/jax-training-neuronx:0.5-neuronx-py310-sdk2.23.0-ubuntu22.04 | jaxlib 0.5 |
Framework | Neuron Packages | Neuron SDK Version | Supported EC2 Instance Types | Python Version Options | ECR Public URL |
---|---|---|---|---|---|
VLLM 0.9 | vllm-neuronx, libneuronxla, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, aws-neuronx-tools | Neuron 2.25.0 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-vllm-neuronx:0.9.1-neuronx-py310-sdk2.25.0-ubuntu22.04 |
VLLM 0.7 | vllm-neuronx, libneuronxla, neuronx_distributed, neuronx_distributed_inference, torch-neuronx, aws-neuronx-tools | Neuron 2.24.1 | trn1,trn2,inf2 | 3.10 (py310) | public.ecr.aws/neuron/pytorch-inference-vllm-neuronx:0.7.2-neuronx-py310-sdk2.24.1-ubuntu22.04 |
See SECURITY for more information.
This project is licensed under the Apache-2.0 License.