|
1 | 1 | # CHANGELOG
|
2 | 2 |
|
| 3 | +## torchx-0.5.0 |
| 4 | + |
| 5 | +* Milestone: https://github.com/pytorch/torchx/milestone/7 |
| 6 | + |
| 7 | +* `torchx.schedulers` |
| 8 | + * Kubernetes MCAD Scheduler (Prototype) |
| 9 | + * Newly added integration for easily scheduling jobs on Multi-Cluster-Application-Dispatcher (MCAD). |
| 10 | + * Features include: |
| 11 | + * scheduling different types of components including DDP components |
| 12 | + * scheduling on different compute resources (CPU, GPU) |
| 13 | + * support for docker workspace |
| 14 | + * support for bind, volume and device mounts |
| 15 | + * getting logs for jobs |
| 16 | + * describing, listing and cancelling jobs |
| 17 | + * can be used with a secondary scheduler on Kubernetes |
| 18 | + * AWS Batch |
| 19 | + * Add privileged option to enable running containers on EFA enabled instances with elevated networking permissions |
| 20 | + |
| 21 | +* `torchx.tracker` |
| 22 | + * MLflow backend (Prototype) |
| 23 | + * New support for MLFlow backend for torchx tracker |
| 24 | + * Add ability for fsspec tracker to read nested kwargs |
| 25 | + * Support for tracking apps not launched by torchx |
| 26 | + * Load tracker config from .torchxconfig |
| 27 | + |
| 28 | +* `torchx.components` |
| 29 | + * Add dist.spmd component to support Single-Process-Multiple-Data style applications |
| 30 | + |
| 31 | +* `torchx.workspace` |
| 32 | + * Add ability to access image and workspace path from Dockerfile while building docker workspace |
| 33 | + |
| 34 | +* Usability imporvements |
| 35 | + * Fix entrypoint loading to deal with deferred loading of modules to enable component registration to work properly |
| 36 | + |
| 37 | +* Changes to ease maintenance |
| 38 | + * Add ability to run integration tests for AWS Batch, Slurm, and Kubernetes, instead of running in a remote dedicated clusters. This makes the environment reproducible, reduces maintenance, and makes it easier for more users to contribute. |
| 39 | + |
| 40 | +* Additional changes |
| 41 | + * Bug fixes: Make it possible to launch jobs with more than 5 nodes on AWS Batch |
| 42 | + |
| 43 | + |
3 | 44 | ## torchx-0.4.0
|
4 | 45 |
|
5 | 46 | * Milestone: https://github.com/pytorch/torchx/milestone/6
|
|
0 commit comments