Releases: intel/intel-optimization-for-horovod
v0.28.1.6
Intel® Optimization for Horovod* v0.28.1.6 Release Notes
Major Features and Improvements
Intel® Optimization for Horovod* is Intel's optimized distributed training framework, extending the official Horovod (based on v0.28.1) to run TensorFlow workloads on Intel GPU clusters. This release contains the following major features:
- Supports Intel® oneAPI Base Toolkit 2025.0.1.
- Supports TensorFlow 2.15.1 and Intel® Extension for TensorFlow* v2.15.0.2.
- Enables TensorFlow NextPluggableDevice mode by default for Intel devices.
- Supports both scale-up and scale-out on Intel® Data Center Max GPU clusters.
- Fixes potential overflow of displacement arrays for large numbers of ranks and message sizes.
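The displacement-overflow fix can be illustrated with a small sketch. In a variable-length collective (e.g. an allgather with per-rank sizes), each rank's displacement is the running sum of the preceding ranks' byte counts; with enough ranks and large enough messages, that sum no longer fits in a 32-bit integer. The sizes below are hypothetical and only show why 64-bit displacement arrays are needed:

```python
import numpy as np

# Hypothetical sizes: 1000 ranks, 4 MiB per rank. The total payload
# (1000 * 4 MiB ~= 4.19 GB) exceeds the 32-bit signed integer range.
num_ranks = 1000
msg_bytes = 4 * 1024 * 1024

counts = np.full(num_ranks, msg_bytes)

# 32-bit displacements silently wrap past 2**31 - 1 ...
displs32 = np.cumsum(counts.astype(np.int32), dtype=np.int32)

# ... while 64-bit displacements stay exact.
displs64 = np.cumsum(counts.astype(np.int64), dtype=np.int64)

print(int(displs32[-1]))  # negative: the running sum overflowed
print(int(displs64[-1]))  # 4194304000
```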
v0.28.1.5
Intel® Optimization for Horovod* v0.28.1.5 Release Notes
Major Features and Improvements
Intel® Optimization for Horovod* is Intel's optimized distributed training framework, extending the official Horovod (based on v0.28.1) to run TensorFlow workloads on Intel GPU clusters. This release contains the following major features:
- Supports Intel® oneAPI Base Toolkit 2024.2.1.
- Supports TensorFlow 2.15.1 and Intel® Extension for TensorFlow* v2.15.0.1.
- Enables TensorFlow NextPluggableDevice mode by default for Intel devices.
- Updates the usage of in-place `ccl::reduce_scatter` for better performance.
- Supports both scale-up and scale-out on Intel® Data Center Max GPU clusters.
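For reference, a reduce-scatter sums a buffer elementwise across all ranks and leaves each rank holding one shard of the summed result, which is why the in-place variant can reuse the input buffer for its output shard. A minimal single-process sketch of these semantics — not the oneCCL implementation — assuming the buffer length is divisible by the number of ranks:

```python
def reduce_scatter(buffers):
    """Simulate reduce-scatter: elementwise sum across ranks, then shard."""
    n = len(buffers)                                # number of ranks
    shard = len(buffers[0]) // n                    # elements each rank keeps
    total = [sum(vals) for vals in zip(*buffers)]   # elementwise sum
    return [total[r * shard:(r + 1) * shard] for r in range(n)]

# Two ranks, four elements each: rank 0 keeps the first half of the sum,
# rank 1 the second half.
print(reduce_scatter([[1, 2, 3, 4], [10, 20, 30, 40]]))
# [[11, 22], [33, 44]]
```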
v0.28.1.4
Intel® Optimization for Horovod* v0.28.1.4 Release Notes
Major Features and Improvements
Intel® Optimization for Horovod* is Intel's optimized distributed training framework, extending the official Horovod (based on v0.28.1) to run TensorFlow workloads on Intel GPU clusters. This release contains the following major features:
- Supports Intel® oneAPI Base Toolkit 2024.1.
- Supports TensorFlow 2.15 and Intel® Extension for TensorFlow* v2.15.0.0.
- Integrates TensorFlow NextPluggableDevice as a new device type and implements XLA Horovod ops on the Intel GPU backend for the OpenXLA ecosystem.
- Supports both scale-up and scale-out on the Intel® Data Center Max GPU clusters.
Known Issues
- Scale-out tasks may hang due to a oneCCL bug in Intel® oneAPI Base Toolkit 2024.1. Please use Intel® oneAPI Base Toolkit 2024.0 when running scale-out tasks.
Intel® Optimization for Horovod* 0.28.1.2
Major Features and Improvements
Intel® Optimization for Horovod* is Intel's optimized distributed training framework, extending the official Horovod (based on v0.28.1) to run TensorFlow workloads on Intel GPU clusters. This release contains the following major features:
- Supported the `TorusAllreduce` operation for cross-node `AllReduce`. This collective is built on the oneAPI Collective Communications Library (oneCCL), whose inter-GPU communication primitives are topology-aware and provide accelerated inter-GPU communication.
- Supported TensorFlow 2.14.0 and Intel® Extension for TensorFlow* v2.14.0.0.
- Supported scale-up and scale-out on Intel® Data Center Max GPU clusters.
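To give a feel for what a topology-aware allreduce does, here is a toy single-process simulation of the classic ring allreduce (a reduce-scatter phase followed by an allgather phase). This only sketches the general idea behind such collectives; oneCCL's actual `TorusAllreduce` is a different, hardware-topology-specific algorithm:

```python
def ring_allreduce(buffers):
    """Sum-allreduce equal-length buffers across simulated ring ranks."""
    n = len(buffers)
    data = [list(b) for b in buffers]
    m = len(data[0]) // n  # chunk length (buffer length divisible by n)

    def sl(c):
        i = (c % n) * m
        return slice(i, i + m)

    # Phase 1: reduce-scatter. In step s, rank r sends chunk (r - s) to
    # rank r+1, which accumulates it. After n-1 steps, rank r holds the
    # fully reduced chunk (r + 1).
    for s in range(n - 1):
        sends = [(r, r - s, data[r][sl(r - s)]) for r in range(n)]
        for r, c, payload in sends:
            dst = (r + 1) % n
            data[dst][sl(c)] = [a + b for a, b in zip(data[dst][sl(c)], payload)]

    # Phase 2: allgather. Each rank forwards its completed chunk around
    # the ring until every rank holds every reduced chunk.
    for s in range(n - 1):
        sends = [(r, r + 1 - s, data[r][sl(r + 1 - s)]) for r in range(n)]
        for r, c, payload in sends:
            data[(r + 1) % n][sl(c)] = payload
    return data

print(ring_allreduce([[1, 2, 3], [4, 5, 6], [7, 8, 9]]))
# [[12, 15, 18], [12, 15, 18], [12, 15, 18]]
```

Each rank sends and receives only fixed-size chunks to its ring neighbor, which is what makes this family of algorithms bandwidth-efficient at scale.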
Documentation to get started
Intel® Optimization for Horovod* 0.28.1.0
Major Features and Improvements
Intel® Optimization for Horovod* is Intel's optimized distributed training framework, extending the official Horovod (based on v0.28.1) to run TensorFlow workloads on Intel GPU clusters. This release contains the following major features:
- Rebased Intel® Optimization for Horovod* onto the latest stock Horovod v0.28.1. The main changes in this rebase include:
  - Fixed Horovod API compatibility issues with tf.keras 2.11; the HVD wrapper for the Keras optimizer now works correctly without the `legacy` limitation.
  - Supported the newly implemented `reducescatter` methodology and enabled batched memory copy for `allgather`/`reducescatter`.
- Refined the Intel® Optimization for Horovod* version to a four-digit format (v0.28.1.0): the first three digits come from the stock Horovod version (v0.28.1) and the last digit starts at 0 and increments with each Intel release. This makes the mapping between Intel® Optimization for Horovod* and stock Horovod versions easier to follow.
- Supported TensorFlow 2.13.0 and Intel® Extension for TensorFlow* v2.13.0.0 in Intel® Optimization for Horovod*.
- Supported scale-up and scale-out on Intel® Data Center Max GPU clusters.
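The version-mapping scheme above can be expressed mechanically; the helper below is purely illustrative and not part of the package:

```python
def stock_horovod_version(ioh_version: str) -> str:
    """Map a four-digit Intel® Optimization for Horovod* version to the
    stock Horovod release it is based on (the first three digits)."""
    parts = ioh_version.lstrip("v").split(".")
    if len(parts) != 4:
        raise ValueError("expected a four-digit version like v0.28.1.0")
    return ".".join(parts[:3])

print(stock_horovod_version("v0.28.1.0"))  # 0.28.1
```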
Documentation to get started
Intel® Optimization for Horovod* 0.5.0
Major Features and Improvements
Intel® Optimization for Horovod* is Intel's optimized distributed training framework, extending the official Horovod (based on v0.26.1) to run TensorFlow workloads on Intel GPU clusters. This release contains the following major features:
- Enabled `All2All`, (grouped) `AllGather`, (grouped) `ReduceScatter`, and `BroadcastInplace(Resource)` operations for TensorFlow on Intel® Data Center Max GPU clusters. These collectives are built on the oneAPI Collective Communications Library (oneCCL), whose inter-GPU communication primitives are topology-aware and provide accelerated inter-GPU communication.
- Switched the CXX compiler from `dpcpp` to `icpx`; see the new source code build command in how to build.
- Supported TensorFlow 2.12 and Intel® Extension for TensorFlow* v1.2.
- Supported scale-up and scale-out on Intel® Data Center Max GPU clusters.
Documentation to get started
Intel® Optimization for Horovod* 0.4.0
Major Features
Intel® Optimization for Horovod* is Intel's optimized distributed training framework, extending the official Horovod to run TensorFlow and PyTorch workloads on Intel GPU clusters. It is based on the latest public Horovod release, v0.26.1.
This release contains the following major features:
- Enabled `AllReduce`/`GroupedAllreduce`/`BroadCast` operations for TensorFlow and PyTorch on Intel® Data Center Max GPU Series clusters. These collectives are built on the Intel® oneAPI Collective Communications Library (oneCCL), whose inter-GPU communication primitives are topology-aware and provide accelerated inter-GPU communication.
- Supported scale-up and scale-out on Intel® Data Center Max GPU Series clusters.
- Works with Intel® Extension for TensorFlow* v1.1 and Intel® Extension for PyTorch* v1.13.10+xpu.