|
11 | 11 | ### Features:
|
12 | 12 | ### Bugfixes:
|
13 | 13 |
|
14 |
| -## 1.16.0-rc5 (April 02, 2024) |
15 |
| -### Features: |
16 |
| -#### UCS |
17 |
| -* Added support for PCIe gen5 bandwidth detection |
18 |
| -### Bugfixes: |
19 |
| -#### UCP |
20 |
| -* Fixed rndv_put transport selection for device to device scenario |
21 |
| -#### RDMA CORE (IB, ROCE, etc.) |
22 |
| -* Disabled MR multithreading registration |
23 |
| - |
24 |
| -## 1.16.0-rc4 (February 21, 2024) |
25 |
| -### Bugfixes: |
26 |
| -#### UCP |
27 |
| -* Disabled rendezvous pipeline protocol selection when using non-contiguous buffer |
28 |
| -#### RDMA CORE (IB, ROCE, etc.) |
29 |
| -* Fixed mlx5 WQE posting error due to compiler memory copy optimizations |
30 |
| -#### GPU (CUDA, ROCM) |
31 |
| -* Fixed cuda_ipc transport being disabled if a CUDA device is not set during initialization |
32 |
| -#### UCM |
33 |
| -* Fixed compilation error when building on PPC64 |
34 |
| -#### Packaging |
35 |
| -* Fixed already existing target error when using cmake find_package(ucx) twice |
36 |
| - |
37 |
| -## 1.16.0-rc3 (February 20, 2024) |
38 |
| -### Bugfixes: |
39 |
| -#### UCP |
40 |
| -* Fixed crash in rendezvous protocol rkey pack after failed memory registration |
41 |
| - |
42 |
| -## 1.16.0-rc2 (January 21, 2024) |
| 14 | +## 1.16.0 (April 15, 2024) |
43 | 15 | ### Features:
|
44 | 16 | #### UCP
|
45 | 17 | * Added tag offload rendezvous protocol in new infrastructure
|
|
86 | 58 | * Added support for VLAN over channel bonding interface
|
87 | 59 | * Added LRU cache and Usage Tracker data structures
|
88 | 60 | * Improved cross-NUMA device detection
|
| 61 | +* Added support for PCIe gen5 bandwidth detection |
89 | 62 | #### Build
|
90 | 63 | * Added LCOV coverage report as a build option
|
91 | 64 | * Added binutils 2.40 library dependencies
|
|
125 | 98 | * Fixed memory corruption by proper memh handling in tag offload rendezvous
|
126 | 99 | * Changed default allocator to not use reserved huge pages
|
127 | 100 | * Fixed rndv put protocol to avoid early completion
|
| 101 | +* Fixed rndv_put transport selection for device to device scenario |
| 102 | +* Disabled rendezvous pipeline protocol selection when using non-contiguous buffer |
| 103 | +* Fixed crash in rendezvous protocol rkey pack after failed memory registration |
128 | 104 | #### RDMA CORE (IB, ROCE, etc.)
|
129 | 105 | * Fixed compilation failure when DevX is explicitly disabled
|
130 | 106 | * Fixed crash when using PCIe relaxed ordering
|
|
133 | 109 | * Fixed assertion failure when configured with UCX_IB_ADDR_TYPE=ib_global
|
134 | 110 | * Fixed overwritten MD attribute capabilities when querying a device
|
135 | 111 | * Fixed ibv_reg_mr error by registering memory in rcache callback
|
| 112 | +* Disabled MR multithreading registration |
| 113 | +* Fixed mlx5 WQE posting error due to compiler memory copy optimizations |
136 | 114 | #### TCP
|
137 | 115 | * Fixed asymmetric lanes selection issue due to inconsistent device listing
|
138 | 116 | #### GPU (CUDA, ROCM)
|
139 | 117 | * Fixed compilation flags to support ROCm 6.0
|
140 | 118 | * Fixed values of D2H_THRESH and latency params
|
141 | 119 | * Fixed CUDA memory support for iov datatype
|
142 | 120 | * Increased max number of agents in ROCm
|
| 121 | +* Fixed cuda_ipc transport being disabled if a CUDA device is not set during initialization |
143 | 122 | #### Shared Memory
|
144 | 123 | * Fixed posix and cma transport selection by enhancing reachability checks
|
145 | 124 | * Fixed UGNI build failure
|
|
153 | 132 | * Fixed floating point division by zero during protocols initialization
|
154 | 133 | #### UCM
|
155 | 134 | * Fixed occasional crash in bistro hooks by adding a lock before hooking
|
| 135 | +* Fixed compilation error when building on PPC64 |
156 | 136 | #### Java
|
157 | 137 | * Fixed go tests by setting CUDA device before allocating CUDA memory
|
158 | 138 | * Fixed perftest error detection and hanging issue
|
|
164 | 144 | * Fixed ROCm building and testing
|
165 | 145 | * Removed libnvidia-compute version dependency
|
166 | 146 | * Removed libibmad/libumad from default build configuration to avoid runtime dependency
|
| 147 | +#### Packaging |
| 148 | +* Fixed already existing target error when using cmake find_package(ucx) twice |
167 | 149 |
|
168 | 150 | ## 1.15.0 (September 28, 2023)
|
169 | 151 | ### Features:
|
|