Bump version to 1.16.1

OpenNMT · Nov 23, 2020 · 129047e
1 parent 7c54f53
commit 129047e

Showing 2 changed files with 10 additions and 1 deletion.
diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -4,6 +4,15 @@
 
 ### Fixes and improvements
 
+## [v1.16.1](https://github.com/OpenNMT/CTranslate2/releases/tag/v1.16.1) (2020-11-23)
+
+### Fixes and improvements
+
+* Fuse dequantization and bias addition on GPU for improved INT8 performance
+* Improve performance of masked softmax on GPU
+* Fix error when building the CentOS 7 GPU Docker image
+* The previous version listed "Pad size of INT8 matrices to a multiple of 16 when the GPU has INT8 Tensor Cores". However, the padding was not applied due to a bug and fixing it degraded the performance, so this behavior is not implemented for now.
+
 ## [v1.16.0](https://github.com/OpenNMT/CTranslate2/releases/tag/v1.16.0) (2020-11-18)
 
 ### Changes

diff --git a/python/setup.py b/python/setup.py
@@ -35,7 +35,7 @@ def _maybe_add_library_root(lib_name):
 
 setup(
     name="ctranslate2",
-    version="1.16.0",
+    version="1.16.1",
     license="MIT",
     description="Fast inference engine for OpenNMT models",
     long_description=_get_long_description(),
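
Note on the first changelog entry ("Fuse dequantization and bias addition"): the following is a minimal pure-Python sketch of the idea, not the actual CUDA kernel from this release. It assumes the INT8 matrix multiplication yields integer accumulators, that values were quantized by multiplying by a per-row input scale and a per-column weight scale, and that dequantization divides by the product of the two. All function and parameter names are hypothetical.

```python
def dequantize_then_add_bias(acc, input_scales, weight_scales, bias):
    """Unfused reference: two passes over the output matrix."""
    # Pass 1: convert integer accumulators back to floats.
    # Assumption: x_q = round(x * scale), so dequantization divides by the scales.
    dequantized = [
        [acc[i][j] / (input_scales[i] * weight_scales[j])
         for j in range(len(weight_scales))]
        for i in range(len(input_scales))
    ]
    # Pass 2: add the bias, re-reading every element written by pass 1.
    return [[value + b for value, b in zip(row, bias)] for row in dequantized]


def dequantize_add_bias_fused(acc, input_scales, weight_scales, bias):
    """Fused variant: each output element is read and written exactly once."""
    return [
        [acc[i][j] / (input_scales[i] * weight_scales[j]) + bias[j]
         for j in range(len(weight_scales))]
        for i in range(len(input_scales))
    ]


# Illustrative values only (not taken from the library or its tests).
acc = [[127, -254], [0, 508]]
input_scales = [2.0, 4.0]
weight_scales = [127.0, 63.5]
bias = [0.5, -1.0]

unfused = dequantize_then_add_bias(acc, input_scales, weight_scales, bias)
fused = dequantize_add_bias_fused(acc, input_scales, weight_scales, bias)
```

Both functions compute the same values. On a GPU, the fused form would plausibly save one kernel launch and one round trip through memory for the intermediate float matrix, which is a likely source of the INT8 speedup the changelog mentions.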