Bump version to 1.16.1
guillaumekln committed Nov 23, 2020
1 parent 7c54f53 commit 129047e
Showing 2 changed files with 10 additions and 1 deletion.
9 changes: 9 additions & 0 deletions CHANGELOG.md
@@ -4,6 +4,15 @@

### Fixes and improvements

## [v1.16.1](https://github.com/OpenNMT/CTranslate2/releases/tag/v1.16.1) (2020-11-23)

### Fixes and improvements

* Fuse dequantization and bias addition on GPU for improved INT8 performance
* Improve performance of masked softmax on GPU
* Fix error when building the CentOS 7 GPU Docker image
* The previous version listed "Pad size of INT8 matrices to a multiple of 16 when the GPU has INT8 Tensor Cores". However, the padding was not applied due to a bug and fixing it degraded the performance, so this behavior is not implemented for now.

## [v1.16.0](https://github.com/OpenNMT/CTranslate2/releases/tag/v1.16.0) (2020-11-18)

### Changes
2 changes: 1 addition & 1 deletion python/setup.py
@@ -35,7 +35,7 @@ def _maybe_add_library_root(lib_name):

setup(
name="ctranslate2",
-version="1.16.0",
+version="1.16.1",
license="MIT",
description="Fast inference engine for OpenNMT models",
long_description=_get_long_description(),