Skip to content

Commit 12a31d2

Browse files
committed
Bump version to 3.18.0
1 parent b7b59f8 commit 12a31d2

2 files changed

Lines changed: 32 additions & 1 deletion

File tree

CHANGELOG.md

Lines changed: 31 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,37 @@
44

55
### Fixes and improvements
66

7+
## [v3.18.0](https://github.com/OpenNMT/CTranslate2/releases/tag/v3.18.0) (2023-08-03)
8+
9+
### Changes
10+
11+
Converted models now uses the same floating point precision as the original models. For example, a model saved in float16 will be converted to a float16 model. Before this change, the weights were casted to float32 by default.
12+
13+
Similarly, selecting int8 keeps non quantized weights in their original precision unless a more specific quantization type is selected:
14+
15+
* int8_float32
16+
* int8_float16
17+
* int8_bfloat16
18+
19+
### New features
20+
21+
* Add property `compute_type` to model instances
22+
* Extend the Python class `StorageView` with additional methods and properties:
23+
* `to(dtype)`
24+
* `device_index`
25+
* `device`
26+
* `dtype`
27+
* `shape`
28+
29+
### Fixes and improvements
30+
31+
* Update the function `get_supported_compute_types` to correctly return bfloat16 when supported
32+
* Update the HF Llama converter to accept extra tokens in the vocabulary
33+
* Fix a shape error when enabling `return_alternatives` with a model using relative positions
34+
* Fix a conversion error when using `torch<1.13`
35+
* Fix a type error when running Whisper models with the bfloat16 type
36+
* Update pybind11 to 2.11.1
37+
738
## [v3.17.1](https://github.com/OpenNMT/CTranslate2/releases/tag/v3.17.1) (2023-07-20)
839

940
### Fixes and improvements

python/ctranslate2/version.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,3 @@
11
"""Version information."""
22

3-
__version__ = "3.17.1"
3+
__version__ = "3.18.0"

0 commit comments

Comments
 (0)