bump version 4.4.0 (#1774)
* bump version 4.4.0

* fix changelog

* fix wrong commit
minhthuc2502 authored Sep 9, 2024
1 parent f89fa2b commit 8f4d134
Showing 2 changed files with 18 additions and 1 deletion.
17 changes: 17 additions & 0 deletions CHANGELOG.md
@@ -4,6 +4,23 @@

### Fixes and improvements

## [v4.4.0](https://github.com/OpenNMT/CTranslate2/releases/tag/v4.4.0) (2024-09-09)
**Removed**: Flash Attention support in the Python package due to significant package size increase with minimal performance gain.
Note: Flash Attention remains supported in the C++ package with the `WITH_FLASH_ATTN` option.
Flash Attention may be re-added in the future if substantial improvements are made.

### New features
* Support Llama3 (#1751)
* Support Gemma2 (#1772)
* Add log probs for all tokens in vocab (#1755)
* Grouped conv1d (#1749 + #1758)

### Fixes and improvements
* Fix pipeline (#1723 + #1747)
* Some improvements in flash attention (#1732)
* Fix crash when using return_alternative on CUDA (#1733)
* Quantization AWQ GEMM + GEMV (#1727)

## [v4.3.1](https://github.com/OpenNMT/CTranslate2/releases/tag/v4.3.1) (2024-06-10)
Note: Because the project's total size on PyPI exceeded the allowed limit (> 20 GB), release v4.3.0 could not be pushed.

2 changes: 1 addition & 1 deletion python/ctranslate2/version.py
@@ -1,3 +1,3 @@
"""Version information."""

-__version__ = "4.3.1"
+__version__ = "4.4.0"
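
Since this release drops Flash Attention from the Python package, downstream code may want to branch on the version string bumped here. The following is only a minimal sketch: the helper names `parse_version` and `flash_attention_removed` are hypothetical and not part of CTranslate2, and the parsing assumes a plain dotted numeric version such as `4.4.0` (no pre-release suffixes).

```python
def parse_version(version):
    """Turn a dotted numeric version string such as "4.4.0" into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def flash_attention_removed(version):
    """Hypothetical helper: True if `version` is 4.4.0 or later, i.e. at or
    after the release whose changelog removes Flash Attention from the
    Python package."""
    return parse_version(version) >= (4, 4, 0)


if __name__ == "__main__":
    # Compare the versions on either side of this bump.
    for candidate in ("4.3.1", "4.4.0"):
        print(candidate, flash_attention_removed(candidate))
```

In practice the argument would likely be `ctranslate2.__version__`, the string defined in `python/ctranslate2/version.py`. Comparing integer tuples rather than raw strings keeps a version like `4.10.0` ordered after `4.4.0`.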
