Skip to content

Latest commit

 

History

History
66 lines (43 loc) · 2.34 KB

CHANGELOG.md

File metadata and controls

66 lines (43 loc) · 2.34 KB

[Unreleased]

New features

Fixes and improvements

v1.2.2 (2019-11-25)

Fixes and improvements

  • Fix PositionEncoder internal state that was shared with other instances on the same thread
  • Replace Boost.Python by pybind11
  • Include a Python source distribution in the Docker images

v1.2.1 (2019-11-06)

Fixes and improvements

  • Avoid copying decoder states when possible to improve decoding performance (10% to 20% faster)
  • Fix execution profiling on GPU (device was not synchronized before measuring the time)
  • Include Mul operation in profiling report
  • Add a Python 3 wheel in Ubuntu Docker images

v1.2.0 (2019-10-28)

New features

  • Accept Transformer models with custom number of layers and heads
  • --log-profiling client option to profile ops execution

Fixes and improvements

  • Fix conversion error for models having 2 different weights with the same values
  • Fix invalid MKL function override after a refactoring
  • Add more information and context to several error messages

v1.1.0 (2019-10-18)

New features

  • New Docker images: latest-ubuntu16-gpu, latest-ubuntu18, latest-ubuntu18-gpu
  • Support OpenNMT-tf Transformer models with shared embeddings
  • Update to TensorRT 6
  • Make OpenMP runtime configurable

Fixes and improvements

  • Reduce the size of models with shared weights on disk and in memory
  • Shared words vocabulary is no longer duplicated on disk and in memory
  • Improve performance of translation with a vocabulary map on GPU
  • Statically link against Intel MKL
  • Remove some implementation details from public headers

v1.0.1 (2019-10-08)

Fixes and improvements

  • Fix loading of newer OpenNMT-py models
  • Promote FP16 to FP32 in model converter scripts
  • Improve INT8 performance on CPU and GPU
  • Improve performance on GPU by fusing the layer normalization operation x * gamma + beta
  • Enable INT8 and INT16 computation on all platforms with Intel MKL 2019.5 and above

v1.0.0 (2019-09-23)

First stable release.