What's Changed
Bug Fixes
- Fix error code for empty requests by @vrdn-23 in #727
- Fix the infinite loop when
max_input_lengthis bigger thanmax-batch-tokensby @kozistr in #725 - Fix reading
modules.jsonforDensemodules in local models by @alvarobartt in #738
Tests, Documentation & Release
- Add
test_gemma3.rsfor EmbeddingGemma by @alvarobartt in #718 - Fix OpenAI client usage example for embeddings by @ZahraDehghani99 in #720
- Handle
HF_TOKENinApiBuilderforcandle/testsby @alvarobartt in #724 - Fix
cargo installcommands forcandlewith CUDA by @alvarobartt in #719 - Update
versionto 1.8.3 by @alvarobartt in #745
New Contributors
- @ZahraDehghani99 made their first contribution in #720
- @vrdn-23 made their first contribution in #727
Full Changelog: v1.8.2...v1.8.3