Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix forward batch #1572

Merged
merged 1 commit into from
Dec 4, 2023
Merged

Conversation

minhthuc2502
Copy link
Collaborator

A bug generated after the dev on Mistral and the optimization in the rotary embedding. In case of forward batch, step initialized is -1 and there is any cache used. It cause the bad calculation for the offset in the rotary embedding and unexpected usage of cache.

@minhthuc2502
Copy link
Collaborator Author

minhthuc2502 commented Dec 4, 2023

reported in this issue #1570

@minhthuc2502 minhthuc2502 merged commit 01cf79d into OpenNMT:master Dec 4, 2023
17 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant