-
Notifications
You must be signed in to change notification settings - Fork 538
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upgrade to v0.10.0 #1427
Open
hlahkar
wants to merge
399
commits into
mosaicml:habana_alpha
Choose a base branch
from
hlahkar:version_0.10.0_hpu
base: habana_alpha
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Upgrade to v0.10.0 #1427
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…al when the checkpoint is ready (mosaicml#813) * working without sharded checkpointing.. * add more debugs * try this * more debugging * yikes dumb bug * add notes * fixes * remove prints * small updates * fix typo * refactor * fix docstring formatting * fighting with docstrings * try this * add unit tests * point to composer update * values -> items * serialize time * fix merge * nits * warning, small comment update * add error --------- Co-authored-by: Daniel King <[email protected]>
Co-authored-by: Daniel King <[email protected]>
* if condition in tie weights added * unit test for tie weights
* add oom observer callback * fix format
This reverts commit e3f214e.
Co-authored-by: Irene Dea <[email protected]>
…g; Make ComposerHFT5 experimental (mosaicml#1007) * Deprecate features and mark experimental * fix typo --------- Co-authored-by: Daniel King <[email protected]>
* add magic filename for sharded state dicts * Update scripts/train/train.py Co-authored-by: Daniel King <[email protected]> * oops forgot to push this * no shard if no fsdp * default to full on foundry --------- Co-authored-by: Daniel King <[email protected]>
* fix bug on metrics * lint * lint * add unit test * lint
Red button because CI running jobs it doesn't need. Tests passed on main.
Co-authored-by: Daniel King <[email protected]>
* add retry * pyright * slight refactor --------- Co-authored-by: v-chen_data <[email protected]>
* bumping mlflow version to include buffering * capping at mlflow 2.15
…#1301) * ignore logger if excephook is active * remove logger in data scripts and callback * undo format of imports * moved env var check into helper * formatted * removed import format * added docstring * ran pre-commit --------- Co-authored-by: Daniel King <[email protected]>
…cml#1315) * Update config_utils.py * lint
…iding window, reuse prev layer kv cache etc. (mosaicml#1299) * [WIP] Allows interweaving of arbitrary kinds of 'attention' layers, like RNN, sliding window etc. * lint * applying overrides to blocks rather than just attentions * add docstring * minor * changing yaml specification style * .. * fixes * fix * fix * fix * refactoring * add warning * compute only query vector when reusing kv * refactor * fixing * adding test for reusing previous layer kv cache * adding error messages * .. * adding test * add logging * adding logging * minor * bug fix, adding test * minor * addressing some comments * addressing some comments * setting absolute absolute value for reuse_kv_layer_idx * lint * adding tests for override_block_args * adding error if reusing kv cache from a mismatch layer * fixing test * fixing code, test * fix * .. * refactoring * fix * .. * .. * .. * refactoring * .. * .. * .. * adding test for _get_modules_order_expanded * fixing test * fixing test * lint * lint * adding test * addressing comment * .. * fixing test * changing yaml format * fix configuation * fixing test * allowing repeat at top level * allowing overriding error * addressing comments * lint * addressing comments * fix * .. * .. * .. * .. * .. * addressing comment * fixing test
--------- Co-authored-by: Mihir Patel <[email protected]> Co-authored-by: Daniel King <[email protected]>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR upgrades the Habana support to llm-foundry v0.10.0