Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adjust tf/fedopt_ctl to include updates for the model's non-trainable #3058

Merged
merged 10 commits into from
Nov 25, 2024

Conversation

falibabaei
Copy link
Contributor

Hi,

tf/fedopt_ctl does not handle models with non-trainable variables, causing training to fail with an error about mismatched gradients and variables.

Copy link
Collaborator

@YuanTingHsieh YuanTingHsieh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

one question, otherwise LGTM, please verify this with one of the example you already have, thanks!

@YuanTingHsieh
Copy link
Collaborator

@falibabaei can you run the code refactor and make sure the runtest.sh is passing? thanks

@falibabaei
Copy link
Contributor Author

done :)

holgerroth
holgerroth previously approved these changes Nov 22, 2024
Copy link
Collaborator

@holgerroth holgerroth left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good!

@holgerroth holgerroth enabled auto-merge (squash) November 22, 2024 18:53
@holgerroth
Copy link
Collaborator

/build

@holgerroth holgerroth disabled auto-merge November 22, 2024 19:16
@YuanTingHsieh
Copy link
Collaborator

/build

@YuanTingHsieh YuanTingHsieh merged commit ee0c093 into NVIDIA:main Nov 25, 2024
20 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants