You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add per-sample gradient norm computation as a functionality (#724)
Summary:
Pull Request resolved: #724
Per-sample gradient norm is computed for Ghost Clipping, but it can be useful generally. Exposed it as a functionality.
```
...
loss.backward()
per_sample_norms = model.per_sample_gradient_norms
```
Reviewed By: iden-kalemaj
Differential Revision: D68634969
fbshipit-source-id: 7d5cb8a05de11d7492d3c1ae7f7384243cc03c73
0 commit comments