Skip to content

Enable observe training entropy without computing entropy loss#1464

Open
zyzshishui wants to merge 1 commit into
radixark:mainfrom
zyzshishui:observe_entropy
Open

Enable observe training entropy without computing entropy loss#1464
zyzshishui wants to merge 1 commit into
radixark:mainfrom
zyzshishui:observe_entropy

Conversation

@zyzshishui

Copy link
Copy Markdown
Contributor

Motivation

Algorithm bros want to observe entropy from the training forward pass even when the entropy is not included in calculating loss. The previous --entropy-coef 0.0 path skipped training entropy entirely, leaving only rollout-time entropy available for dashboards.

Usage

--entropy-coef 0.0 --observe-training-entropy

This logs entropy_loss from the training logits as a detached metric. It does not add the entropy to the loss and does not attach the observed entropy to backward.

@gemini-code-assist

Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant