Convert acceptance rate of nan to 0 in dual averaging #29

WardBrian · 2025-09-25T18:31:31Z

While looking at #26 @bob-carpenter noticed it was possible for a model to have the parameters end up as nan and stay there forever. I tracked it down to this sequence of events:

One particularly bad trajectory during warmup ends up with the returned log density being nan
This nan value is used to compute an acceptance probability, which also ends up nan
The step size dual averaging parameters end up getting 'poisoned' by this nan
The next iteration, the step size itself is nan, which means after one gradient step, the parameters will be too.

I believe Stan traps this behavior here:
https://github.com/stan-dev/stan/blob/develop/src/stan/mcmc/hmc/nuts/base_nuts.hpp#L259-L260

Rather than do exactly what they do, I've moved this to inside the dual averaging. This moves it out of the code that we're relying on dead code optimization to handle for us when using the NoOpHandler in non-adaptive walnuts.

I believe 0 is the correct value, working through what would happen if I did exactly what Stan did and set the bad energy value to inf, the acceptance probability would work out to exp(-inf) = 0

bob-carpenter

Just a slight generalization to the test.

bob-carpenter · 2025-09-25T18:51:22Z

include/walnuts/dual_average.hpp

   * @pre alpha > 0
   */
  inline void observe(S alpha) noexcept {
+    if (std::isnan(alpha)) {


I think this should add || std::isinf(alpha) test to catch the -infinity case, too. I don't think we should ever see +infinity, but it could be std::isinf(alpha) && alpha < 0 or I think we can actually just compare to -infinity.

More generally, we could condition to [0, 1] by mapping NaN and anything < 0 to 0 and anything > 1 to 1. I opened another issue to test his upper bounding for adaptation---as is, we pass in Metropolis ratios here, which can be greater than 1.

bob-carpenter

looks good. thanks!

Convert accpetance rate of nan to 0

6b396ec

bob-carpenter requested changes Sep 25, 2025

View reviewed changes

Use std::isfinite instead of isnan

22ff57b

WardBrian requested a review from bob-carpenter September 25, 2025 18:56

bob-carpenter approved these changes Sep 25, 2025

View reviewed changes

bob-carpenter merged commit 2653184 into main Sep 25, 2025
4 checks passed

WardBrian deleted the catch-bad-acceptance-rate branch September 25, 2025 18:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Convert acceptance rate of nan to 0 in dual averaging #29

Convert acceptance rate of nan to 0 in dual averaging #29

Uh oh!

WardBrian commented Sep 25, 2025

Uh oh!

bob-carpenter left a comment

Uh oh!

bob-carpenter Sep 25, 2025 •

edited

Loading

Uh oh!

WardBrian Sep 25, 2025

Uh oh!

bob-carpenter left a comment

Uh oh!

Uh oh!

Uh oh!

Convert acceptance rate of nan to 0 in dual averaging #29

Convert acceptance rate of nan to 0 in dual averaging #29

Uh oh!

Conversation

WardBrian commented Sep 25, 2025

Uh oh!

bob-carpenter left a comment

Choose a reason for hiding this comment

Uh oh!

bob-carpenter Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

WardBrian Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

bob-carpenter left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

bob-carpenter Sep 25, 2025 •

edited

Loading