Model compare: figure out how to add noise while comparing #87

barakugav · 2022-10-24T10:45:52Z

No description provided.

barakugav · 2022-10-24T14:56:26Z

Maybe sample random openings from an opening book, picking only ones that are still considered equal.

poja · 2022-10-24T14:59:38Z

poja · 2022-10-24T14:59:54Z

What do you think about the noisy-beginning idea?
I prefer not to use external knowledge about the game (e.g. opening book)

barakugav · 2022-10-24T16:34:42Z

I'm not sure about it: a noisy beginning will not necessarily leave an equal position for the rest of the game.
Why not an opening book? It would only be used to choose the initial position.

poja · 2022-10-25T13:59:53Z

I think it could be okay that the position is not equalized, because (a) the moves are still chosen by the players (just with some added 'luck'), and (b) that's why we do many comparisons: there is some 'luck' involved.

I think it's much more elegant if the whole training flow has no human knowledge, or at least no more human knowledge than in AlphaZero. Or at least, we should have a *default* workflow as such (and possibly other ones too).

BTW you probably saw it, but I like this formulation in their paper:

![image](https://user-images.githubusercontent.com/4618146/197798679-17019750-46c7-4ec3-9f07-3c6e616bf720.png)

barakugav · 2022-10-25T15:44:18Z

Alright, I agree it's more elegant without an opening book, but I still think we should look for a better solution.
We will not run hundreds of comparison games...
First of all, we can ensure both players get the same 'luck' by running twice from the same noisy position with the players switched.
But I still think it will cause us to mis-evaluate the models.
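
A minimal sketch of the switched-players idea, in Python for illustration only. The names `compare_paired` and `play_game` are hypothetical and not taken from this project; `play_game(first, second, opening)` is assumed to return +1, 0, or -1 from the point of view of the player who moves first.

```python
from typing import Callable, Sequence, TypeVar

Model = TypeVar("Model")
Opening = TypeVar("Opening")


def compare_paired(
    play_game: Callable[[Model, Model, Opening], int],
    model_a: Model,
    model_b: Model,
    openings: Sequence[Opening],
) -> float:
    """Return model_a's mean score in [-1, 1] over paired games.

    Each (possibly unbalanced) opening is played twice, once with each
    model moving first, so any bias in the opening is given to both sides.
    """
    if not openings:
        raise ValueError("need at least one opening")
    total = 0
    for opening in openings:
        # Game 1: model_a moves first, result is already from A's view.
        total += play_game(model_a, model_b, opening)
        # Game 2: model_b moves first, so flip the sign to get A's view.
        total -= play_game(model_b, model_a, opening)
    return total / (2 * len(openings))
```

If the opening alone decides the result, the two games of a pair cancel and the score is 0, so only differences between the models move the score away from 0. This removes the opening bias but not the variance from having few games.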

poja · 2022-11-02T21:19:18Z

Do you think we can rely here on the noise from floating-point errors in the network activations? And from multithreading?

barakugav · 2022-11-03T07:53:42Z

Floating point: no. Multithreading: yes, but we don't have multithreading within a single search, only multithreading across multiple searches, so currently it doesn't have any effect.

barakugav · 2022-11-03T08:36:47Z

The paper says "t -> 0", so maybe they just use a very small temperature. I think that's reasonable.
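
For reference, a small Python sketch of temperature-based move selection, assuming the usual AlphaZero-style rule where a move is picked with probability proportional to `N(a)^(1/t)`, with `N(a)` the root visit count. The function name `sample_move` and its signature are hypothetical, not this project's API.

```python
import math
import random
from typing import Sequence


def sample_move(visit_counts: Sequence[int], temperature: float,
                rng: random.Random) -> int:
    """Pick a move index with probability proportional to N(a)^(1/t).

    temperature <= 0 is treated as the limit t -> 0: always play the
    most-visited move (first one on ties), i.e. fully deterministic.
    """
    best = max(visit_counts)
    if best <= 0:
        raise ValueError("at least one move must have been visited")
    if temperature <= 0:
        return list(visit_counts).index(best)
    # Work with log(N / max N) so that tiny temperatures underflow to 0
    # for weak moves instead of overflowing for strong ones.
    weights = [
        math.exp(math.log(n / best) / temperature) if n > 0 else 0.0
        for n in visit_counts
    ]
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]
```

With a very small but positive temperature, almost all the probability mass sits on the most-visited move, and noticeable randomness appears only when two moves have nearly equal visit counts. That keeps the comparison close to greedy play while still breaking determinism.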

barakugav added the priority-medium label Oct 24, 2022

poja mentioned this issue Oct 31, 2022

Tests: determinism test of the algorithm #93

Closed
