Remove razoring #3278

unaiic · 2020-12-26T12:48:49Z

STC https://tests.stockfishchess.org/tests/view/5fe653403932f79192d3981a
LLR: 2.95 (-2.94,2.94) {-1.25,0.25}
Total: 63448 W: 5965 L: 5934 D: 51549
Ptnml(0-2): 230, 4738, 21769, 4745, 242

LTC https://tests.stockfishchess.org/tests/view/5fe6f0f03932f79192d39856
LLR: 2.93 (-2.94,2.94) {-0.75,0.25}
Total: 65368 W: 2485 L: 2459 D: 60424
Ptnml(0-2): 33, 2186, 28230, 2192, 43

bench: 4184328

Simplify razoring.

FauziAkram · 2020-12-26T12:59:09Z

Can you please also update the Elo estimation for each step, since you calculated it? (Maybe in a different PR.)

unaiic · 2020-12-26T13:07:45Z

@FauziAkram Yeah, I'm still doing more elo estimations, so I think I'll wait until I finish with all of them. But sure, good idea :)

anshulongithub · 2020-12-26T16:08:19Z

@joergoster Could you elaborate on why you think that removing razoring is not a good idea?

Vizvezdenec · 2020-12-26T16:55:15Z

I think that updating elo estimations should be done separately.
Also IIRC they were done at LTC - and some of this is pretty TC sensitive.

unaiic · 2020-12-26T16:57:28Z

@Vizvezdenec You suggest testing them at LTC then? It'd be done separately, of course :)

Vizvezdenec · 2020-12-26T17:14:17Z

Well, only if we would otherwise go pretty idle; it's not a priority.

joergoster · 2020-12-27T10:40:07Z

@anshulongithub Why does one want to remove it in the first place?

It is a well known pruning technique and gains elo. The fact that it can be removed is only due to the current strength of Stockfish and the fact that, at this strength, elo gets compressed and small gains or losses are hardly measurable. That's also the reason why it is much easier to get simplifications passed than elo-gaining patches.

However, I no longer care that much about the development of SF. Too much has not been to my liking in the past ...
I occasionally jump in if there is something that catches my interest and that's all.
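
For readers unfamiliar with the technique under discussion, razoring can be sketched roughly as follows. This is a minimal illustration of the general idea, not the code this PR removes: the function name, the callback parameters, and the margin value of 500 are all assumptions made for the example.

```python
def search_with_razoring(depth, static_eval, alpha, qsearch, full_search,
                         razor_margin=500):
    """Razor hopeless shallow nodes, otherwise search normally.

    All names and the margin value are illustrative assumptions.
    qsearch and full_search are zero-argument callables returning a score.
    """
    # Razoring condition: the node is at the shallowest depth and its static
    # evaluation is so far below alpha that even a margin cannot close the gap.
    if depth == 1 and static_eval <= alpha - razor_margin:
        return qsearch()   # resolve tactics only, skip the full search
    return full_search()   # the node is not hopeless: search it normally


# Usage: a node 700 units below alpha at depth 1 is razored.
print(search_with_razoring(1, -700, 0, lambda: "qsearch", lambda: "full"))
```

Removing razoring amounts to always taking the second branch, which is why the patch counts as a simplification.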

ddobbelaere · 2020-12-27T13:40:17Z

> It is a well known pruning technique and gains elo.

Both linked tests seem to indicate only the slightest Elo regression: estimated Elo = -0.07/-0.02 (STC/LTC). The 95% confidence intervals are [-1.37,1.14] and [-0.84,0.74], respectively.

Note that, funnily enough, W > L for both tests. This is probably a weird artifact of the pentanomial model (related to asymmetry of outcomes of the game pairs) that I don't understand?
For trinomial, I think W > L should always imply that estimated Elo > 0.
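
The W > L observation can be checked with a naive logistic conversion of the reported totals. The sketch below is only the textbook score-to-Elo formula, not the estimator fishtest reports; the function names are made up for the example. It suggests that the pentanomial counts give exactly the same mean score as W/L/D, so the naive estimate is slightly positive in both models.

```python
import math


def logistic_elo(score):
    """Convert a mean score in (0, 1) to a logistic Elo difference."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def score_from_wld(wins, losses, draws):
    """Mean score per game from win/loss/draw counts."""
    return (wins + 0.5 * draws) / (wins + losses + draws)


def score_from_pentanomial(counts):
    """Mean score per game from Ptnml(0-2) counts.

    counts[k] is the number of game pairs that scored k/2 points out of 2.
    """
    pairs = sum(counts)
    points = sum(0.5 * k * n for k, n in enumerate(counts))
    return points / (2 * pairs)


# Totals copied from the STC and LTC results in the PR description.
stc_wld, stc_ptnml = (5965, 5934, 51549), (230, 4738, 21769, 4745, 242)
ltc_wld, ltc_ptnml = (2485, 2459, 60424), (33, 2186, 28230, 2192, 43)

for name, wld, ptnml in (("STC", stc_wld, stc_ptnml), ("LTC", ltc_wld, ltc_ptnml)):
    s_tri = score_from_wld(*wld)
    s_penta = score_from_pentanomial(ptnml)
    print(name, round(logistic_elo(s_tri), 3), round(logistic_elo(s_penta), 3))
```

Both naive values come out at roughly +0.1 to +0.2 Elo, so the negative reported estimates cannot be explained by the pentanomial counts alone.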

vdbergh · 2020-12-27T14:21:07Z

> > It is a well known pruning technique and gains elo.
>
> Both linked tests seem to indicate only the slightest Elo regression: estimated Elo = -0.07/-0.02 (STC/LTC). The 95% confidence intervals are [-1.37,1.14] and [-0.84,0.74], respectively.
>
> Note that, funnily enough, W > L for both tests. This is probably a weird artifact of the pentanomial model (related to asymmetry of outcomes of the game pairs) that I don't understand?
> For trinomial, I think W > L should always imply that estimated Elo > 0.

No, this is not true. The Elo estimate takes into account the length of the test. It is not so easy to reason about it intuitively. The formal definition is

P(elo estimate<=true elo)=50%.

This is called a median unbiased estimator (if you repeat the same test many times the elo estimate
will be too low in half the cases).
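
Why the stopping rule matters can be illustrated with a toy simulation. This is not fishtest's SPRT or its estimator: the stopping bounds on W - L, the draw probability, and all names below are assumptions chosen only to keep the example small. Two equally strong engines (true Elo difference 0) play until W - L reaches +10 or -30, and the naive estimate (W - L) / games is recorded at the stopping time.

```python
import random


def run_toy_test(rng, upper=10, lower=-30, draw_prob=0.5):
    """Run one sequential test between equally strong engines.

    Stops when W - L hits `upper` or `lower` (hypothetical bounds) and
    returns the naive per-game estimate (W - L) / games at that moment.
    """
    diff = 0
    games = 0
    while lower < diff < upper:
        games += 1
        if rng.random() < draw_prob:
            continue                      # a draw leaves W - L unchanged
        diff += 1 if rng.random() < 0.5 else -1
    return diff / games


rng = random.Random(2020)                 # fixed seed for reproducibility
estimates = [run_toy_test(rng) for _ in range(1000)]

# The true value is 0, so a median unbiased estimator would exceed it in
# about half of the repetitions.
share_above_truth = sum(e > 0 for e in estimates) / len(estimates)
print(f"naive estimate above the true value in {share_above_truth:.0%} of runs")
```

In this toy setup the naive estimate lands above the true value far more often than half the time, so it is not median unbiased; an estimator meeting the definition above has to correct for how and when the test stopped.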

ddobbelaere · 2020-12-27T15:20:49Z

@vdbergh Thanks! Oh wow this is subtle, I didn't even think about length.

Is the following intuitive reasoning correct? Assume a trinomial model. Take test 1 with final outcome W=L and some D. Take test 2 with final outcome W'=L'=2W and D'=2D. Then, as the estimated elo is based on the Brownian motion paper, intuitively "estimated elo 2 < estimated elo 1" should hold for [-1.25; 0.25] bounds, as test 2 needs double the number of games, so chances are higher that the true elo lies closer to -0.5.

Vizvezdenec · 2020-12-27T16:01:11Z

Razoring has been known to gain almost nothing for ages. In fact, previous attempts to remove it got close to about 2.5 LLR twice, and now it seems to be even less useful.
Maybe we can try to reintroduce a stronger form of it if it gets removed at a cost of about 0.2 elo...
For example, one of my tests of depth 3 razoring was close to passing somewhere between SF11 and SF12. Maybe it could pass from scratch now?

AlexandreMasta · 2020-12-28T02:29:45Z

Razoring is such an extensively tested technique. It is proven to gain elo in all A/B engines. I can't imagine why this is being discussed. OK... you want one more elo-losing patch from Unaiic, OK... go ahead... remove one or two lines of code for a regressive patch, as has been done recently. SF is so far ahead that no one will notice.

ianfab · 2020-12-28T10:26:48Z

> Razoring has been known to gain almost nothing for ages. In fact, previous attempts to remove it got close to about 2.5 LLR twice, and now it seems to be even less useful.
> Maybe we can try to reintroduce a stronger form of it if it gets removed at a cost of about 0.2 elo...
> For example, one of my tests of depth 3 razoring was close to passing somewhere between SF11 and SF12. Maybe it could pass from scratch now?

Exactly. Something similar happened with IID already. If a search/pruning technique no longer gains Elo, simplifying it away opens up the room for something better to replace it. If there are doubts that the passed tests provide sufficient evidence that the change is not a (considerable) regression or that the code simplification is not worth the potential minor regression, then there should be a discussion about the testing conditions in general (not in this thread), not about the patch.

vondele added the "to be merged" label Dec 31, 2020

vondele closed this in 8ec97d1 Dec 31, 2020

unaiic deleted the pm branch April 9, 2021 09:58
