Use only nnue evaluation #3173

voyag · 2020-10-07T09:14:12Z

Is it possible to add a UCI option for using only the NNUE evaluation instead of the hybrid one?

Sopel97 · 2020-10-07T12:15:58Z

nodchip/master already has a three-state Use NNUE option with the values "true", "false", and "pure", so doing this would not be hard.
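As a rough illustration of what such a three-state option looks like from the GUI side, here is a minimal sketch that builds the UCI command for each value. The option name and its three values are taken from the comment above; the `UseNNUE` enum and the `setoption_command` helper are hypothetical names made up for this example, and the standard UCI `setoption name <id> value <x>` syntax is assumed.

```python
from enum import Enum


class UseNNUE(Enum):
    """The three states mentioned above (hypothetical enum for illustration)."""
    FALSE = "false"  # classical evaluation only
    TRUE = "true"    # hybrid: NNUE with a classical fallback
    PURE = "pure"    # NNUE evaluation only


def setoption_command(mode: UseNNUE) -> str:
    """Build the UCI command string a GUI or script would send to the engine."""
    return f"setoption name Use NNUE value {mode.value}"


if __name__ == "__main__":
    for mode in UseNNUE:
        print(setoption_command(mode))
```

Whether a given engine build accepts "pure" depends on the fork; an engine that only knows a boolean Use NNUE option would be expected to reject or ignore it.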

rconstant42 · 2020-10-14T00:01:05Z

It seems that pure NNUE evaluation quickly finds solutions in bad positions. In some positions, standard SF (with hybrid eval) cannot find the best move to get out of a losing position, because the imbalance is so large that it uses only the standard evaluation and no longer the NNUE eval. If you instead "force" CFish, for example, to use only the NNUE eval via its "pure" NNUE UCI option, it finds all the best moves within a second.

Maybe SF should switch to the standard evaluation only when the imbalance between player and opponent is strictly positive, rather than large in absolute value (eval > threshold instead of abs(eval) > threshold). SF would then use the pure NNUE eval more often when the imbalance is negative, so that it would quickly find a way out of losing positions. This should be tried on the SF testing framework.
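To make the proposed change concrete, here is a minimal sketch of the two switching rules. It is not Stockfish's actual code: the function names, the threshold value of 500, and the sample imbalances are all assumptions chosen for illustration, and the imbalance is assumed to be measured from the side to move.

```python
# Hypothetical threshold in centipawn-like units; not a real Stockfish constant.
THRESHOLD = 500


def uses_classical_abs(imbalance: int, threshold: int = THRESHOLD) -> bool:
    """Rule described as current: fall back to classical on any large imbalance."""
    return abs(imbalance) > threshold


def uses_classical_signed(imbalance: int, threshold: int = THRESHOLD) -> bool:
    """Proposed rule: fall back to classical only when clearly ahead."""
    return imbalance > threshold


if __name__ == "__main__":
    # Made-up imbalances: far behind, slightly behind, slightly ahead, far ahead.
    for imbalance in (-800, -100, 100, 800):
        old = "classical" if uses_classical_abs(imbalance) else "NNUE"
        new = "classical" if uses_classical_signed(imbalance) else "NNUE"
        print(f"imbalance {imbalance:>5}: abs rule -> {old:9}  signed rule -> {new}")
```

The two rules differ only when the side to move is far behind (here, -800): the absolute rule hands the position to the classical eval, while the signed rule keeps NNUE.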

Here are some examples:
rn1qrnk1/p4pp1/1p1pp3/6P1/2Pp1PN1/2PQ4/P5P1/2KR3R w - - 0 1 bm Nh6+
4k3/4Pp2/1P1p1P1P/pPpPpK2/pr2pbP1/7r/3RP3/NN5b w - - 0 1 bm Rb2
r1n2N1k/2n2K1p/3pp3/5Pp1/b5R1/8/1PPP4/8 w - - 0 1 bm Ng6
rn2kbnr/pp2ppp1/2p5/3pP1Bp/3P3P/3Q4/PqP2PP1/RN2K1NR w KQkq - 0 1 bm e6

Sopel97 · 2020-10-14T14:26:22Z

@rconstant42 https://tests.stockfishchess.org/tests/view/5f86dac4dcdad978fe8c51c6

rconstant42 · 2020-10-14T16:12:55Z

Thanks for trying

vondele · 2020-10-14T17:38:07Z

This has come up a number of times, and I still see no new arguments for 'pure' NNUE: it is weaker than what we do so far. There are various parts of the engine that could be disabled, which would make it weaker in general but maybe stronger for specific positions.

mstembera · 2020-10-16T04:03:22Z

Given the same reasoning it makes no sense to have pure classic either. Yet we do. How do we reconcile this inconsistency?

vondele · 2020-10-16T05:47:59Z

Because there is value in retaining what is probably the best hand-crafted evaluation ever written. We do already effectively use it in our 'hybrid' mode. I think it is pretty different. The best comparison is: why don't we have an option to disable TT, NMP, LMR, ...?

mstembera · 2020-10-16T07:52:35Z

Disabling TT, NMP, etc. doesn't seem like a very close comparison, because those simply handicap the engine w/o providing any real benefit. (This is probably why users are not asking for those but are asking for pure NNUE.) Having pure eval methods, on the other hand, can provide unique perspectives on positions. In any case I am not advocating for removing classical. I am simply pointing out that both pure NNUE and pure classical are weaker than hybrid. It therefore seems inconsistent that we support a pure version of one and not the other.

Sopel97 · 2020-10-16T09:55:23Z

> because there is value in retaining what is probably the best hand-crafted evaluation ever written.

Right now it's probably at Stockfish 11 strength. I've seen many people using it as if it were the best, but it's not. I think it's even harmful.

vondele · 2020-10-16T10:05:23Z

> > because there is value in retaining what is probably the best hand-crafted evaluation ever written.

> Right now it's probably at Stockfish 11 strength. I've seen many people using it as if it was the best but it's not. I think it's even harmful.

The evaluation has been improved, but overall, because of its integration with search, the whole eval+search might turn out to be weaker.

Sopel97 · 2020-10-16T21:15:50Z

It seems that classical is getting back into shape with the latest patches, and it may beat the last classical versions in the future.
https://tests.stockfishchess.org/tests/view/5f89c85beae8a6e60644d6c8

Therefore I think it's best to keep the Use NNUE value "false" as an option.

Going back to the original discussion, though: I think one possible reason for pure NNUE that was not mentioned before is that some (even theoretical) hardware could show the pure version to have better playing performance than the hybrid version. Fishtest uses a very narrow set of mostly AVX2 workers. A test with fixed nodes / nodestime could shed more light on this.

I also agree with mstembera's statement:

> Given the same reasoning it makes no sense to have pure classic either. Yet we do. How do we reconcile this inconsistency?

but not in the sense of "let's remove classic"; rather in the sense of showing that pure has its place. Since hybrid is now the default, deviating from it in either of the two possible directions should be treated equally.

syzygy1 · 2020-10-19T01:56:11Z

Also, we don't really know what is best under TCEC-like conditions. At least at first there were many reports of bad scaling of the hybrid solution. Having a pure NNUE option could lead to more data on this question.

vondele closed this as completed Oct 14, 2020
