Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Very bad Chess960 performance #203

Closed
Matthies opened this issue Apr 29, 2020 · 4 comments
Closed

Very bad Chess960 performance #203

Matthies opened this issue Apr 29, 2020 · 4 comments

Comments

@Matthies
Copy link
Owner

After implementing Chess960 tests show that Rubi plays extremely bad in this variant. Probably evaluation too much hardcoded for the normal chess setup. Improving Chess960 performance with more flexible evaluation may also help in normal chess.

Gauntlet normal chess:

    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Ethereal-12.00                 : 2592  110  77    20    70.0 %   2445   60.0 %
  2 Fire-7.1                       : 2572  149 142    20    67.5 %   2445   25.0 %
  3 Komodo-10                      : 2498  119 116    20    57.5 %   2445   45.0 %
  4 Rubi-20200429                  : 2432   44  44   160    57.2 %   2382   34.4 %
  5 SlowChess-2.0                  : 2428  117 118    20    47.5 %   2445   45.0 %
  6 Vajolet-2.8                    : 2393  128 130    20    42.5 %   2445   35.0 %
  7 Pedone-2.0                     : 2298  136 144    20    30.0 %   2445   30.0 %
  8 Senpai-2                       : 2230  127 142    20    22.5 %   2445   35.0 %
  9 Monolith-2                     : 1934  344  88    20     5.0 %   2445    0.0 %

Gauntlet Chess960:

    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Ethereal-12.00                 : 2858   88 222    20    95.0 %   2346   10.0 %
  2 Fire-7.1                       : 2615  154 115    20    82.5 %   2346   35.0 %
  3 Komodo-10                      : 2561  161 148    20    77.5 %   2346   25.0 %
  4 SlowChess-2.0                  : 2381  112 110    20    55.0 %   2346   50.0 %
  5 Senpai-2                       : 2363  118 117    20    52.5 %   2346   45.0 %
  6 Rubi-20200429                  : 2361   44  45   160    40.9 %   2424   33.1 %
  7 Pedone-2.0                     : 2329  128 129    20    47.5 %   2346   35.0 %
  8 Vajolet-2.8                    : 2276  134 137    20    40.0 %   2346   30.0 %
  9 Monolith-2                     : 2131  127 142    20    22.5 %   2346   35.0 %
@Matthies Matthies mentioned this issue Apr 29, 2020
Merged
@Matthies
Copy link
Owner Author

Making the psq symmetric in chess960 by using (psq_left + psq_right) / 2 makes it even worse.

@Matthies
Copy link
Owner Author

There is propably something wrong with the (castle-) move handling or maybe incremental psq evaluation. Look at this log and the score in the first depths:

>RubiChess(4): position fen qrkrbbnn/pppppppp/8/8/8/8/PPPPPPPP/QRKRBBNN w DBdb - 0 1 moves d2d4 g8f6 c2c4 e7e6 e2e3 d7d5 c4d5 f6d5 e3e4 d5f6 f2f3 h8g6 c1c2 a7a5 b1c1 b7b5 c2b1
>RubiChess(4): isready
<RubiChess(4): readyok
>RubiChess(4): go wtime 92527 btime 105458 winc 1000 binc 1000
<RubiChess(4): info depth 1 seldepth 2 multipv 1 time 0 score cp 4085  nodes 46 nps 0 tbhits 0 hashfull 0 pv f8d6  
<RubiChess(4): info depth 2 seldepth 4 multipv 1 time 0 score cp 2595  nodes 245 nps 0 tbhits 0 hashfull 0 pv a8e4  f3e4  f6e4  e1a5  
<RubiChess(4): info depth 3 seldepth 5 multipv 1 time 0 score cp 153  nodes 626 nps 0 tbhits 0 hashfull 0 pv d8d4  b2b3  d4d1  c1d1  
<RubiChess(4): info depth 4 seldepth 6 multipv 1 time 1 score cp 153  nodes 744 nps 0 tbhits 0 hashfull 0 pv d8d4  b2b3  d4d1  c1d1  

@Matthies
Copy link
Owner Author

Probably this one bad evaluation value for queen with no mobility VALUE(-4097, 83) which is the result of some fake tuning and has no relevance for normal chess but it has for FRC.

@Matthies
Copy link
Owner Author

New gauntlet with latest FRC evaluation fix:

10.08.2020 19:08:58 :

    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Ethereal-12.00                 : 2637  142 106    20    80.0 %   2396   40.0 %
  2 Komodo-10                      : 2637  154 138    20    80.0 %   2396   30.0 %
  3 Fire-7.1                       : 2611  182 165    20    77.5 %   2396   15.0 %
  4 SlowChess-2.0                  : 2466  137 134    20    60.0 %   2396   30.0 %
  5 Rubi-20200810                  : 2396   46  46   160    48.8 %   2404   28.8 %
  6 Vajolet-2.8                    : 2288  135 140    20    35.0 %   2396   30.0 %
  7 Pedone-2.0                     : 2288  135 140    20    35.0 %   2396   30.0 %
  8 Senpai-2                       : 2249  121 129    20    30.0 %   2396   40.0 %
  9 Monolith-2                     : 2058  188 219    20    12.5 %   2396   15.0 %

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant