Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

validation in 0.16 version has significant pause between each match compared to 0.15 #1987

Open
simonnier opened this issue Nov 3, 2018 · 16 comments

Comments

@simonnier
Copy link

simonnier commented Nov 3, 2018

Dear developers,

I tried the new validation program in latest 0.16 release with leelaz 0.16 on ubuntu , and found that it has significant pause between each match compared to previous validation in 0.15 with leelaz 0.15. This is very obvious if we validate two small weight for example No.91 vs No.116. It seems due to that leelaz 0.16 starts much slower than leelaz 0.15, am I right?

best regards

@alreadydone
Copy link
Contributor

leelaz should start faster if you specify precision in the commands.

@simonnier
Copy link
Author

@alreadydone Thank you for reply. But why I should specify a precision? I though the leelaz will automatically choose the best precision, am I right?

What is more, I just found the -k option is not working in new validation. I use for example

./validation -k sgf -n 91.gz -o "${opt1}" -n 116.gz -o "${opt2}" -g 1 -- ./leelaz -- ./leelaz

But found no sgf files generated in sgf folder. Is anything wrong with my command line?

@barrtgt
Copy link

barrtgt commented Nov 4, 2018

I think there is a bug with validation in combination with the binary commands and -k command. I found that you can generate sgf files if you put the -k command to the right of the binary commands, but it doesn't create a folder.

@simonnier
Copy link
Author

@alreadydone @barrtgt so, should we tag this as a bug?

@alreadydone
Copy link
Contributor

Precision detection takes time, and you can do it once and then use the detected precision afterwards by specifying it in the commands. It should be possible to not restart leelaz after each game, but the cache and tree would need to be cleared. If specifying precision indeed reduce the pause, I think it's not worth doing. sgf saving is a separate issue.

@simonnier
Copy link
Author

simonnier commented Nov 5, 2018

@alreadydone Thank you so much. I finally understand what you mean. And yes, setting precision explicitly remove the pause. and also putting -k option after two '-- binary' options actually works, the sgf files are generated.

However, I found another problem, the genreated sgf file has wrong infomation for White. The White always shows "Human". Something like below

(;GM[1]FF[4]RU[Chinese]DT[2018-11-05]SZ[19]KM[7.5]PB[Leela Zero 0.16 91_6x128]PW[Human]RE[W+83.5]
C[Leela Zero options: -p1 --precision half -g --noponder -t1 -d -r0 -w 91_6x128.gz --seed 6782667934433283899]

@alreadydone
Copy link
Contributor

I think that's because the black leelaz, not autogtp/validation, is responsible for writing the sgf, and has no idea who its opponents is. If you look at the matches on zero.sjeng.org, the sgfs (generated by autogtp instead of validation) don't show white's options, but do show white's network hash, so it may not be so hard to fix this.

@Splee99
Copy link

Splee99 commented Nov 11, 2018

To make an engine stronger, calculation speed is only one aspect. In LZ the half precision calculation makes the engine much weaker although more playouts is done per move, as appeared from several games with Phoenixgo. Maybe it's my GPU which doesn't support half precision, but I think with so many calculations for the 40 block weight, the half precision calculation may miss many best moves when the game situation is subtle. That's why I'm wondering whether it is valid to only compare the calculation speed to determine if half precision should be used.

@gcp
Copy link
Member

gcp commented Nov 19, 2018

In LZ the half precision calculation makes the engine much weaker

Citation needed. IIRC when we tested this the influence was negligible.

@gcp
Copy link
Member

gcp commented Nov 19, 2018

However, I found another problem, the genreated sgf file has wrong infomation for White. The White always shows "Human".

AutoGTP fixes up the SGF with the correct opponent, I guess Validation could do it as well.

@marcocalignano
Copy link
Member

The last fix shouldn't be difficult but I have to test it.

@Splee99
Copy link

Splee99 commented Nov 20, 2018

LZ has no problem to win 2 handicap game with Phoenixgo using single precision. However, LZ lost the following two handicap game with half precision.

http://eidogo.com/#sHnGpIoW

You would say this is only one game, but LZ is able to win more than 10 single precision games in a row. If we check this half precision game closely, we see the win rate variate wildly, especially at the end of the game. LZ thinks it still has 15% winrate at the very end, while at the middle of the game the winrate even reached 65% (which is justified). If there is such variation I doubt it can find the best moves from time to time.

@wonderingabout
Copy link
Contributor

wonderingabout commented Nov 20, 2018

@Splee99

this behaviour can also happen with leela zero against other bots

in ai vs ai, especially when the bots are different, the winrates can dramatically change
for example tianrang ai :
https://online-go.com/game/15357763

you may want to gather more statistical data to reach a conclusion

@Splee99
Copy link

Splee99 commented Nov 21, 2018

Not trying to draw any conclusion, just want to discuss what is actually going on by looking in detail and compare the decision made by LZ single and LZ half. Note that the quantization error is accumulated and only appear after several moves. Here is again the game I post before at move 231:
board
The LZ half choose H4 which is a blunder, letting black F6 escape when black plays G5. Here is the log from several moves earlier:

Click to see the log
Leelaz-Half> play B H10
= 
Leelaz-Half> genmove W
= H11
Thinking at most 14.0 seconds...
NN eval=0.819683
Playouts: 177, Win: 64.29%, PV: H11 J11 K2 F11 G11 G9 F10 J3 J2 H5
Playouts: 364, Win: 64.61%, PV: H11 J11 K2 J3 J4 K4 J2 H5 H4 J5 G5 T3 T2
Playouts: 555, Win: 67.76%, PV: H11 J11 K2 J3 J4 K4 J2 H5 H4 J5 G5 T3 T2
Playouts: 769, Win: 68.78%, PV: H11 J11 K2 J3 J4 K4 J2 H5 H4 J5 G5 T3 T2 T5 S2
Playouts: 975, Win: 69.49%, PV: H11 J11 K2 G9 F9 F11 F10 J3 J2 H5 G5 H7 G7 G6 J7
 H11 ->    1023 (V: 69.81%) (N: 55.34%) PV: H11 J11 K2 J3 J4 J5 H4 K4 J2 B11 T3 H5 G5 H7 G7 J7 G6 G9 F9 F11 F10
  J4 ->      20 (V: 59.48%) (N:  9.03%) PV: J4 K4 H11 J11 S4 B11 K2
  S5 ->       8 (V: 61.65%) (N:  3.34%) PV: S5 T5 T3 G9 H11
  K2 ->       7 (V: 50.65%) (N:  8.14%) PV: K2 G9 G11
 G11 ->       6 (V: 58.83%) (N:  1.93%) PV: G11 K11 J4 K4
  O8 ->       3 (V: 56.75%) (N:  2.26%) PV: O8 N8 P9
  G9 ->       1 (V: 50.79%) (N:  1.65%) PV: G9 
 F15 ->       1 (V: 49.76%) (N:  0.82%) PV: F15 
 K11 ->       1 (V: 40.12%) (N:  1.99%) PV: K11 
  S4 ->       1 (V: 38.89%) (N:  2.52%) PV: S4 
  Q7 ->       1 (V: 38.42%) (N:  3.42%) PV: Q7 
  T3 ->       1 (V: 24.96%) (N:  1.52%) PV: T3 
9.7 average depth, 22 max depth
559 non leaf nodes, 1.92 average children
1074 visits, 161772 nodes, 1073 playouts, 76 n/s
? Leelaz-Half> play B J11
= 
? Leelaz-Half> genmove W
= K2
Thinking at most 14.0 seconds...
NN eval=0.894983
Playouts: 187, Win: 67.67%, PV: K2 J3 J4 K4 J2 F11 G11 G9 F9 F10 E11 F15 F13
Playouts: 388, Win: 65.84%, PV: K2 J3 J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S4 T6 T3 O11
Playouts: 584, Win: 63.19%, PV: K2 J3 J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S5 T5 T3 T6 O11 S4
Playouts: 745, Win: 58.41%, PV: K2 J3 J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S5 T5 T3 T6 O11 S4
Playouts: 945, Win: 56.58%, PV: K2 J3 J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S5 T5 T3 T6 O11 S4
  K2 ->    1870 (V: 56.80%) (N: 27.12%) PV: K2 J3 J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S5 T5 T3 T6 O11 S4
  J4 ->      55 (V: 55.09%) (N: 31.82%) PV: J4 K4 T3 B11 O11 G9 F9 F11
  S5 ->      31 (V: 65.56%) (N:  4.87%) PV: S5 T5 T3 B11 O11 H7 G7
  S4 ->      21 (V: 65.39%) (N:  4.20%) PV: S4 B11 K2 H5 G5
  Q7 ->      14 (V: 60.06%) (N:  5.42%) PV: Q7 Q8 S5 T5 T3 J3
  G9 ->      13 (V: 55.63%) (N:  7.57%) PV: G9 B11 K2 J3 J2 H5 G5
 F15 ->      12 (V: 54.84%) (N:  7.05%) PV: F15 B11 J4 K4
 A12 ->       4 (V: 76.26%) (N:  0.37%) PV: A12 T3
  T3 ->       4 (V: 55.02%) (N:  1.72%) PV: T3 G9 F9
  O8 ->       3 (V: 46.31%) (N:  2.13%) PV: O8 N8 P9
 O11 ->       1 (V: 53.15%) (N:  1.03%) PV: O11 
  J3 ->       1 (V: 49.96%) (N:  1.04%) PV: J3 
  L3 ->       1 (V: 29.78%) (N:  0.70%) PV: L3 
9.4 average depth, 21 max depth
941 non leaf nodes, 2.16 average children
2031 visits, 302637 nodes, 1064 playouts, 76 n/s
? Leelaz-Half> play B J3
= 
? Leelaz-Half> genmove W
= J4
Thinking at most 14.0 seconds...
NN eval=0.848305
Playouts: 190, Win: 50.15%, PV: J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S5 T5 T3 T6 O11 S4
Playouts: 365, Win: 47.08%, PV: J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S5 T5 T3 T6 O11 S4
Playouts: 571, Win: 45.41%, PV: J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S5 T5 T3 T6 O11 S4
Playouts: 792, Win: 45.52%, PV: J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S5 T5 T3 T6 O11 S4
Playouts: 1046, Win: 45.89%, PV: J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S5 T5 T3 T6 O11 S4
  J4 ->    1916 (V: 49.14%) (N: 14.41%) PV: J4 K4 J2 H7 G7 H5 H4 J5 G5 J7 G6 B11 S5 T5 T3 T6 O11 S4
  J2 ->     620 (V: 45.82%) (N: 62.92%) PV: J2 H5 G5 H7 G7 H4 L3 K4 J7 J6 H7 T3
  J5 ->      22 (V: 50.83%) (N:  3.02%) PV: J5 J2 J4 K4 L2 T3 T2 T6
  H4 ->      18 (V: 58.19%) (N:  1.01%) PV: H4 J2 L3 J5 J1 T3 S4
  L3 ->      15 (V: 48.62%) (N:  3.28%) PV: L3 H5 G5 H7 G7 J6
 A12 ->      12 (V: 52.21%) (N:  0.26%) PV: A12 B12 B11 J2 J4
  S5 ->      11 (V: 46.42%) (N:  2.99%) PV: S5 T5 T3 H5 H4 H7
  Q7 ->       6 (V: 42.40%) (N:  2.59%) PV: Q7 Q8 J2 H5 G5 H7
  O8 ->       3 (V: 46.76%) (N:  0.78%) PV: O8 N8 J2
 F15 ->       1 (V: 43.99%) (N:  0.12%) PV: F15 
 A17 ->       1 (V: 41.16%) (N:  0.14%) PV: A17 
  A7 ->       1 (V: 40.45%) (N:  0.08%) PV: A7 
  T6 ->       1 (V: 37.75%) (N:  0.07%) PV: T6 
 O11 ->       1 (V: 36.48%) (N:  0.07%) PV: O11 
  G9 ->       1 (V: 35.91%) (N:  0.12%) PV: G9 
 H15 ->       1 (V: 35.91%) (N:  0.05%) PV: H15 
  H7 ->       1 (V: 34.51%) (N:  0.04%) PV: H7 
  N8 ->       1 (V: 34.48%) (N:  0.07%) PV: N8 
  S6 ->       1 (V: 33.83%) (N:  0.08%) PV: S6 
 S14 ->       1 (V: 33.68%) (N:  0.06%) PV: S14 
 R14 ->       1 (V: 33.42%) (N:  0.06%) PV: R14 
 T11 ->       1 (V: 33.40%) (N:  0.05%) PV: T11 
 K11 ->       1 (V: 31.88%) (N:  0.04%) PV: K11 
  S4 ->       1 (V: 31.73%) (N:  0.68%) PV: S4 
 B11 ->       1 (V: 31.12%) (N:  0.04%) PV: B11 
  P7 ->       1 (V: 28.07%) (N:  0.05%) PV: P7 
 A11 ->       1 (V: 28.02%) (N:  0.05%) PV: A11 
 B12 ->       1 (V: 27.78%) (N:  0.05%) PV: B12 
  N6 ->       1 (V: 27.27%) (N:  0.05%) PV: N6 
 H14 ->       1 (V: 26.23%) (N:  0.05%) PV: H14 
  H5 ->       1 (V: 25.15%) (N:  1.48%) PV: H5 
 T10 ->       1 (V: 22.64%) (N:  0.04%) PV: T10 
  T9 ->       1 (V: 22.06%) (N:  0.04%) PV: T9 
  P9 ->       1 (V: 21.59%) (N:  0.05%) PV: P9 
  J6 ->       1 (V: 21.43%) (N:  0.04%) PV: J6 
  Q8 ->       1 (V: 21.41%) (N:  0.05%) PV: Q8 
  S9 ->       1 (V: 21.16%) (N:  0.04%) PV: S9 
  T5 ->       1 (V: 16.86%) (N:  0.07%) PV: T5 
 A10 ->       1 (V: 16.11%) (N:  0.04%) PV: A10 
 T12 ->       1 (V: 15.55%) (N:  0.04%) PV: T12 
  T3 ->       1 (V: 15.26%) (N:  0.48%) PV: T3 
  T7 ->       1 (V: 14.79%) (N:  0.05%) PV: T7 
  F9 ->       1 (V: 14.39%) (N:  0.04%) PV: F9 
 S12 ->       1 (V: 14.09%) (N:  0.04%) PV: S12 
  T8 ->       1 (V: 13.53%) (N:  0.04%) PV: T8 
  G7 ->       1 (V: 12.96%) (N:  0.04%) PV: G7 
  G5 ->       1 (V: 12.25%) (N:  0.04%) PV: G5 
  K4 ->       1 (V: 12.01%) (N:  0.04%) PV: K4 
 F13 ->       1 (V: 11.88%) (N:  0.03%) PV: F13 
  J7 ->       1 (V: 11.86%) (N:  0.03%) PV: J7 
 F11 ->       1 (V: 11.80%) (N:  0.03%) PV: F11 
  R2 ->       1 (V: 11.11%) (N:  0.05%) PV: R2 
 F10 ->       1 (V: 10.24%) (N:  0.04%) PV: F10 
 D11 ->       1 (V: 10.17%) (N:  0.03%) PV: D11 
 G11 ->       1 (V: 10.15%) (N:  0.03%) PV: G11 
  R6 ->       1 (V:  9.62%) (N:  0.05%) PV: R6 
 A16 ->       1 (V:  9.41%) (N:  0.04%) PV: A16 
  T2 ->       1 (V:  9.34%) (N:  0.04%) PV: T2 
  O7 ->       1 (V:  9.22%) (N:  0.05%) PV: O7 
  H1 ->       1 (V:  9.07%) (N:  0.04%) PV: H1 
  L1 ->       1 (V:  9.05%) (N:  0.04%) PV: L1 
  L2 ->       1 (V:  9.00%) (N:  0.04%) PV: L2 
 F12 ->       1 (V:  8.84%) (N:  0.03%) PV: F12 
 H19 ->       1 (V:  8.64%) (N:  0.03%) PV: H19 
 O18 ->       1 (V:  8.61%) (N:  0.04%) PV: O18 
 K19 ->       1 (V:  8.46%) (N:  0.04%) PV: K19 
  J1 ->       1 (V:  8.46%) (N:  0.04%) PV: J1 
 D10 ->       1 (V:  8.13%) (N:  0.03%) PV: D10 
  S1 ->       1 (V:  8.05%) (N:  0.05%) PV: S1 
 E11 ->       1 (V:  7.96%) (N:  0.04%) PV: E11 
 R19 ->       1 (V:  7.61%) (N:  0.04%) PV: R19 
  D8 ->       1 (V:  7.58%) (N:  0.03%) PV: D8 
 R16 ->       1 (V:  7.47%) (N:  0.03%) PV: R16 
 S16 ->       1 (V:  7.43%) (N:  0.04%) PV: S16 
 L11 ->       1 (V:  7.37%) (N:  0.03%) PV: L11 
 M12 ->       1 (V:  7.35%) (N:  0.04%) PV: M12 
 S13 ->       1 (V:  7.31%) (N:  0.04%) PV: S13 
  D2 ->       1 (V:  7.26%) (N:  0.04%) PV: D2 
  A2 ->       1 (V:  7.25%) (N:  0.04%) PV: A2 
 M18 ->       1 (V:  6.80%) (N:  0.04%) PV: M18 
  H2 ->       1 (V:  6.74%) (N:  0.05%) PV: H2 
  E9 ->       1 (V:  6.70%) (N:  0.04%) PV: E9 
 T13 ->       1 (V:  6.60%) (N:  0.05%) PV: T13 
  G3 ->       1 (V:  6.52%) (N:  0.04%) PV: G3 
  C9 ->       1 (V:  6.48%) (N:  0.03%) PV: C9 
 M19 ->       1 (V:  6.46%) (N:  0.04%) PV: M19 
 S17 ->       1 (V:  6.43%) (N:  0.04%) PV: S17 
  F5 ->       1 (V:  6.29%) (N:  0.04%) PV: F5 
  G6 ->       1 (V:  6.26%) (N:  0.04%) PV: G6 
 S15 ->       1 (V:  6.16%) (N:  0.04%) PV: S15 
 L19 ->       1 (V:  6.07%) (N:  0.04%) PV: L19 
  F8 ->       1 (V:  5.94%) (N:  0.04%) PV: F8 
 T14 ->       1 (V:  5.90%) (N:  0.04%) PV: T14 
 M13 ->       1 (V:  5.90%) (N:  0.04%) PV: M13 
 T17 ->       1 (V:  5.79%) (N:  0.04%) PV: T17 
  Q2 ->       1 (V:  5.76%) (N:  0.04%) PV: Q2 
  R1 ->       1 (V:  5.70%) (N:  0.04%) PV: R1 
  C2 ->       1 (V:  5.70%) (N:  0.04%) PV: C2 
 T15 ->       1 (V:  5.57%) (N:  0.04%) PV: T15 
  Q5 ->       1 (V:  5.57%) (N:  0.04%) PV: Q5 
  B3 ->       1 (V:  5.57%) (N:  0.04%) PV: B3 
  N3 ->       1 (V:  5.56%) (N:  0.04%) PV: N3 
 N14 ->       1 (V:  5.52%) (N:  0.03%) PV: N14 
  P1 ->       1 (V:  5.45%) (N:  0.04%) PV: P1 
 O16 ->       1 (V:  5.31%) (N:  0.03%) PV: O16 
 R17 ->       1 (V:  5.29%) (N:  0.03%) PV: R17 
  B2 ->       1 (V:  5.19%) (N:  0.04%) PV: B2 
  C1 ->       1 (V:  5.14%) (N:  0.04%) PV: C1 
 N12 ->       1 (V:  5.14%) (N:  0.03%) PV: N12 
 N18 ->       1 (V:  5.12%) (N:  0.04%) PV: N18 
 P16 ->       1 (V:  5.07%) (N:  0.03%) PV: P16 
 T18 ->       1 (V:  5.04%) (N:  0.04%) PV: T18 
 S18 ->       1 (V:  5.02%) (N:  0.04%) PV: S18 
 A13 ->       1 (V:  4.87%) (N:  0.04%) PV: A13 
  A6 ->       1 (V:  4.86%) (N:  0.04%) PV: A6 
  G4 ->       1 (V:  4.86%) (N:  0.04%) PV: G4 
 L18 ->       1 (V:  4.75%) (N:  0.04%) PV: L18 
 A15 ->       1 (V:  4.71%) (N:  0.04%) PV: A15 
 M11 ->       1 (V:  4.69%) (N:  0.03%) PV: M11 
  E6 ->       1 (V:  4.68%) (N:  0.03%) PV: E6 
  M1 ->       1 (V:  4.57%) (N:  0.04%) PV: M1 
  F7 ->       1 (V:  4.49%) (N:  0.04%) PV: F7 
  P4 ->       1 (V:  4.47%) (N:  0.04%) PV: P4 
 J15 ->       1 (V:  4.43%) (N:  0.03%) PV: J15 
  N1 ->       1 (V:  4.32%) (N:  0.04%) PV: N1 
  B4 ->       1 (V:  4.31%) (N:  0.04%) PV: B4 
 P13 ->       1 (V:  4.29%) (N:  0.03%) PV: P13 
 P19 ->       1 (V:  4.29%) (N:  0.04%) PV: P19 
 O13 ->       1 (V:  4.25%) (N:  0.03%) PV: O13 
 N19 ->       1 (V:  4.22%) (N:  0.04%) PV: N19 
 N17 ->       1 (V:  4.21%) (N:  0.03%) PV: N17 
 T19 ->       1 (V:  4.17%) (N:  0.04%) PV: T19 
 T16 ->       1 (V:  4.14%) (N:  0.04%) PV: T16 
 Q19 ->       1 (V:  4.10%) (N:  0.04%) PV: Q19 
 M17 ->       1 (V:  4.09%) (N:  0.03%) PV: M17 
 O19 ->       1 (V:  4.03%) (N:  0.04%) PV: O19 
 L16 ->       1 (V:  4.01%) (N:  0.03%) PV: L16 
  O1 ->       1 (V:  3.90%) (N:  0.04%) PV: O1 
  S2 ->       1 (V:  3.81%) (N:  0.05%) PV: S2 
 E19 ->       1 (V:  3.75%) (N:  0.04%) PV: E19 
  K1 ->       1 (V:  3.73%) (N:  0.04%) PV: K1 
 J18 ->       1 (V:  3.65%) (N:  0.04%) PV: J18 
pass ->       1 (V:  3.54%) (N:  0.05%) PV: pass 
 O12 ->       1 (V:  3.53%) (N:  0.04%) PV: O12 
 B19 ->       1 (V:  3.45%) (N:  0.03%) PV: B19 
 E14 ->       1 (V:  3.37%) (N:  0.03%) PV: E14 
  N2 ->       1 (V:  3.33%) (N:  0.04%) PV: N2 
 S19 ->       1 (V:  3.32%) (N:  0.04%) PV: S19 
 K18 ->       1 (V:  3.32%) (N:  0.04%) PV: K18 
 A19 ->       1 (V:  3.26%) (N:  0.03%) PV: A19 
  T1 ->       1 (V:  3.25%) (N:  0.04%) PV: T1 
  Q1 ->       1 (V:  3.15%) (N:  0.04%) PV: Q1 
  P2 ->       1 (V:  3.13%) (N:  0.04%) PV: P2 
 E18 ->       1 (V:  2.76%) (N:  0.03%) PV: E18 
 P12 ->       1 (V:  2.68%) (N:  0.03%) PV: P12 
9.8 average depth, 23 max depth
1239 non leaf nodes, 2.23 average children
2770 visits, 406112 nodes, 1381 playouts, 98 n/s
? Leelaz-Half> undo 
= 
? Leelaz-Half> play W J2
= 
? Leelaz-Half> play B H5
= 
? Leelaz-Half> genmove W
= H4
Thinking at most 14.0 seconds...
NN eval=0.512030
Playouts: 204, Win: 56.74%, PV: H4 G5 J4 J5 K4 G9 F9 B11 S5 T5 T3 F15 F13 T6 O11
Playouts: 424, Win: 54.98%, PV: H4 G9 F9 F11 F10 F13 F12 F15 F13 E14 G11 H7 G7 G6 J7
Playouts: 699, Win: 56.33%, PV: H4 G9 F9 F11 F10 F15 G11 F13 F12 E14 F13 H7 G7 G6 J7 J6 H7 G5
Playouts: 947, Win: 57.75%, PV: H4 G9 F9 F11 F10 F15 G11 F13 F12 E14 F13 H7 G7 G6 J7 J6 H7 G5 J4
Playouts: 1167, Win: 57.63%, PV: H4 G9 F9 F11 F10 F15 G11 F13 F12 E14 F13 H7 G7 G6 J7 J6 H7 G5 J4 J5
  H4 ->    1361 (V: 59.10%) (N:  6.33%) PV: H4 G9 F9 F11 F10 F15 G11 F13 F12 E14 F13 H7 G7 G6 J7 J6 H7 G5 J4 J5
  G5 ->     158 (V: 48.55%) (N: 57.28%) PV: G5 H7 G7 H4 L3 K4 J7 J6 H7 T3
  S5 ->       9 (V: 40.95%) (N:  6.89%) PV: S5 T5 T3 G6 H7
  G6 ->       9 (V: 32.92%) (N:  8.63%) PV: G6 B11 T3 G5 F5
 A12 ->       7 (V: 42.37%) (N:  3.84%) PV: A12 A13 G5 H7
  G9 ->       5 (V: 46.98%) (N:  2.12%) PV: G9 G6 H7 G4
  Q7 ->       3 (V: 44.45%) (N:  1.78%) PV: Q7 Q8 G5
  O8 ->       2 (V: 38.12%) (N:  1.29%) PV: O8 N8
  S4 ->       2 (V: 32.37%) (N:  1.46%) PV: S4 G6
  T3 ->       1 (V: 15.29%) (N:  1.34%) PV: T3 
9.9 average depth, 21 max depth
942 non leaf nodes, 1.65 average children
1558 visits, 225575 nodes, 1260 playouts, 90 n/s

From the exact starting point, LZ-single knows already that the game is now in favor of the opponent and ignores the fake hope H4 which LZ-half sees as 59% win. Instead, LZ-single plays G5 (position A in the picture above) to make sure b F6 is captured.

Click to see the log
Leelaz-Single> play B H10
= 
Leelaz-Single> genmove W
= H11
Thinking at most 14.0 seconds...
NN eval=0.823463
Playouts: 157, Win: 66.04%, PV: H11 J11 A12 B12 B11 G9 F9 F11 F10 F15
Playouts: 318, Win: 66.54%, PV: H11 J11 K2 J3 J2 H5 G5 H7 G7 J6 G6 T3
Playouts: 491, Win: 68.32%, PV: H11 J11 K2 J3 J4 K4 J2 H5 H4 J5 G5 T3 T2 T6
Playouts: 699, Win: 67.00%, PV: H11 J11 K2 J3 J4 K4 J2 H5 H4 J5 G5 T3 T2 T6
Playouts: 871, Win: 67.03%, PV: H11 J11 K2 J3 J4 K4 J2 H5 H4 J5 G5 T3 T2 T5
 H11 ->     929 (V: 67.46%) (N: 55.32%) PV: H11 J11 K2 J3 J4 K4 J2 H5 H4 J5 G5 T3 T2 T5
  K2 ->      16 (V: 63.82%) (N:  8.17%) PV: K2 G9 J4 K4 J3 T3
  J4 ->      14 (V: 59.72%) (N:  8.99%) PV: J4 K4 H11 J11 S5 T5 T3
  Q7 ->       8 (V: 62.93%) (N:  3.47%) PV: Q7 Q8 H11 J11 J4 K4
  O8 ->       5 (V: 60.22%) (N:  2.23%) PV: O8 N8 H11
 K11 ->       4 (V: 61.71%) (N:  1.96%) PV: K11 J11 G11
 G11 ->       1 (V: 53.30%) (N:  1.90%) PV: G11 
  S4 ->       1 (V: 43.86%) (N:  2.56%) PV: S4 
  S5 ->       1 (V: 40.05%) (N:  3.34%) PV: S5 
  G9 ->       1 (V: 36.47%) (N:  1.61%) PV: G9 
 F15 ->       1 (V: 32.29%) (N:  0.82%) PV: F15 
  T3 ->       1 (V: 24.30%) (N:  1.56%) PV: T3 
8.9 average depth, 17 max depth
537 non leaf nodes, 1.83 average children
983 visits, 148854 nodes, 982 playouts, 70 n/s
? Leelaz-Single> play B J11
= 
? Leelaz-Single> genmove W
= K2
Thinking at most 14.0 seconds...
NN eval=0.828084
Playouts: 184, Win: 66.56%, PV: K2 H5 J4 K4 J3 J5 G5 T3 T2 T5 S2 B11 J7
Playouts: 364, Win: 65.94%, PV: K2 H5 J4 K4 J3 J5 G5 T3 T2 T5 S4 T6 S2 B11 O11
Playouts: 539, Win: 64.38%, PV: K2 H5 J4 K4 J3 J5 G5 T3 S4 T5 T2 T7 S5 T6 S2 B11 F15 H7
Playouts: 718, Win: 63.23%, PV: K2 H5 J4 K4 J3 J5 G5 T3 T2 T6 S4 T5 S2 H7 G7 J7 G6 B11 O11 T10 S9 N8
Playouts: 952, Win: 62.24%, PV: K2 H5 J4 K4 J3 J5 G5 T3 S4 T5 T2 T7 S5 T6 S2 B11 F15 H7 G7 J7 G6
  K2 ->    1613 (V: 61.56%) (N: 15.63%) PV: K2 H5 J4 K4 J3 J5 G5 T3 S4 T5 T2 T7 S5 T6 S2 B11 F15 H7 G7 J7 G6
 A12 ->     118 (V: 66.02%) (N:  6.91%) PV: A12 B12 B11 F11 G11 G9 F10 T3 T2 T6 S4 T5 K2
  J4 ->      50 (V: 52.69%) (N: 29.04%) PV: J4 K4 Q7 Q8 S5 T5 T3 T6 K2 J2
  S5 ->      27 (V: 61.31%) (N:  6.96%) PV: S5 T5 T3 B11 K2 J3 J2 H5 G5
 O11 ->      22 (V: 64.24%) (N:  5.49%) PV: O11 B11 K2 H5
  G9 ->      18 (V: 61.52%) (N:  4.86%) PV: G9 B11 K2 J3 J2
 F15 ->      17 (V: 55.07%) (N:  7.75%) PV: F15 B11 K2 H5
  S4 ->      15 (V: 60.61%) (N:  4.68%) PV: S4 B11 K2 H5 G5
  Q7 ->       9 (V: 60.35%) (N:  2.99%) PV: Q7 Q8 J4 K4 S5 T5 T3 T6
  O8 ->       9 (V: 52.54%) (N:  5.59%) PV: O8 N8 P9 O7 K2
  T3 ->       4 (V: 53.08%) (N:  2.24%) PV: T3 B11 K2
 B11 ->       3 (V: 61.11%) (N:  0.63%) PV: B11 T3 T2
  L3 ->       1 (V: 49.23%) (N:  0.65%) PV: L3 
  T5 ->       1 (V: 31.94%) (N:  1.07%) PV: T5 
10.2 average depth, 25 max depth
1001 non leaf nodes, 1.91 average children
1908 visits, 282743 nodes, 1092 playouts, 78 n/s
? Leelaz-Single> play B J3
= 
? Leelaz-Single> genmove W
= J4
Thinking at most 14.0 seconds...
NN eval=0.696038
Playouts: 194, Win: 64.22%, PV: J4 K4 J2 G9 F9 H5 H4 H7 G7 J5 G5 J7 S5
Playouts: 366, Win: 61.36%, PV: J4 H4 J5 K4 H7 J2 S4 B11 K1 G9 F9 F11 F10
Playouts: 528, Win: 57.61%, PV: J4 H4 J5 K4 H7 J2 K1 B11 S4 T6 T3 F15
Playouts: 684, Win: 52.61%, PV: J4 H4 J5 K4 H7 J2 K1 B11 S4 T6 T3 F15
Playouts: 848, Win: 48.10%, PV: J4 H4 J5 K4 H7 J2 K1 B11 S4 T6 T3 F15
  J4 ->     960 (V: 47.12%) (N: 26.31%) PV: J4 H4 J5 K4 H7 J2 K1 B11 S4 T6 T3 F15
  J2 ->     158 (V: 51.62%) (N: 58.67%) PV: J2 H5 G5 H4 L3 K4 S5 T5 T3 B11 H7
  J5 ->       6 (V: 57.53%) (N:  1.51%) PV: J5 G9 F9
  H4 ->       3 (V: 52.70%) (N:  1.74%) PV: H4 J2 L3
  S4 ->       1 (V: 42.65%) (N:  0.89%) PV: S4 
  L3 ->       1 (V: 38.79%) (N:  1.69%) PV: L3 
  H5 ->       1 (V: 35.70%) (N:  0.85%) PV: H5 
  S5 ->       1 (V: 31.41%) (N:  1.00%) PV: S5 
7.8 average depth, 16 max depth
482 non leaf nodes, 2.35 average children
1132 visits, 168320 nodes, 947 playouts, 67 n/s
? Leelaz-Single> undo 
= 
? Leelaz-Single> play W J2
= 
? Leelaz-Single> play B H5
= 
? Leelaz-Single> genmove W
= G5
Thinking at most 14.0 seconds...
NN eval=0.664921
Playouts: 185, Win: 49.21%, PV: G5 H4 L3 K4 A12 A13 H7 T3 T2 T5 S2
Playouts: 367, Win: 51.38%, PV: G5 H4 L3 K4 A12 A13 S5 H7 G6 G9
Playouts: 567, Win: 51.20%, PV: G5 H4 L3 K4 A12 A13 S5 T5 T3 H7 G7
Playouts: 756, Win: 51.27%, PV: G5 H4 L3 K4 S5 T5 T3 B11 H7 T6 O11 O8 Q7
Playouts: 939, Win: 50.42%, PV: G5 H4 L3 K4 A12 T3 T2 T5 S4 S6 S2 H7
  G5 ->    1103 (V: 49.86%) (N: 58.16%) PV: G5 H4 L3 K4 A12 T3 T2 T5 S4 T6 S2 H7 G7
  H4 ->      21 (V: 48.47%) (N:  4.93%) PV: H4 H7 G7 G6 J7 J6 H7
  G6 ->       9 (V: 34.67%) (N:  8.27%) PV: G6 B11 L3 K4 S4
 A12 ->       8 (V: 42.83%) (N:  4.76%) PV: A12 A13 G5 H7 G7 J6
  S5 ->       8 (V: 42.77%) (N:  3.67%) PV: S5 T5 T3 H7
  O8 ->       6 (V: 41.18%) (N:  3.23%) PV: O8 N8 G5 H7 G7
 O11 ->       3 (V: 37.72%) (N:  2.63%) PV: O11 T3 G5
  Q7 ->       2 (V: 42.38%) (N:  1.65%) PV: Q7 Q8
  S4 ->       2 (V: 32.57%) (N:  2.32%) PV: S4 G6
  G9 ->       1 (V: 39.26%) (N:  0.77%) PV: G9 
  L3 ->       1 (V: 37.70%) (N:  0.62%) PV: L3 
  T3 ->       1 (V: 35.83%) (N:  0.76%) PV: T3 
  J4 ->       1 (V: 11.25%) (N:  2.01%) PV: J4 
8.6 average depth, 17 max depth
661 non leaf nodes, 1.76 average children
1167 visits, 170210 nodes, 1051 playouts, 75 n/s

@barrtgt
Copy link

barrtgt commented Nov 21, 2018

These are the results for my pc that I tested earlier:

Single vs. half precision with net a53 at 10 playouts
1600 games 785:815 49.06%

Have you tried detaching and reattaching multiple times with both single and half on the same position to see if it is consistent?

@Splee99
Copy link

Splee99 commented Nov 21, 2018

Yes that was exactly what I have done. I opened two Sabaki windows at the same time though. I would think 10 playouts maybe too short.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants