I have an idea

Recently, I've been playing b18 versus b28.

Some observations:

1. Their evaluations of score and points are often similar.
2. When they differ, there is a fair chance (maybe 20%?) that b18 gets the better result locally.
3. Even in extreme cases, b18 sometimes finds a point that b28 barely considers and that might reverse the result.

I came up with a way to increase b28's win rate: look at the variations of both b18 and b28 and, when they differ, search along b18's path for a while and then judge again.
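To make the idea concrete, here is a rough sketch of the cross-check, not tied to any real engine API. `strong` and `weak` are hypothetical callables standing in for b28 and b18: each takes the moves played so far and returns its preferred move plus a win rate for the side to move. The `depth` and `margin` values are made-up assumptions, not tuned numbers.

```python
from typing import Callable, List, Sequence, Tuple

Move = str
# Hypothetical evaluator interface (an assumption for this sketch):
# given the moves played so far, return (preferred move, win rate for side to move).
Evaluator = Callable[[Sequence[Move]], Tuple[Move, float]]


def cross_check_move(
    history: Sequence[Move],
    strong: Evaluator,   # stands in for b28
    weak: Evaluator,     # stands in for b18
    depth: int = 4,      # how many plies of the weak net's line to follow (assumed)
    margin: float = 0.03,  # required win-rate gain before switching (assumed)
) -> Move:
    """Pick the strong net's move unless the weak net's line re-evaluates better."""
    strong_move, strong_wr = strong(history)
    weak_move, _ = weak(history)

    # The nets agree: nothing to cross-check.
    if strong_move == weak_move:
        return strong_move

    # Follow the weak net's own line for `depth` plies in total.
    line: List[Move] = list(history) + [weak_move]
    for _ in range(depth - 1):
        next_move, _ = weak(line)
        line.append(next_move)

    # Judge again with the strong net at the end of that line.
    _, wr_after = strong(line)

    # wr_after is for the side to move at the end of the line;
    # flip it if an odd number of plies were played.
    plies = len(line) - len(history)
    wr_for_us = wr_after if plies % 2 == 0 else 1.0 - wr_after

    # Switch only if the weak net's line looks clearly better to the strong net.
    return weak_move if wr_for_us > strong_wr + margin else strong_move
```

In practice each callable would wrap a search with the corresponding net. The extra cost is only paid on positions where the two nets disagree.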

The nets are:
b28c512nbt-s8536703232-d4684449769
b18c384nbt-s9996604416-d4316597426

The game itself is trivial; I played only once and this situation came up.

I then wonder: what if we bring in a 6b net when an AI weakness occurs, meaning a simple move that a high-ranked AI misjudges? I also wonder whether this process can be integrated into training and the whole run restarted, so that 10b's training incorporates the best of 6b's judgments, and so on. (Just looking at the one previous net could already help, I think.)

I have an idea #1064
