Skip to content

I have an idea #1064

@Chenvincentkevin

Description

@Chenvincentkevin

Recently, I've been playing b18 versus b28.

I found some conclusions:

  1. theie evaluations on scores and points are often similar.
  2. when it differs, their is quite a possibility(maybe 20%?) that b18 gets a better result locally.
  3. even in exterme case, it happens that b18 find a notlikely mentioned points in 28 that might reverse the result.

I come up with a way to increase win rate for b28, is that both looking at the variation of b18 and b28, when it differs, seek through the b18's path for a while and judge again.

nets are
b28c512nbt-s8536703232-d4684449769
b18c384nbt-s9996604416-d4316597426
/.
and the game is actually trivial, I just played it once and it occurs that situation.

I then wonder, what if we integrate 6b when the AI-weakness, which is that some simple moves but judges wrong by high rank AI occurs? And I wonder whether can integrate this process into training, and restart it whole, which means that 10b's training is integrated with the best of 6b's judges. and so on? (just seek 1 before can be helpful I think).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions