You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I read the paper Adversarial Policies Beat Superhuman Go AIs (T. Wang et al, 2023) and test b28-s6797 recently. b28 successfully handled the cyclic groups in that paper, but still failed to evaluated static state containing large cyclic groups across more than half of 19*19 board after >100k visits. I know that these states barely appear in real games. However, I think that the patch against cyclic group problem may be only some data augmentation. It didn't make KataGo aware of liberties or other concepts that are simple for human players.
According to the "最佳选点" visualization in Katrain, I think that KataGo's main flaw is viewing equivalent moves as different ones. It searches very broad space with shallow depth when many equivalent moves exist. In the example of d.png in zip file, it distributes visits on several dozen of moves with several k visits. Human players understand the concept of liberties by merging equivalent states and compressing the state space to only one integer, but KataGo appears not in its searching pattern. By the way, the board state with same stone positions and different orders can be merged in MCTS, but I haven't find an elegant implementation.
I think that identifying equivalent states may reduce searching cost and enhance robustness in large group battles, since the essence of many battles is liberties. But equivalent moves may have different P and V because of the random noises in NN training, thus I don't know how to implement the identification now.
The text was updated successfully, but these errors were encountered:
I read the paper Adversarial Policies Beat Superhuman Go AIs (T. Wang et al, 2023) and test b28-s6797 recently. b28 successfully handled the cyclic groups in that paper, but still failed to evaluated static state containing large cyclic groups across more than half of 19*19 board after >100k visits. I know that these states barely appear in real games. However, I think that the patch against cyclic group problem may be only some data augmentation. It didn't make KataGo aware of liberties or other concepts that are simple for human players.
test.zip
According to the "最佳选点" visualization in Katrain, I think that KataGo's main flaw is viewing equivalent moves as different ones. It searches very broad space with shallow depth when many equivalent moves exist. In the example of d.png in zip file, it distributes visits on several dozen of moves with several k visits. Human players understand the concept of liberties by merging equivalent states and compressing the state space to only one integer, but KataGo appears not in its searching pattern. By the way, the board state with same stone positions and different orders can be merged in MCTS, but I haven't find an elegant implementation.
I think that identifying equivalent states may reduce searching cost and enhance robustness in large group battles, since the essence of many battles is liberties. But equivalent moves may have different P and V because of the random noises in NN training, thus I don't know how to implement the identification now.
The text was updated successfully, but these errors were encountered: