Best network

Open michito744 opened this issue 3 years ago • 7 comments

As indicated by the ratings, kata1-b60c320-s6321537280-d2951683615 is a very well-balanced and powerful network. It withstands very deep searches better than any previous network, with more accurate predictions in PvP. It would be best to run larger searches on this network and feed the results back upstream.

*Even KataGo's best networks are clearly inferior to FineArt. (KataGo needs hundreds of thousands more searches to reach the same evaluation results as FineArt.)

michito744 avatar Aug 01 '22 09:08 michito744

Thanks. Is this in response to the recent ratings of the different networks, in particular the network after this one, which seems to be much worse?

I didn't realize at first from your message why you were singling out this particular network, since it looks very similar to many of the networks before it, but it does seem the latest one might be worse, so sure, I went ahead and manually disabled the newest one from training data so it's back to using b60c320-s6.32G.

lightvector avatar Aug 03 '22 04:08 lightvector

@lightvector

In various complex phases, whether the search develops in a balanced manner is largely a matter of luck.

Up to s6.32, that accuracy tended to increase, but with s6.34, the balance has clearly shifted. I have no idea why this happened and am not interested, but the probable conclusion from the tests is that s6.32 is currently the best we can do.

michito744 avatar Aug 03 '22 06:08 michito744

Yes, but the trend changes. So I don't think this will be the best network.

Chenvincentkevin avatar Aug 15 '22 04:08 Chenvincentkevin

@Chenvincentkevin

I agree. This is just the situation as of now.

michito744 avatar Aug 18 '22 03:08 michito744

@michito744 Would it be a good idea to estimate the score lead in KataGo's official training and rating games? When the lead is large, there may have been a blunder, and we could train on the positions where the blunder happened.

Chenvincentkevin avatar Aug 18 '22 12:08 Chenvincentkevin

For example, take the resigned games and re-estimate how much B/W leads. If it's over 30 or so, we can analyse what happened.

Chenvincentkevin avatar Aug 18 '22 12:08 Chenvincentkevin

I mean especially games played by 40b+ networks.

Chenvincentkevin avatar Aug 18 '22 12:08 Chenvincentkevin
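
The check suggested in the three comments above could be sketched roughly as follows. This is only an illustration under assumptions: `GameRecord`, `largest_swing`, and `flag_games` are hypothetical names, not part of KataGo, and the per-move score leads are assumed to come from re-analysing each game separately. The threshold of 30 points is the "30 or so" mentioned above.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class GameRecord:
    """Hypothetical game record; not a KataGo data structure."""
    game_id: str
    resigned: bool
    # Estimated score lead after each move, from Black's perspective
    # (positive = Black ahead). Assumed to come from a separate re-analysis.
    leads: List[float]


def largest_swing(leads: List[float]) -> Optional[Tuple[int, float]]:
    """Return (move_index, change) for the biggest single-move change in lead."""
    if len(leads) < 2:
        return None
    swings = [(i, leads[i] - leads[i - 1]) for i in range(1, len(leads))]
    return max(swings, key=lambda s: abs(s[1]))


def flag_games(games: List[GameRecord], threshold: float = 30.0):
    """Collect resigned games whose final estimated lead exceeds the threshold.

    Each result pairs the game id with its largest single-move swing,
    which is a candidate location for a blunder.
    """
    flagged = []
    for game in games:
        if not game.resigned or not game.leads:
            continue
        if abs(game.leads[-1]) > threshold:
            flagged.append((game.game_id, largest_swing(game.leads)))
    return flagged
```

The flagged move index only marks where the estimated lead changed most. Whether that move is really a blunder, and whether the position is worth training on, would still need a closer look. Restricting the input to games played by 40b+ networks would be a filter applied before calling `flag_games`.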

It seems that s6.78 is crushingly stronger now.

Chenvincentkevin avatar Jan 02 '23 13:01 Chenvincentkevin