The programs competing in each of these games use the same zero-knowledge reinforcement learning algorithms (fundamentally different from the AlphaZero algorithms), created by Quentin Cohen-Solal and then improved during the PRAIRIE postdoc. This is the first time that a single set of algorithms has won 5 gold medals at the Computer Olympiads.



