@demishassabis:
The AI field is in need of harder benchmarks to test the capabilities of the latest AI models. This update to @Kaggle Game Arena, with werewolf and poker (heads-up) plus chess, gives us new objective measures of real-world skills like planning and decision-making under uncertainty.