newsence

@testingcatalog: A new 72% achievement submission for ARC-AGI-2. So far, it is the second multi-model system that out...

Twitter

A new 72% achievement submission for ARC-AGI-2. So far, it is the second multi-model system that outperformed single-model solutions. "It runs the same task through GPT-5.2, Gemini-3, and Claude Opus 4.5 in parallel." We need new benchmarks 👀 https://t.co/SoJnnjh6mL
