AGI Benchmark Anomaly
Cameron Rohn · Category: points_of_view
Gemini’s overperformance on ARC AGI-2 versus ARC AGI-1 suggests uneven task alignment and warrants deeper analysis of benchmark design.
© 2025 The Build. All rights reserved.
Privacy Policy