ICYMI: Grok 4 (Thinking) sets a new state-of-the-art benchmark on ARC-AGI-2, scoring 15.9%. This result almost doubles the prior commercial SOTA while exceeding the top entry in the current Kaggle competition. 🤯🤯
2
0
0
Comments
0
Reposts