ICYMI: Grok 4 (Thinking) sets a new state-of-the-art score on ARC-AGI-2 at 15.9%. That nearly doubles the prior commercial SOTA and exceeds the top entry in the current Kaggle competition. 🤯🤯
