MARCH 25, 2026 · RESEARCH

From 0% to 36% on Day 1 of ARC-AGI-3

Achieving 36% on ARC-AGI-3 using the Agentica framework.

The Agentica SDK by Symbolica achieves an unverified competition score of 36.08% on ARC-AGI-3 [1], passing 113 out of 182 playable levels, and completes 7 out of the 25 available games [2].

Our implementation outperforms CoT baselines of 0.2% (Opus 4.6 Max) and 0.3% (GPT 5.4 High), while maintaining a far lower cost: Agentica's 36.08% for $1,005 vs. Opus 4.6's 0.25% for $8,900.

Check out the code on GitHub symbolica-ai/ARC-AGI-3-Agents
ARC-AGI-3: Score vs Cost0%10%20%30%40%Score (%)$1$10$100$1k$10kCost ($)Gemini 3.1Pro(Preview)Grok 4.20(BetaReasoning)GPT-5.4(High)Opus4.6(Max)SOTAAgentica Opus4.6 (High)
Figure 1. A comparison of the score and cost per task on the ARC-AGI-3 public eval set between Chain of Thought (CoT) models and the Agentica ARC-AGI-3 agent for Opus 4.6 (120k) High. For details on the cost per task for Agentica Opus 4.6 (120k) High see the code.
97.6%
118 actions
CN0497.6% WIN
84.16%
273 actions
LP8584.16% WIN
83.28%
516 actions
AR2583.28% WIN
77.59%
123 actions
FT0977.59% WIN

Score Breakdown - All Games

Beat human baselineGame wonGame ended
GameL1L2L3L4L5L6L7L8L9L10Score
CN04201922213597.60
LP851711181823153191384.16
AR255030972873841064783.28
FT09371421215677.59
CD8260365714162070.15
TR8742323929433,96269.21
TU9317182345816214914867.87
KA5937563752271135965.33
SB2618221152017196720349.35
M0R02543121126140.06
RE862437611326628026335.54
SU151623217105136902715035.17
S5I533727714136523.85
WA303958868013222.22
SC25789304218.42
VC3311152914317.14
DC22949911412815.56
G50T691804678.70
LS20263872512132125027.13
LF52231372461749285.36
R11L44324.76
TN3657695281.31
SK4874722661.21
SP80281200.73
BP3548100.22
Overall36.08

Chat with Agentica

We've sandboxed the SDK and let it run any persistent task, including solving ARC puzzles.

Read about Agentica here

References

[1] ARC Prize Foundation. ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence. Arc Prize Foundation.

[2] ARC Prize. ARC-AGI-3. ARC Prize.

Appendix

A note on scoring

Human baseline scores available via the ARC-AGI-3 API state that the game cn04 has 6 levels in total. This does not match the number of levels in the corresponding game available via the API.

From 0% to 36% on Day 1 of ARC-AGI-3 | Symbolica Blog