OpenAI's new model o3 smashes AGI benchmarks, is one of the 200 best coders in the world

Dec. 23, 2024.

Conor O'Higgins

Conor O'Higgins is the Editor of Mindplex Magazine

On 20 December, OpenAI announced o3, its latest model, which the company claims demonstrates aspects of AGI (Artificial General Intelligence).

ARC is an AGI benchmark developed in 2019 to be a hard test that requires human-like reasoning. OpenAI’s o3 has just achieved a score of 75.7%. For comparison, no model in the first year of the test surpassed five per cent.

o3 also reaches a rating of 2727 on Codeforces, the competitive programming platform. This would make it the 175th-best competitive coder in the world.

o3 succeeds OpenAI’s o1 model, released in September. Both models are trained with reinforcement learning to work through a series of logical reasoning steps before answering. This makes them a lot slower and more computationally intensive, but improves their performance on tasks like mathematics and science.

o3 has still not been made available to the public, so these evaluations were conducted behind closed doors. It’s also worth remembering that all tests are limited, and AGI, when it comes, will prove itself in the din of daily life, not in any test.

Another limitation of o3 is its cost: the graph at the top of ARC Prize’s announcement shows that it scores about 3× as high on the test as o1, for a 10× to 1,000× increase in cost.
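To put that trade-off in numbers, here is a minimal sketch that assumes the figures above mean roughly 3× the score for between 10× and 1,000× the cost. The function name and the "score per unit of cost" measure are illustrative choices, not something ARC Prize or OpenAI reports.

```python
def relative_score_per_cost(score_multiple: float, cost_multiple: float) -> float:
    """Score obtained per unit of cost, relative to the baseline model (baseline = 1.0)."""
    if cost_multiple <= 0:
        raise ValueError("cost_multiple must be positive")
    return score_multiple / cost_multiple


# Assumed inputs, read off the article's rough figures.
SCORE_MULTIPLE = 3.0             # o3 scores "about 3x" as high as o1
COST_MULTIPLES = (10.0, 1000.0)  # the two ends of the quoted cost range

efficiency = {
    cost: relative_score_per_cost(SCORE_MULTIPLE, cost) for cost in COST_MULTIPLES
}

for cost, ratio in efficiency.items():
    print(f"{cost:.0f}x the cost: {ratio:.3f}x the score per unit of cost")
```

On these assumptions, each unit of spending buys somewhere between 0.3× and 0.003× as much score as it did with o1, which is why cost counts as a real limitation despite the headline result.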

Still, it has gotten people excited about progress towards AGI again.

One thought on “OpenAI’s new model o3 smashes AGI benchmarks, is one of the 200 best coders in the world”

hana gebremedhin
4 months ago

OpenAI's o3 achieving a 75.7% score on the ARC benchmark is a huge leap towards AGI, showcasing remarkable progress in human-like reasoning. While its high cost and limited availability remain challenges, the advancements in AI are undeniable and exciting for the future!

