TechnoSports Media Group
  • Home
  • Technology
  • Smartphones
  • Deal
  • Sports
  • Reviews
  • Gaming
  • Entertainment
No Result
View All Result
  • Home
  • Technology
  • Smartphones
  • Deal
  • Sports
  • Reviews
  • Gaming
  • Entertainment
No Result
View All Result
TechnoSports Media Group
No Result
View All Result

OpenAI Launches o3 Model, Breaks ARC-AGI Benchmark After 5 Years

Ishika Setia by Ishika Setia
December 23, 2024
in FAQ, Technology
0

In the last of the “12 Days of OpenAI” drop announcements, OpenAI has today announced a stunning large update: the opening of the o3 reasoning models and the o3-mini models. Fittingly, o3 also became the first AI model to break the famous ARC-AGI benchmark, which had been undefeated for five years.

OpenAI

Table of Contents

  • OpenAI Launches o3 and o3-mini Models, First to Break ARC-AGI Benchmark in 5 Years, Paving Path to AGI
  • FAQs
    • What is the ARC-AGI benchmark?
    • When will OpenAI release the o3 and o3-mini models?

OpenAI Launches o3 and o3-mini Models, First to Break ARC-AGI Benchmark in 5 Years, Paving Path to AGI

Using high-compute resources and an extended processing window, o3 reached 87.5% on the ARC-AGI Semi-Private Evaluation Set. It surpasses the ARC Prize hurdle (85%) which is generally the level humans achieve. By comparison, OpenAI’s o1 model from earlier has a score of only 32%.

RelatedPosts

Radiohead Breaks Seven-Year Silence: Complete Setlist From the Band’s Triumphant Madrid Return

Meenakshi Chaudhary Transforms Into Archaeologist Daksha for Naga Chaitanya’s Mythological Thriller NC24

Man City vs Liverpool: Five Key Battles That Will Define Guardiola’s Milestone Match

The new measurement represents AI models on their performance in solving new and unseen problems, rather than how good they are at pattern generalization, hence it is called the ARC-AGI test. If passing this test is an indicator, OpenAI’s o3 model is a large step toward Artificial General Intelligence (AGI), which can be considered a system that has the potential to be equal to or even surpass human intelligence.

Besides the ARC-AGI milestone, the o3 model also received high scores in other challenging benchmarks such as 71.7 in SWE-bench Verified, 2,727 in Codeforces, 96.7 in AIME 2024, and 87.7 in GPQA Diamond. On the Frontier Math benchmark, which has historically posed tests for human mathematicians and experts, o3 attained a record-best 25.2% success, following a best-in-class 2.0 score.

The o3-mini model is a distilled version of the o3 model, which is optimized for faster code generation, higher performance, and lower costs. The o3-mini comes with three compute settings—low, medium, and high—beating the o1 model at medium and with lower latency. To avoid legal disputes with the UK-based mobile operator O2, OpenAI avoided calling the model o2. The o3 and o3-mini are being safety tested by OpenAI right now, with the o3-mini expected to become publicly available around January 2025. The o3 model will come next, following further tests and regulatory approval.

FAQs

What is the ARC-AGI benchmark?

ARC-AGI tests AI for generalized intelligence, focusing on problem-solving skills rather than pattern recognition.

When will OpenAI release the o3 and o3-mini models?

The o3-mini will be available by January 2025, while the full o3 model will follow after safety testing.

Tags: o3 Modelo3-mini Modelo3-mini ModelsOpenAi
Previous Post

iGame Shadow DDR5: COLORFUL’s Stunning Memory Redefines Speed!

Next Post

Just In: OPPO Reno13 Series coming to India in January 2025

Related Posts

Entertainment

Radiohead Breaks Seven-Year Silence: Complete Setlist From the Band’s Triumphant Madrid Return

November 9, 2025
Entertainment

Meenakshi Chaudhary Transforms Into Archaeologist Daksha for Naga Chaitanya’s Mythological Thriller NC24

November 9, 2025
FAQ

Man City vs Liverpool: Five Key Battles That Will Define Guardiola’s Milestone Match

November 9, 2025
FAQ

Where Man City vs Liverpool Will Be Won: Five Tactical Battlegrounds That Will Decide the Clash

November 9, 2025
FAQ

Najiba Faiz Private Video Leaked? The Pakistani TV Host Responds to MMS Scandal

November 9, 2025
Subhashree Sahu Leaked MMS Controversy in 2025: What Happened?
FAQ

Subhashree Sahu Leaked MMS Controversy 2025: All Details

November 9, 2025
Next Post
Just In: OPPO Reno13 Series coming to India in January 2025

Just In: OPPO Reno13 Series coming to India in January 2025

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

TechnoSports Media Group

© 2025 TechnoSports Media Group - The Ultimate News Destination

Email: admin@technosports.co.in

  • Terms of Use
  • Privacy Policy
  • About Us
  • Contact Us

Follow Us

No Result
View All Result
  • Home
  • Technology
  • Smartphones
  • Deal
  • Sports
  • Reviews
  • Gaming
  • Entertainment

© 2025 TechnoSports Media Group - The Ultimate News Destination