o3 - Search News

News

GPT-5 bombed my coding tests, but redeemed itself with code analysis

GPT-5 Pro, GPT-5 Thinking, and o3 to analyze my code repository - and found surprising differences in detail, reasoning, and ...

OpenAI's performance charts in the GPT-5 launch video are such a mess you have to think GPT-5 itself probably made them, and the company's attempted fixes raise even more questions

OpenAI has since posted some updated charts on its website. The new deception rate chart certainly suggests that a mere mistake was made. The revised stats show GPT-5's coding deception rate at 16.5%, ...

ChatGPT users dismayed as OpenAI pulls popular models GPT-4o, o3 and more — enterprise API remains (for now)

Following this post’s publication, OpenAI co-founder and CEO Sam Altman announced the company would restore access to GPT-4o and other old m ...

5don MSN

OpenAI says GPT-5 hallucinates less — what does the data say?

Specifically, GPT makes incorrect claims 9.6 percent of the time, compared to 12.9 percent for GPT-4o. And according to the ...

Electronic Specifier1d

OpenAI’s o3 beats xAI’s Grok: what AI chess matches teach us

OpenAI’s o3 has defeated xAI’s Grok in a chess match that took place on Google’s Kaggle platform. But what does it prove?

Decrypt2d

Sam Altman's OpenAI Crushes Elon Musk's Grok in AI Chess Championship

Google's AI chess tournament ended Thursday with OpenAI’s o3 model sweeping xAI's Grok in four straight games.

4don MSN

OpenAI fixes 'unintentional chart crime' after people pointed out something was off in the GPT-5 livestream

Some charts featured in its OpenAI's GPT-5 livestream on Thursday had several mistakes, which CEO Sam Altman called a "mega ...

NextBigFuture7d

OpenAI Release O3 Level Open Source Models

Openai is making O3 level capabilities open source ahead of the release of GPT-5 in about two days. These open source models ...

Hosted on MSN7mon

OpenAI's o3 model aced a test of AI reasoning - MSN

The o3 model also failed to solve more than 100 visual puzzle tasks, even when OpenAI applied a very large amount of computing power toward the unofficial score, said Mike Knoop, an ARC Challenge ...

ZDNet7mon

OpenAI's o3 isn't AGI yet but it just did something no other AI has ...

The o3 model scored 76% accuracy on ARC-AGI in an evaluation formally coordinated by OpenAI and the author of ARC-AGI, François Chollet, a scientist in Google's artificial intelligence unit.

‘Like watching kids games’: Magnus Carlsen roasts Elon Musk’s Grok 4 as it loses 4-0 to OpenAI’s o3 in chess tournament

Elon Musk's Grok 4 was the strongest fighter in the eight-player field until the final. But in the final, it made ...

Hosted on MSN6mon

ChatGPT o3-mini will be free for all, and I can’t wait to try the ...

o3-mini is free for all When OpenAI announced the new model in December, my first thought was "how long will we have to wait to try this for free?" Luckily, Altman and co has given free users a ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results