Loading...
Loading...
Browse all stories on DeepNewz
VisitWill OpenAI's o1-preview model maintain the top spot on LiveBench AI by end of 2024?
Yes • 50%
No • 50%
LiveBench AI rankings
OpenAI o1-Preview Takes First Place Overall on LiveBench AI, Surpasses Claude 3.5 Sonnet
Sep 13, 2024, 02:03 PM
OpenAI's new o1-preview model has claimed the top spot on LiveBench AI, surpassing Anthropic's Claude 3.5 Sonnet, which had led for 85 days. The o1-preview model excels in reasoning, mathematics, data analysis, and language, although it is not perfect in every area. Notably, OpenAI's o1 mini model is reported to be even better at reasoning than the o1-preview. Despite these advancements, Claude 3.5 Sonnet still outperforms the o1-preview in coding tasks. OpenAI o1-preview is now first place overall on LiveBench AI.
View original story
Yes • 50%
No • 50%
Top 10% • 25%
Top 1% • 25%
Top 5% • 25%
Below Top 10% • 25%
Yes • 50%
No • 50%
Yes • 50%
No • 50%
Yes • 50%
No • 50%
Yes • 50%
No • 50%
Yes • 50%
No • 50%
No • 50%
Yes • 50%
No • 50%
Yes • 50%
OpenAI o1-preview • 25%
Other • 25%
OpenAI o1 mini • 25%
Anthropic Claude 3.5 Sonnet • 25%
Other • 25%
OpenAI o1-preview • 25%
Anthropic Claude 3.5 Sonnet • 25%
OpenAI o1 mini • 25%