Loading...
Loading...
Browse all stories on DeepNewz
VisitWhich AI model will have the highest lm-sys Elo score by the end of 2024?
OpenAI o1 • 25%
GPT-4 • 25%
Gemini • 25%
Claude • 25%
lm-sys Elo score rankings
OpenAI Unveils o1 Model, Project Strawberry, Seeks $250M Investment
Sep 19, 2024, 06:40 PM
OpenAI has unveiled its latest AI model, OpenAI o1, also known as Project Strawberry. The new model, including variants o1-preview and o1-mini, is designed to think before answering questions and excels in math and programming challenges. It has been compared to an outstanding PhD student in biomedical sciences and is seen as a significant advancement in AI reasoning. The o1 model has been tested against other leading models, including GPT-4, Gemini, and Anthropic's Claude, showing superior performance. Sam Altman, CEO of OpenAI, confirmed that the o1 model represents a new paradigm in AI development, potentially leading to AGI. OpenAI has also expanded the o1 models to enterprise and educational sectors, offering advanced reasoning capabilities to ChatGPT Enterprise and Edu customers. Additionally, OpenAI is seeking a $250 million investment in its latest funding round, reflecting the high valuation and investor interest in the company. The o1 model has also shown dominance in lmsys Elo scores, leading by over 50 points.
View original story
ChatGPT-4o • 25%
Gemini 1.5 Pro • 25%
Claude-3.5 • 25%
Other • 25%
Nemotron 70B • 25%
ChatGPT4o • 25%
Sonnet 3.5 • 25%
Other • 25%
Google's Gemini • 25%
OpenAI's GPT • 25%
Microsoft's Azure AI • 25%
Other • 25%
Sora • 25%
DALL-E 3 • 25%
Runway Gen-2 • 25%
Other • 25%
Apple's 7B AI model • 25%
Mistral 7B • 25%
Llama 3 8B • 25%
Google's Gemma • 25%
OpenAI's O1 model • 25%
GPT-4 • 25%
Gemini • 25%
Anthropic's Claude • 25%
Llama 3.1 405B • 25%
GPT-4o • 25%
Claude Sonnet 3.5 • 25%
Other • 25%
Llama 3.1 405B • 25%
GPT-4o • 25%
Claude Sonnet 3.5 • 25%
Other • 25%
Meta's Llama 3.1-70B • 25%
OpenAI's GPT-4 • 25%
Google's Bard • 25%
Other • 25%
Grok 3.0 • 25%
OpenAI GPT-5 • 25%
Google DeepMind's latest model • 25%
Other • 25%
DeepSeek-R1-Lite-Preview • 25%
OpenAI's o1-preview • 25%
Google DeepMind's model • 25%
Other • 25%
Claude 3.5 Sonnet • 33%
GPT-4o • 33%
Google's AI Model • 33%
Yes • 50%
No • 50%
Other • 25%
Microsoft • 25%
Google • 25%
Amazon • 25%