Loading...
Loading...
Browse all stories on DeepNewz
VisitWhat will be Moshi's ranking on Hugging Face's model leaderboard by December 31, 2024?
Top 10 • 25%
Top 20 • 25%
Top 50 • 25%
Below Top 50 • 25%
Hugging Face model leaderboard
Kyutai Labs Unveils Moshi: Real-Time Open-Source GPT-4o Alternative
Jul 3, 2024, 08:41 PM
Kyutai Labs, a French AI startup, has unveiled Moshi, a groundbreaking real-time multimodal foundation model capable of listening, speaking, and understanding emotions. Moshi, which can run on consumer laptops and GPUs, is set to be open-sourced, offering a competitive alternative to OpenAI's GPT-4o. Developed by an 8-person team in just six months, Moshi features low latency of under 300ms, achieving 160ms latency with a Real-Time Factor of 2, and supports 70 different emotions and styles. The model's capabilities include real-time conversation, role-playing, and providing explanations. Despite some initial robotic voice quality, Moshi's fast response times and natural interaction have been well-received. The release includes the code, model, and accompanying research paper. Moshi operates with a 7B Multimodal LM and a 2 channel I/O system.
View original story
Top 5 • 25%
Top 10 • 25%
Top 20 • 25%
Outside Top 20 • 25%
Ranked 1st • 25%
Ranked 2nd • 25%
Ranked 3rd to 5th • 25%
Ranked below 5th • 25%
Top 1 • 25%
Top 5 • 25%
Top 10 • 25%
Below Top 10 • 25%
Less than 10,000 • 25%
10,000 to 50,000 • 25%
50,001 to 100,000 • 25%
More than 100,000 • 25%
Nemotron 70B • 25%
ChatGPT4o • 25%
Sonnet 3.5 • 25%
Other • 25%
Apple's 7B AI model • 25%
Mistral 7B • 25%
Llama 3 8B • 25%
Google's Gemma • 25%
Top 1 • 25%
Top 2-5 • 25%
Top 6-10 • 25%
Outside Top 10 • 25%
OpenAI's O1 model • 25%
GPT-4 • 25%
Gemini • 25%
Anthropic's Claude • 25%
No • 50%
Yes • 50%
Better than GPT-4o • 25%
Inconclusive • 25%
Worse than GPT-4o • 25%
Equal to GPT-4o • 25%