Which Gemini 2.0 feature will be publicly released first by June 30, 2025?
Image Generation • 25%
Audio Generation • 25%
Video Understanding • 25%
Other • 25%
Official Google announcements or updates
Google Unveils Gemini 2.0: Faster, Multimodal AI Model Powers New AI Agents
Dec 11, 2024, 04:54 PM
Google has unveiled Gemini 2.0, its most advanced AI model to date, with significant upgrades in speed and performance. According to Jeff Dean, a Google Senior Fellow, "The Gemini 2.0 Flash model is almost universally better than the Gemini 1.5 Pro model but with the latency and speed of the Gemini 1.5 Flash model." The new model offers multimodal capabilities, including native image and audio generation, and is twice as fast as its predecessor. Gemini 2.0 is designed to power a new generation of AI agents: Project Astra, a universal AI assistant; Project Mariner, which can take actions in Chrome; and Jules, a coding agent. Sundar Pichai, Google's CEO, said that Gemini 2.0 marks a "new agentic era" in AI development, bringing the industry closer to the vision of a universal assistant. Developers can now access Gemini 2.0 Flash via the Multimodal Live API in Google AI Studio and Vertex AI, with general availability expected in early 2025. The model's real-time multimodal API enables text-to-speech, image generation, and advanced video understanding. Built on Google's Trillium hardware, Gemini 2.0 has already been integrated into enhanced versions of Google Search and Maps. The model debuts third overall on the LM Arena leaderboard, with top rankings in several subcategories. Although impressive, the model is still experimental, and some features, such as image and audio generation, are available only in private preview.