Loading...
Loading...
Browse all stories on DeepNewz
VisitWhich Gemini 2.0 feature will Google highlight in 2025 marketing?
Multimodal Capabilities • 25%
Agentic Focus • 25%
128k Tokens Context Window • 25%
Other • 25%
Google's marketing materials and advertisements throughout 2025.
Google Launches Gemini 2.0 Flash: Twice as Fast, Multimodal AI with Agentic Focus
Dec 11, 2024, 03:51 PM
Google has unveiled Gemini 2.0, its most advanced AI model to date, designed for the agentic era. The new model, Gemini 2.0 Flash, offers significant improvements over its predecessors, outperforming the Gemini 1.5 Pro on key benchmarks at twice the speed. Gemini 2.0 introduces enhanced multimodal capabilities, including text, image, and audio generation, as well as native tool use, 2D/3D spatial understanding, a 128k tokens context window, and a Multimodal Live API with audio and video streaming inputs. It offers multilingual native audio output and native image generation. The model enables developers to build AI agents capable of interacting with web browsers, controlling Chrome, moving the cursor, clicking buttons, and filling out forms. Gemini 2.0 is made available in an experimental phase to developers via the Gemini API, Google AI Studio, and Vertex AI. Google also unveiled Project Mariner and Project Astra, prototypes demonstrating the agentic capabilities of Gemini 2.0 in creating universal AI assistants and coding agents, including Jules, an AI code agent. Leading figures at Google, including CEO Sundar Pichai and Google DeepMind CEO Demis Hassabis, announced the release and highlighted the model's advancements in multimodality, performance, speed, and agentic focus.
View original story
Image Output • 25%
Audio Output • 25%
Agentic Features • 25%
Coding Assistance • 25%
Image Generation • 25%
Audio Generation • 25%
Video Understanding • 25%
Other • 25%
Text Generation • 25%
Image Generation • 25%
Voice-to-Voice Interaction • 25%
Other • 25%
Google Search • 25%
Google Assistant • 25%
Google Maps • 25%
Other • 25%
Speed • 25%
Multimodal capabilities • 25%
Image editing • 25%
Voice-to-voice interactions • 25%
Google Search • 25%
Google Lens • 25%
Google Maps • 25%
Other • 25%
Real-time voice interactions • 25%
Desktop information retrieval • 25%
Enhanced multimodal capabilities • 25%
Autonomous task execution • 25%
Speed • 25%
Accuracy • 25%
User Interface • 25%
Integration Capabilities • 25%
Meta • 25%
OpenAI • 25%
Other • 25%
Microsoft • 25%