Loading...
Loading...
Browse all stories on DeepNewz
VisitWhich Gemini 2.0 project will launch public beta first by May 31, 2025?
Project Astra • 25%
Project Mariner • 25%
Jules • 25%
Deep Research • 25%
Official announcements from Google or public beta launch details
Google Launches Gemini 2.0 AI with Native Image, Audio Output, and Agentic Features
Dec 12, 2024, 02:49 PM
Google has unveiled Gemini 2.0, its next-generation AI model, introducing significant advancements in its capabilities. The new model, which includes an experimental version called Gemini 2.0 Flash, aims to enhance Google's AI agent agenda by offering native image and audio output, as well as improved performance in coding, reasoning, and visual understanding. Gemini 2.0 is designed to enable the creation of AI agents that can act autonomously on behalf of users, with features like real-time video and image analysis, integration with Google services, and the ability to generate detailed reports from web research. This model will be available across Google's ecosystem, including Google AI Studio, Vertex AI, and the Gemini AI assistant, with plans to expand its implementation in 2025. Google's launch of Gemini 2.0 comes amidst increasing competition in AI development and regulatory scrutiny over its search engine and Chrome browser. Key projects include Project Astra, a universal AI assistant; Project Mariner, an agent for web tasks; Jules, an AI coding agent; and Deep Research, which automates research and report generation. The model also supports multimodal outputs like images and audio.
View original story
Image Generation • 25%
Audio Generation • 25%
Video Understanding • 25%
Other • 25%
Project Astra • 25%
Project Mariner • 25%
Multimodal Voice API • 25%
Other • 25%
Real-time conversation • 25%
Screen sharing • 25%
Image sharing • 25%
Other • 25%
Speed • 25%
Accuracy • 25%
User Interface • 25%
Integration Capabilities • 25%
AI Assistant (Project Astra) • 25%
Browser-based tasks (Project Mariner) • 25%
AI Coding (Jules) • 25%
Gaming (Gemini 2.0 for Games) • 25%
Web-based tasks • 25%
Gameplay advice • 25%
In-depth research • 25%
Wearable technology • 25%
Text Generation • 25%
Image Generation • 25%
Voice-to-Voice Interaction • 25%
Other • 25%
Search Enhancements • 25%
YouTube Content Management • 25%
Android Integration • 25%
Other • 25%
Healthcare • 25%
Finance • 25%
Automotive • 25%
Retail • 25%
Image Output • 25%
Coding Assistance • 25%
Agentic Features • 25%
Audio Output • 25%