Loading...
Loading...
Browse all stories on DeepNewz
VisitWhat will be the most popular feature of NVIDIA's Fugatto AI model among users by June 30, 2025?
Voice Modification • 25%
Music Generation • 25%
Sound Transformation • 25%
Other • 25%
User surveys or industry analysis reports
NVIDIA Introduces Fugatto AI Model with 2.5B Parameters for Audio Generation and Modification
Nov 25, 2024, 05:47 PM
NVIDIA has unveiled Fugatto (Foundational Generative Audio Transformer Opus 1), a new generative AI model with 2.5 billion parameters designed to generate and modify audio—including music, voices, and sounds—from text and audio prompts. Announced on November 25, 2024, Fugatto can create any combination of music, voice, and sound, transforming sounds in innovative ways such as making a trumpet bark or a saxophone meow. The model, not yet publicly released, was trained on open-source data and is aimed at professionals in the music, film, and gaming industries. According to NVIDIA's Richard Kerris, Fugatto can modify voices, change accents, adjust emotional tones in speech, add instruments to existing music, and generate unique soundscapes, representing NVIDIA's push into audio creativity and innovation.
View original story
Increased Parameters • 25%
New Audio Inputs • 25%
Enhanced Output Quality • 25%
Other • 25%
Custom generative AI models • 25%
Synthetic data generation • 25%
Fine-tuning • 25%
NeMo Retriever microservices • 25%
Music Production • 25%
Advertising • 25%
Gaming • 25%
Other • 25%
Music Composition • 25%
Voice Modification • 25%
Sound Design • 25%
Emotional Tone Alteration • 25%
Music • 25%
Gaming • 25%
Film • 25%
Advertising • 25%
North America • 25%
Europe • 25%
Asia • 25%
Other • 25%
Apple • 25%
Microsoft • 25%
Amazon • 25%
Google • 25%
AI-Generated Comments • 25%
AI-Driven Feedback • 25%
Personalized AI Interactions • 25%
Other • 25%
Advanced Customization • 25%
Faster Rendering • 25%
More Templates • 25%
Other • 25%
Image Recognition • 25%
Natural Language Processing • 25%
Predictive Analytics • 25%
Automated Editing • 25%
Imagen 3 • 25%
DALL-E 3 • 25%
Midjourney v6 • 25%
Stable Diffusion 3 • 25%
Yes • 50%
No • 50%
Film Industry • 25%
Other • 25%
Music Industry • 25%
Gaming Industry • 25%