Loading...
Loading...
Browse all stories on DeepNewz
VisitAlibaba Releases Qwen2-VL AI Model with Vision-Language Capabilities, 2B and 7B Sizes, Apache 2.0 License
Aug 29, 2024, 04:14 PM
Alibaba has announced the release of its new AI model, Qwen2-VL, which includes vision-language models with state-of-the-art capabilities. The models, available in 2B and 7B sizes, are open-sourced under the Apache 2.0 license. They excel in visual understanding and video analysis, capable of comprehending videos over 20 minutes long. The Qwen2-VL models feature Naive Dynamic Resolution and multimodal RoPE, enhancing their ability to handle various image resolutions and ratios. They also offer multilingual support, including for CJK, Arabic, and European languages, and can be integrated into mobile and robotic applications. The largest model, Qwen2-VL-72B, is available via API and demonstrates superior performance compared to previous models.
View original story
Hindi • 20%
English • 20%
Tamil • 20%
Telugu • 20%
Bengali • 20%
English • 25%
Spanish • 25%
Mandarin • 25%
Other • 25%
Marathi • 20%
Tamil • 20%
Telugu • 20%
Kannada • 20%
Malayalam • 20%
German • 25%
Italian • 25%
Korean • 25%
Portuguese • 25%
Hindi • 20%
Swahili • 20%
Bengali • 20%
Portuguese • 20%
Arabic • 20%
English • 25%
Spanish • 25%
Mandarin • 25%
Other • 25%
Python • 25%
JavaScript • 25%
Java • 25%
Other • 25%
Rust • 25%
Go • 25%
Python • 25%
Other • 25%
Tulu, Awadhi, Marwadi • 25%
Awadhi, Marwadi, Fon • 25%
Marwadi, Fon, Cantonese • 25%
Tulu, Awadhi, Fon • 25%
Llama 3.1 405B • 25%
GPT-4o • 25%
Claude Sonnet 3.5 • 25%
Other • 25%
Tulu • 25%
Awadhi • 25%
Marwadi • 25%
Fon • 25%
Hindi • 25%
Arabic • 25%
Portuguese • 25%
Other • 25%
Other • 25%
Retail • 25%
Finance • 25%
Healthcare • 25%