DeepSeek AI Unveils Janus, a 1.3B Multimodal Model With Decoupled Visual Encoding and Image Generation
Oct 18, 2024, 08:08 AM
DeepSeek AI, in collaboration with researchers from the University of Hong Kong and Peking University, has unveiled Janus, a 1.3 billion parameter multimodal model that handles both image understanding and image generation. Janus is an autoregressive framework that unifies the two tasks in a single transformer while decoupling visual encoding: it uses separate visual encoders for understanding and for generation, which the authors say improves flexibility and performance. The model is built on DeepSeek-LLM-1.3b-base, which was trained on approximately 500 billion text tokens, and it uses SigLIP-L as its vision encoder for understanding. For image generation, it employs a dedicated image tokenizer with a downsampling rate of 16. Despite these capabilities, Janus is compact at only 1.3 billion parameters. It is DeepSeek AI's first multimodal release on Hugging Face, where it is now available for download.
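To make the downsampling rate concrete, the following is a minimal sketch of the arithmetic only, not Janus's actual code. It assumes a tokenizer that maps each 16 x 16 pixel patch to one discrete token; the helper name `image_token_grid` and the 384 x 384 example resolution are assumptions chosen for illustration and are not stated in the story.

```python
def image_token_grid(height: int, width: int, downsample: int = 16) -> tuple[int, int, int]:
    """Return (rows, cols, total_tokens) for an image tokenizer.

    Hypothetical helper: assumes one token per `downsample` x `downsample`
    pixel patch, which is what a downsampling rate of 16 implies.
    """
    if height % downsample or width % downsample:
        raise ValueError("image sides must be divisible by the downsampling rate")
    rows, cols = height // downsample, width // downsample
    return rows, cols, rows * cols


# Assumed example resolution of 384 x 384 pixels.
print(image_token_grid(384, 384))  # (24, 24, 576)
```

Under these assumptions, a 384 x 384 image becomes a 24 x 24 grid, so the autoregressive model would predict 576 image tokens per generated picture.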