Loading...
Loading...
Browse all stories on DeepNewz
VisitMistral AI Releases 25.38 GB Pixtral 12B Multimodal AI Model with 400M Parameters
Sep 11, 2024, 07:31 AM
Mistral AI has released a new multimodal AI model called Pixtral 12B. This model integrates both language and vision processing capabilities, marking a significant advancement in multimodal AI technology. The Pixtral 12B model features a text backbone based on Mistral Nemo 12B and includes a vision adapter with 400 million parameters. The model architecture consists of 40 layers, a hidden dimension of 14,336, a dimension of 5120, a head dimension of 128, 32 heads, and 8 kv-heads. It also has a vocabulary size of 131,072. Additionally, the vision adapter uses GeLU and 2D RoPE, and the tokenizer includes three new special tokens. The model is available for download via a torrent magnet link and has a size of approximately 25.38 GB.
View original story
Markets
Yes • 50%
No • 50%
Official announcements from Mistral AI or the partnering tech company
Yes • 50%
No • 50%
Official download statistics from Mistral AI or reputable third-party trackers
Yes • 50%
No • 50%
Official product announcements and releases from major tech companies
No • 25%
Yes, in a research paper • 25%
Yes, in a project • 25%
Yes, in both • 25%
Publications in reputable journals or conferences, or official project announcements
Top 1 in a benchmark • 25%
Top 5 in a benchmark • 25%
Top 10 in a benchmark • 25%
No significant milestone • 25%
Results published on reputable AI benchmark websites or research papers
Yes, at Microsoft Build 2025 • 25%
Yes, at CES 2025 • 25%
Yes, at Google I/O 2025 • 25%
No • 25%
Official agendas and recordings of major tech conferences