Which dataset will be the most queried using Hugging Face's SQL console by December 31, 2024?
IMDB dataset • 25%
Common Crawl dataset • 25%
Wikipedia dataset • 25%
Other • 25%
Usage statistics from Hugging Face
Hugging Face Launches In-Browser SQL Console for Over 200,000 Datasets, Powered by DuckDB and WebAssembly
Sep 12, 2024, 05:02 AM
Hugging Face has introduced a SQL console that lets users run SQL queries directly on more than 200,000 public datasets hosted on its platform. Powered by DuckDB and WebAssembly, the console runs in the browser, so users can query, filter, and explore datasets without downloading them. Queries can be shared via URL, and query results can be exported as Parquet files. The feature is aimed at data analysts and researchers who want quick insight into a dataset; as an example, users can pull up the positive samples in the IMDB dataset.
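To make the IMDB example concrete, here is a minimal sketch of the kind of query involved. It does not call the Hugging Face console: it uses Python's built-in sqlite3 module as a local stand-in for DuckDB, on a few made-up rows. The table name `train`, the columns `text` and `label`, and the convention that `label = 1` means positive are assumptions for illustration, and `positive_samples` is a hypothetical helper.

```python
import sqlite3

# Made-up stand-in rows; the real IMDB dataset on the Hub is far larger.
# Assumed schema: text (review body), label (1 = positive, 0 = negative).
ROWS = [
    ("A moving, beautifully shot film.", 1),
    ("Two hours I will never get back.", 0),
    ("Great cast and a sharp script.", 1),
]


def positive_samples(rows, limit=10):
    """Return up to `limit` review texts whose label marks them as positive."""
    conn = sqlite3.connect(":memory:")  # sqlite3 stands in for DuckDB here
    try:
        conn.execute("CREATE TABLE train (text TEXT, label INTEGER)")
        conn.executemany("INSERT INTO train VALUES (?, ?)", rows)
        cursor = conn.execute(
            "SELECT text FROM train WHERE label = 1 ORDER BY rowid LIMIT ?",
            (limit,),
        )
        return [row[0] for row in cursor]
    finally:
        conn.close()


if __name__ == "__main__":
    for text in positive_samples(ROWS):
        print(text)
```

The `SELECT ... WHERE label = 1 LIMIT ...` statement is the part that would carry over to a browser-based console; the table setup exists only so the sketch runs offline.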