Will Grass Network's UpvoteWeb-24-600M dataset face a significant legal challenge by the end of 2024?
Yes • 50%
No • 50%
Legal news databases and court records
Grass Network on Solana Open-Sources 600 Million Reddit Posts for AI Training
Jul 4, 2024, 03:51 AM
Grass Network, the data layer of AI on Solana, has open-sourced a dataset containing 600 million top Reddit posts and comments from 2024. The dataset, named UpvoteWeb-24-600M, includes media links and reply lineage and has been anonymized to preserve user privacy. The data, gathered by 2 million nodes globally in just one week, is intended to make AI training more accessible to developers and to level the playing field with centralized model training sets. The release marks a significant milestone for the Grass ecosystem and the broader AI community.