|

The DATA Foundation Launches to Tackle AI’s Multi-Billion Dollar Training Data Bottleneck

The DATA Foundation Launches to Tackle AI’s Multi-Billion Dollar Training Data Bottleneck

The DATA Foundation Launches to Tackle AI’s Multi-Billion Dollar Training Data Bottleneck

Palo Alto, United States, June twenty fifth, 2026, Chainwire

Story rebrands as The DATA Foundation, launches DATA Network with flagship Kled AI integration, registering 1.5 billion user-contributed data on the platform

The Foundation additionally introduces Trace, the primary public audit layer for consent, licensing, and knowledge provenance at scale

Today, Story broadcasts a strategic transition to change into The DATA Foundation (“DATA”) and launches Trace, an onchain registry for AI coaching knowledge provenance and licensing. The launch features a flagship integration with Kled, the world’s largest opt-in human knowledge market, registering 1.5 billion user-contributed data on the Network. Andrea Muttoni turns into CEO of The DATA Foundation, and Kled’s founder, Avi Patel, joins in an advisor place because the Chief Data Officer. 

AI’s Training Data Has Hit a Bottleneck

The shift to DATA displays the place the market is pulling hardest. AI coaching knowledge has emerged as essentially the most useful and least solved class of IP. Frontier AI labs have hit a multi-billion-dollar knowledge bottleneck, the place the web has been successfully exhausted for scraping. The remaining provide is both costly and bespoke or legally undocumented, leaving labs with out a method to supply knowledge at scale, show its provenance, or assure its high quality.

The authorized stakes are rising, as frontier labs stake out market-defining merchandise on knowledge sourced via opaque networks, usually with out clear data of consent or jurisdiction. Scraped and undocumented knowledge is now not an choice for enterprise-grade AI.

“The problem in AI has shifted from compute and structure to sourcing and provenance. As the scrapable internet fractures, the query for labs now’s who’s holding the receipts,” stated Andrea Muttoni, CEO of The DATA Foundation. “With Kled, we mix full knowledge transparency and auditability with the most important pool of AI coaching knowledge on the planet.”

Building the Infrastructure for Trusted AI Data

DATA builds on the unique mission to ship an information and mental property (IP) layer for the web, recognizing that the type of knowledge and IP that’s most crucial on this period is AI coaching knowledge. DATA Network brings important infrastructure for coaching AI, anchored by a flagship integration with Kled. Starting at the moment, Kled’s licensing rails and contributor receipts run on DATA Network with added help for secure coin payouts, which entails registering a staggering 1.5 billion user-contributed data with programmatic authorized safeguards.

“Frontier labs have exhausted the provision of high-quality, human-generated public textual content out there on the open internet. Suppliers exhibiting data-sourcing provenance will win the following decade of offers, and that’s our wager,” stated Avi Patel, CEO and founder of Kled and part-time advisory CDO of The DATA Foundation. “Instead of sourcing knowledge blindly, Kled’s knowledge market and DATA’s auditable chain of custody converge on what labs really want to license knowledge with confidence and transparency.” 

Trace Launches because the Public Audit Layer for AI Training Data

Trace, The DATA Foundation’s public audit and search platform, additionally launches at the moment alongside the Kled integration. Trace generates immutable, confidential receipts for each contribution, permitting labs to confirm the legitimacy of datasets in seconds. For each single document uploaded by customers worldwide, a receipt on DATA will probably be generated, enabling upstream compensation for contributors’ knowledge and mental property. This addresses an pressing want for a verifiable and compliant AI coaching knowledge market, which has change into a authorized and operational minefield.

A Wider Contributor Network

DATA’s thesis was validated by Poseidon, the AI knowledge processing undertaking incubated by Story, which cleans, normalizes, and scores uncooked human knowledge for authenticity and high quality, guaranteeing each document that reaches a purchaser is model-ready. Poseidon’s early traction with frontier labs proved the AI coaching knowledge alternative. Backed by a16z and now operating totally on DATA, its contributor app Numo is dwell at the moment, bringing hundreds of contributors into the AI economic system in alternate for real-time payouts. 

“We began Story to construct an IP layer for the web, and a very powerful IP of this period is the information you may’t scrape: how a surgeon’s fingers transfer, how a robotic grips, how folks communicate, drive, and work in the true world,” stated SY Lee, CEO of PIP Labs and strategic adviser to The DATA Foundation. “DATA is the place that conviction goes subsequent: an end-to-end community that proves real-world knowledge’s origin, licenses it, and pays the individuals who made it. “

Token Migration and Ecosystem Continuity

The $IP token migrates to $DATA one-to-one with no motion required from current holders. Migration steering, alternate timing, and an FAQ can be found here.

About The DATA Foundation

Data is the largest bottleneck in frontier AI. The knowledge fashions want most both sits siloed with folks and firms, or doesn’t exist but, and gained’t, till incentives are aligned to create it. DATA Network is the world’s AI audit rails constructed to reply the three questions each lab asks: are you able to supply knowledge at scale, show the place it got here from, and assure its high quality? Contributor apps together with Numo and Kled provide opt-in human knowledge; Trace offers each document a public, tamper-proof receipt; Poseidon turns it into model-ready datasets, so frontier AI can preserve advancing on a basis it will possibly belief. $IP is now $DATA. More info out there at datafdn.org

Contact

HV
henri.vies@piplabs.xyz

The publish The DATA Foundation Launches to Tackle AI’s Multi-Billion Dollar Training Data Bottleneck appeared first on Metaverse Post.

Similar Posts