withprotegeai (@withprotegeai)'s Twitter Profile
withprotegeai

@withprotegeai

The data layer for AI training.

ID: 1953623008163119104

Link: http://withprotege.ai

Joined: 08-08-2025 01:03:20

1 Tweet

2 Followers

1 Following

withprotegeai (@withprotegeai)

Yesterday we announced our exciting new partnership with - check out some clips of their authentic, unscripted human behavior data! 👥 10,000+ contributors... 🏁 across 30+ countries... 📣 50+ languages record conversations, play games, and capture real-world

TwelveLabs (twelvelabs.io) (@twelve_labs)

Data pipelines are becoming the hidden bottleneck for AI. You can’t iterate fast if it takes months to locate precise video moments across massive archives. Our work with Protege AI showed what changes when: 📚 Aggregated, licensed content at scale 🧠 AI that actually

withprotegeai (@withprotegeai)

Uncontaminated, evaluation-ready datasets built by Protege ⬇️ New medical benchmarks for clinical documentation and coding. Importantly, all datasets were held out of pretraining at the patient level — not just the record level — to prevent contamination. These are EMR datasets