Manuel Cherep (@manuelcherep) 's Twitter Profile
Manuel Cherep

@manuelcherep

PhD student @MIT working on behavioral machine learning through agents, audio, and simulations.

bsky.app/profile/mchere…

ID: 705423568738390016

linkhttp://mcherep.github.io calendar_today03-03-2016 16:04:19

27 Tweet

140 Takipçi

324 Takip Edilen

Manuel Cherep (@manuelcherep) 's Twitter Profile Photo

📢 The code for CTAG is now available! You can easily generate sound from prompts by using a modular synthesizer (SynthAX ⚡️) github.com/PapayaResearch…

MIT Media Lab (@medialab) 's Twitter Profile Photo

At #ICML2024, Media Lab researchers Manuel Cherep, Nikhil Singh, and Jessica Shand will present “Creative Text-to-Audio Generation via Synthesizer Programming,” a text-to-audio generation method using a virtual modular synth. Explore the code! ctag.media.mit.edu

At #ICML2024, Media Lab researchers <a href="/manuelcherep/">Manuel Cherep</a>, <a href="/nikhilsinghmus/">Nikhil Singh</a>, and <a href="/jessicashand_/">Jessica Shand</a> will present “Creative Text-to-Audio Generation via Synthesizer Programming,” a text-to-audio generation method using a virtual modular synth. Explore the code! ctag.media.mit.edu
MIT Media Lab (@medialab) 's Twitter Profile Photo

Congratulations to Media Lab student Manuel Cherep on being selected for the 2024–2026 "la Caixa" Fellowship! media.mit.edu/articles/manue…

Shayne Longpre (@shayneredford) 's Twitter Profile Photo

✨New Preprint ✨ How are shifting norms on the web impacting AI? We find: 📉 A rapid decline in the consenting data commons (the web) ⚖️ Differing access to data by company, due to crawling restrictions (e.g.🔻26% OpenAI, 🔻13% Anthropic) ⛔️ Robots.txt preference protocols

✨New Preprint ✨ How are shifting norms on the web impacting AI?

We find:

📉 A rapid decline in the consenting data commons (the web)

⚖️ Differing access to data by company, due to crawling restrictions (e.g.🔻26% OpenAI, 🔻13% Anthropic)

⛔️ Robots.txt preference protocols
Manuel Cherep (@manuelcherep) 's Twitter Profile Photo

Excited to be (jet lagged) at ICML presenting "Creative Text-to-Audio Generation via Synthesizer Programming" Let's talk ML for audio 🤖, synthesizers 🎹🎶, etc. (work ICML Conference w/ Nikhil Singh Jessica Shand) ctag.media.mit.edu

Nikhil Singh (@nikhilsinghmus) 's Twitter Profile Photo

Thrilled to join Dartmouth CS as Assistant Professor in Jan 2025! I’m seeking 1-2 PhD students to join in Fall 2025. Application is by December 15th; please feel free to reach out with any questions. More details here: dartgo.org/ns-phd-apps-20…

Matt Groh (@mattgroh) 's Twitter Profile Photo

I'm recruiting a PhD student to join the Human AI Collaboration lab at Kellogg School Northwestern University Computer Science NICO If you're excited about computational social science, LLMs, digital experiments, real-world problem solving, this could be a great fit Please reshare! Deets 👇

I'm recruiting a PhD student to join the Human AI Collaboration lab at <a href="/KelloggSchool/">Kellogg School</a>  <a href="/northwesterncs/">Northwestern University Computer Science</a> <a href="/NICOatNU/">NICO</a>

If you're excited about computational social science, LLMs, digital experiments, real-world problem solving, this could be a great fit

Please reshare! 

Deets 👇
Shayne Longpre (@shayneredford) 's Twitter Profile Photo

✨New Report✨ Our data ecosystem audit across text, speech, and video (✏️,📢,📽️) finds: 📈 Rising reliance on web, synthetic, and YouTube data. 🛑 80%+ datasets carry hidden restrictions. 🌍 Relative representation in languages and creators has not improved for 10+ yrs.

Shayne Longpre (@shayneredford) 's Twitter Profile Photo

Thrilled our global data ecosystem audit was accepted to #ICLR2025! Empirically, we find: 1⃣ Soaring synthetic text data: ~10M tokens (pre-2018) to 100B+ (2024). 2⃣ YouTube is now 70%+ of speech/video data but could block third-party collection. 3⃣ <0.2% of data from

Thrilled our global data ecosystem audit was accepted to #ICLR2025!

Empirically, we find:

1⃣ Soaring synthetic text data: ~10M tokens (pre-2018) to 100B+ (2024).

2⃣ YouTube is now 70%+ of speech/video data but could block third-party collection.

3⃣ &lt;0.2% of data from
MIT Media Lab (@medialab) 's Twitter Profile Photo

Congratulations to @fluidinterfaces PhD student Manuel Cherep and group head Prof. Pattie Maes on receiving a 2024 Amazon Research Award for their project "Understanding How LLM Agents Deviate from Human Choices"! media.mit.edu/posts/manuel-c…

Keyon Vafa (@keyonv) 's Twitter Profile Photo

Can an AI model predict perfectly and still have a terrible world model? What would that even mean? Our new ICML paper formalizes these questions One result tells the story: A transformer trained on 10M solar systems nails planetary orbits. But it botches gravitational laws 🧵