Language Technologies Institute | @CarnegieMellon (@ltiatcmu) 's Twitter Profile
Language Technologies Institute | @CarnegieMellon

@ltiatcmu

The Language Technologies Institute in Carnegie Mellon University's @SCSatCMU

ID: 4901650162

linkhttp://lti.cs.cmu.edu calendar_today12-02-2016 15:12:09

1,1K Tweet

10,10K Takipçi

236 Takip Edilen

Nishant Subramani (@nsubramani23) 's Twitter Profile Photo

🚀 Excited to share a new interp+agents paper: 🐭🐱 MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools appearing at #NAACL2025 This was work done Microsoft last summer with Jason Eisner Justin Svegliato Benjamin Van Durme Yu Su @[email protected] 1/🧵

🚀 Excited to share a new interp+agents paper: 🐭🐱 MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools appearing at #NAACL2025

This was work done <a href="/Microsoft/">Microsoft</a> last summer with <a href="/adveisner/">Jason Eisner</a> <a href="/justinsvegliato/">Justin Svegliato</a> <a href="/ben_vandurme/">Benjamin Van Durme</a> <a href="/ysu_nlp/">Yu Su</a> <a href="/sammthomson/">@sammthomson@mstdn.social</a> 

1/🧵
Kwanghee Choi (@juice500ml) 's Twitter Profile Photo

Can self-supervised models 🤖 understand allophony 🗣? Excited to share my new #NAACL2025 paper: Leveraging Allophony in Self-Supervised Speech Models for Atypical Pronunciation Assessment arxiv.org/abs/2502.07029 (1/n)

Can self-supervised models 🤖 understand allophony 🗣? Excited to share my new #NAACL2025 paper: Leveraging Allophony in Self-Supervised Speech Models for Atypical Pronunciation Assessment arxiv.org/abs/2502.07029 (1/n)
Athiya Deviyani (@athiyad) 's Twitter Profile Photo

🔑 So what now? When picking metrics, don’t rely on global scores alone. 🎯 Identify the evaluation context 🔍 Measure local accuracy ✅ Choose metrics that are stable and/or perform well in your context ♻️ Reevaluate as models and tasks evolve 📄 aclanthology.org/2025.findings-… (🧵9/9)

Kshitish Ghate (@ghatekshitish) 's Twitter Profile Photo

Excited to announce our #NAACL2025 Oral paper! 🎉✨ We carried out the largest systematic study so far to map the links between upstream choices, intrinsic bias, and downstream zero-shot performance across 131 CLIP Vision-language encoders, 26 datasets, and 55 architectures!

Excited to announce our #NAACL2025 Oral paper! 🎉✨   
We carried out the largest systematic study so far to map the links between upstream choices, intrinsic bias, and downstream zero-shot performance across 131 CLIP Vision-language encoders, 26 datasets, and 55 architectures!
Syeda Nahida Akter (@snat02792153) 's Twitter Profile Photo

RL boosts LLM reasoning—but why stop at math & code? 🤔 Meet Nemotron-CrossThink—a method to scale RL-based self-learning across law, physics, social science & more. 🔥Resulting in a model that reasons broadly, adapts dynamically, & uses 28% fewer tokens for correct answers!

RL boosts LLM reasoning—but why stop at math &amp; code? 🤔
Meet Nemotron-CrossThink—a method to scale RL-based self-learning across law, physics, social science &amp; more.

🔥Resulting in a model that reasons broadly, adapts dynamically, &amp; uses 28% fewer tokens for correct answers!
William Chen (@chenwanch1) 's Twitter Profile Photo

What happens if you scale Whisper to billions of parameters? Our #ICML2025 paper develops scaling laws for ASR/ST models, training models with up to 18B params and 360K hours of data, and 100+ languages Joint work b/w Language Technologies Institute | @CarnegieMellon and NVIDIA arxiv.org/abs/2502.10373

What happens if you scale Whisper to billions of parameters?

Our #ICML2025 paper develops scaling laws for ASR/ST models, training models with up to 18B params and 360K hours of data, and 100+ languages

Joint work b/w <a href="/LTIatCMU/">Language Technologies Institute | @CarnegieMellon</a> and <a href="/nvidia/">NVIDIA</a>

arxiv.org/abs/2502.10373
Language Technologies Institute | @CarnegieMellon (@ltiatcmu) 's Twitter Profile Photo

“Go forth and innovate. Disrupt wisely. And carry your LTI pedigree with pride.” Read LTI Director Mona Diab’s message to our 2025 graduating class: lti.cmu.edu/news-and-event…

Graham Neubig (@gneubig) 's Twitter Profile Photo

Some people have said that OpenAI achieved state of the art results on the SWE-Bench Verified leaderboard with their codex model, but that's actually not quite correct, no matter how you measure it. A quick 🧵

Some people have said that OpenAI achieved state of the art results on the SWE-Bench Verified leaderboard with their codex model, but that's actually not quite correct, no matter how you measure it.

A quick 🧵
Lei Li (@lileics) 's Twitter Profile Photo

We are organizing Generative AI for Biology workshop at #ICML2025. Welcome to submit any relevant work on AI for biomolecule, AI model for bio systems, AI and experiments, Agent for bio discovery, new datasets and tools, etc. The deadline is May 25th. genbio-workshop.github.io/2025/

Language Technologies Institute | @CarnegieMellon (@ltiatcmu) 's Twitter Profile Photo

Our MSAII and MCDS programs were both recently ranked among the nations best by TechGuide, with the MSAII program being named the number one program among MS programs in AI. Congrats to all the faculty, students and staff that make us who we are! lti.cmu.edu/news-and-event…

jiatongshi (@jiatongshi) 's Twitter Profile Photo

🚀 Introducing Uni-VERSA: a unified model for multi-dimensional speech evaluation-naturalness, intelligibility, noise, prosody & more. ⚡ 109× faster than native VERSA metric computation 🤗 Pretrained models + Colab demo 🧰 VERSA integration coming! 🔗 huggingface.co/collections/es…

Sean Welleck (@wellecks) 's Twitter Profile Photo

New paper by Andre He: Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening arxiv.org/abs/2506.02355 Tired of sharpening the distribution? Try unlikeliness reward to learn new things from the roads less traveled

New paper by Andre He:

Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening

arxiv.org/abs/2506.02355

Tired of sharpening the distribution? Try unlikeliness reward to learn new things from the roads less traveled
CDT in Speech and Language Technologies (@sltcdt) 's Twitter Profile Photo

Not long until our #SLT #CDT Annual #Conference on 23 June! #Keynote Spotlight: Carlos Busso from Language Technologies Institute | @CarnegieMellon will be giving a talk entitled "Improving Generalization in #Speech #Emotion Recognition". Find out more and #register (by 15 June) at slt-cdt.sheffield.ac.uk/annual-confere…

Not long until our #SLT #CDT Annual #Conference on 23 June! 

#Keynote Spotlight: <a href="/BussoCarlos/">Carlos Busso</a> from <a href="/LTIatCMU/">Language Technologies Institute | @CarnegieMellon</a> will be giving a talk entitled "Improving Generalization in #Speech #Emotion Recognition". 

Find out more and #register (by 15 June) at slt-cdt.sheffield.ac.uk/annual-confere…