Khurram Yamin (@khurramyam) 's Twitter Profile
Khurram Yamin

@khurramyam

ML PhD CMU @mldcmu. Interested in Causality and LLM Reasoning - feel free to reach out!

ID: 1449116115250991104

calendar_today15-10-2021 20:53:10

8 Tweet

29 Followers

59 Following

Sukjun (June) Hwang (@sukjun_hwang) 's Twitter Profile Photo

Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data

Aditi Raghunathan (@adtraghunathan) 's Twitter Profile Photo

There’s been a lot of work on unlearning in LLMs, trying to erase memorization without hurting capabilities — but we haven’t seen much success. ❓What if unlearning is actually doomed from the start? 👇This thread explains why and how *memorization sinks* offer a new way forward.

Emily Byun (@yewonbyun_) 's Twitter Profile Photo

💡Can we trust synthetic data for statistical inference? We show that synthetic data (e.g. LLM simulations) can significantly improve the performance of inference tasks. The key intuition lies in the interactions between the moments of synthetic data and those of real data

💡Can we trust synthetic data for statistical inference?

We show that synthetic data (e.g. LLM simulations) can significantly improve the performance of inference tasks. The key intuition lies in the interactions between the moments of synthetic data and those of real data
Jacob Springer (@jacspringer) 's Twitter Profile Photo

Does synthetic data always help text-embedder models? Not quite. The gains are sparse and come with trade-offs. We open-source data + code to make research on synthetic data for embeddings more rigorous. 1/

Does synthetic data always help text-embedder models?
Not quite. The gains are sparse and come with trade-offs.
We open-source data + code to make research on synthetic data for embeddings more rigorous. 1/
Yutong (Kelly) He (@electronickale) 's Twitter Profile Photo

I'm teaching a diffusion & flow matching class at CMU in Spring 2026 where students can use ChatGPT, Cursor, or any AI tool they want. No exams. Just build with open internet. 139 students signed up for 20 spots. Here's what's happening: 🧵 kellyyutonghe.github.io/10799S26/

I'm teaching a diffusion & flow matching class at CMU in Spring 2026 where students can use ChatGPT, Cursor, or any AI tool they want. No exams. Just build with open internet.

139 students signed up for 20 spots.

Here's what's happening: 🧵
kellyyutonghe.github.io/10799S26/
Khurram Yamin (@khurramyam) 's Twitter Profile Photo

I feel like the worst part about the AI Era is that it’s impossible to get any human customer support agents on the line 😭😭