Nishant Subramani (@nsubramani23) 's Twitter Profile
Nishant Subramani

@nsubramani23

PhD student @LTIatCMU working on model interpretability // Prev: intern @msftresearch, predoc @allen_ai in #NLProc // @BVB supporter //
he/him

ID: 454318852

linkhttp://nishantsubramani.github.io calendar_today03-01-2012 21:30:55

733 Tweet

674 Followers

1,1K Following

Apoorv Khandelwal (@apoorvkh) 's Twitter Profile Photo

Wondering how long it takes to train a 1B-param LM from scratch on your GPUs? 🧵 See our paper to learn about the current state of academic compute and how to efficiently train models! Use our code to test your own models/GPUs! arxiv.org/abs/2410.23261 github.com/apoorvkh/acade…

Clara Na (@claranahhh) 's Twitter Profile Photo

Building/customizing your own LLM? You'll want to curate training data for it, but how do you know what makes the data good? You can try out recipes👩‍🍳 iterate on vibes✨ but we can't actually test all possible combos of tweaks,,, right?? 🙅‍♂️WRONG! arxiv.org/abs/2410.15661 (1/n) 🧵

Building/customizing your own LLM? You'll want to curate training data for it, but how do you know what makes the data good?

You can try out recipes👩‍🍳 iterate on vibes✨ but we can't actually test all possible combos of tweaks,,, right?? 🙅‍♂️WRONG! arxiv.org/abs/2410.15661 (1/n) 🧵
Nishant Subramani (@nsubramani23) 's Twitter Profile Photo

Presenting this today at the poster session at #NAACL2025! Come chat about interpretability, trustworthiness, and tool-using agents! 🗓️ - Thursday May 1st (today) 📍 - Hall 3 🕑 - 200-330pm

Nishant Subramani (@nsubramani23) 's Twitter Profile Photo

Excited to announce that I started at Google Cloud as a student researcher last month working with Hamid Palangi on actionable #interpretability 🔍 to build better tool using #agents ⚒️🤖

Michael Li (@bearseascape) 's Twitter Profile Photo

🚨New #interpretability paper with Nishant Subramani@ACL🇦🇹 :🕵️Model Internal Sleuthing: Finding Lexical Identity and Inflectional Morphology in Modern Language Models

🚨New #interpretability paper with <a href="/nsubramani23/">Nishant Subramani@ACL🇦🇹</a> :🕵️Model Internal Sleuthing: Finding Lexical Identity and Inflectional Morphology in Modern Language Models
Nishant Subramani (@nsubramani23) 's Twitter Profile Photo

At #ICML2025 till Sunday! Love to chat about #interpretability, understanding model internals, and finding yummy vegan food in Vancouver 🥬🍜

Nishant Subramani (@nsubramani23) 's Twitter Profile Photo

At #ACL2025 in Vienna 🇦🇹 till next Saturday! Love to chat about anything #interpretability 🔎, understanding model internals 🔬, and finding yummy vegan food 🥬