Abhinav Chinta (@abhinavchinta10) 's Twitter Profile
Abhinav Chinta

@abhinavchinta10

NLP Research @ UIUC | @convai_uiuc

ID: 1704543418867654657

Website: http://abhinavchinta.com · Joined: 20-09-2023 17:10:19

65 Tweets

78 Followers

413 Following

Sumuk (@sumukx) 's Twitter Profile Photo

Feel that GPT's style is not quite what you like? Wish it could understand your preferences implicitly? Our latest work from ConvAI@UIUC, Unsupervised Human Preference Learning (now accepted to the EMNLP 2024 main conference), offers a novel, decoupled solution to the alignment…

Sumuk (@sumukx) 's Twitter Profile Photo

happening today at riverfront hall from 10:30 - 12:00! do drop by - we’ve done something quite clever and think you’ll be super excited!

Abhinav Chinta (@abhinavchinta10) 's Twitter Profile Photo

With a bit of prompting, Gemini 2.0 Flash Experimental was finally able to generate this. Tried with every other model and nothing even came close.

“A horse riding an astronaut.”
Sumuk (@sumukx) 's Twitter Profile Photo

we're launching 🤗 yourbench today, an open source tool for custom benchmarking and synthetic data generation from ANY of your documents. it's a big step towards improving how model evaluations work.

early access link in replies!

(1/8)
Sagnik Mukherjee (@saagnikkk) 's Twitter Profile Photo

🚀 Our ICML 2025 paper introduces “Premise-Augmented Reasoning Chains”, a structured approach that makes the dependencies within reasoning chains explicit.

By revealing the dependencies within chains, we significantly improve how LLM reasoning can be verified.

🧵 [1/n]
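The core idea in the tweet above can be illustrated with a toy sketch (illustrative only, not the paper's implementation): each step in the chain records which earlier steps it actually depends on, turning the linear chain into a DAG, so a verifier checks each step only against its stated premises rather than the entire prefix.

```python
# Toy premise-augmented reasoning chain: each step lists the earlier
# steps it depends on, forming a DAG over the chain. Step ids, texts,
# and the helper below are all made up for illustration.

steps = {
    1: {"text": "x = 3", "premises": []},
    2: {"text": "y = 4", "premises": []},
    3: {"text": "x + y = 7", "premises": [1, 2]},
    4: {"text": "2 * (x + y) = 14", "premises": [3]},
}

def premises_of(step_id):
    """Return the step texts a verifier would check `step_id` against."""
    return [steps[p]["text"] for p in steps[step_id]["premises"]]

# Step 4 is verified against step 3 alone, not the whole chain prefix.
print(premises_of(4))  # ['x + y = 7']
```

The point of the DAG structure is that an error in step 3 localizes: only steps whose premise sets (transitively) include step 3 are suspect, instead of everything after it in the linear order.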
Sagnik Mukherjee (@saagnikkk) 's Twitter Profile Photo

🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models”

From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮

And this isn't a one-off. The pattern holds across RL algorithms and models.

🧵 A Deep Dive
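A measurement like the 86% figure can be sketched as a simple checkpoint diff: compare parameter values before and after RL training and count the fraction left exactly unchanged. The arrays below are made-up stand-ins for real model weights, not the paper's code.

```python
import numpy as np

# Sketch: what fraction of parameters did RL leave untouched?
# `before` and `after` are hypothetical flattened weight vectors.
rng = np.random.default_rng(0)
before = rng.standard_normal(1000)
after = before.copy()
after[:140] += 0.01  # pretend RL updated only a small subnetwork

# Exact elementwise equality, matching the "NOT updated" claim.
unchanged = np.mean(before == after)
print(f"{unchanged:.0%} of parameters unchanged")  # 86% of parameters unchanged
```

On real checkpoints the same comparison would be run per weight tensor (with both checkpoints loaded in the same dtype), since a single flattened vector of this size is only a toy.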
Sagnik Mukherjee (@saagnikkk) 's Twitter Profile Photo

🚀 Headed to #ICML2025 in Vancouver (July 13-19)!

We will present our paper in the poster session at East Exhibition Hall on Tuesday (the 15th) at 4:30 PM PDT.

Happy to chat about reasoning, post-training, and anything LLMs in general!
Abhinav Chinta (@abhinavchinta10) 's Twitter Profile Photo

🚨 Presenting our ICML paper “Premise-Augmented Reasoning Chains” at 4:30 PM PT today in East Exhibition Hall A-B (E-2410). Come check out our work on improving error detection in CoT using DAGs! Link to paper: abhinavchinta.com/parc/ #icml25

Sumuk (@sumukx) 's Twitter Profile Photo

100% agree with Will. building my first 3090 cluster with Abhinav Chinta was such a great learning experience in sourcing cheap components from Shenzhen, dealing with riser retiming issues, hacking power supplies together, etc. the “just buy the service” advice is toxic.

Anjiang Wei (@anjiangw) 's Twitter Profile Photo

We introduce SuperCoder, the first work to successfully apply LLMs as superoptimizers for assembly code 🚀. Our RL-trained model achieves 95% correctness ✅ and a 1.46× speedup ⚡ over gcc -O3.
📄 arxiv.org/pdf/2505.11480
💻 github.com/Anjiang-Wei/Su…
#LLM #Compilers #Code #Optimization