Pranjal Aggarwal (@pranjalaggarw16) 's Twitter Profile
Pranjal Aggarwal

@pranjalaggarw16

PhD Student @LTIatCMU. Working on reasoning, code-gen agents and test-time compute. Prev @IITD

ID: 1297948544091877377

Link: http://pranjal2041.github.io
Joined: 24-08-2020 17:28:37

68 Tweets

400 Followers

84 Following

Sean Welleck (@wellecks) 's Twitter Profile Photo

The recent Claude 3.7 model from Anthropic lets you control the budget for thinking—how might this work?

Check out L1, our fully open recipe for training reasoning models with controllable thinking budgets!
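One plausible mechanism, sketched very roughly in the spirit of L1's length-controlled RL (this is an illustrative assumption, not the paper's exact recipe): reward the model for correctness, minus a penalty proportional to how far its response length strays from the requested thinking budget. The function name and the `alpha` coefficient below are hypothetical.

```python
def length_controlled_reward(is_correct: bool,
                             used_tokens: int,
                             target_tokens: int,
                             alpha: float = 0.001) -> float:
    """Score a sampled response: correctness minus a penalty for
    deviating from the requested thinking budget (L1/LCPO-style sketch)."""
    correctness = 1.0 if is_correct else 0.0
    return correctness - alpha * abs(target_tokens - used_tokens)
```

Trained against a reward like this, the model learns to hit whatever budget the prompt specifies, which is what makes the thinking length controllable at inference time.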
Sean Welleck (@wellecks) 's Twitter Profile Photo

Cool to see our L1 (arxiv.org/abs/2503.04697) methodology used here! 

And a nice insight about using the controllable reasoning budget to enable more efficient use of inference hardware
Sean Welleck (@wellecks) 's Twitter Profile Photo

AlphaVerus has been accepted at #ICML2025!

alphaverus.github.io
arxiv.org/abs/2412.06176

We've seen in math that good verification (e.g., Lean) unlocks surprising capabilities–why not for code too?

AlphaVerus puts LLMs & Rust’s Verus verifier into a self-improving loop–lots
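The self-improving loop described above can be sketched as a propose–verify–refine cycle. In this minimal sketch, `propose` and `verify` are hypothetical placeholders standing in for an LLM call and the Verus verifier; in the actual system, verified programs would enrich the pool of exemplars so later proposals improve without any weight updates.

```python
def self_improving_loop(spec, propose, verify, max_rounds=4):
    """Verifier-in-the-loop refinement: propose a candidate program for
    `spec`, check it formally, and feed verifier errors back into the
    next proposal. Verified solutions join the exemplar pool."""
    exemplars = []   # verified programs, reused as few-shot examples
    feedback = None  # verifier error message from the previous round
    for _ in range(max_rounds):
        candidate = propose(spec, exemplars, feedback)
        ok, feedback = verify(candidate)
        if ok:
            exemplars.append(candidate)
            return candidate, exemplars
    return None, exemplars
```

The key design point is that improvement comes from the growing exemplar pool and verifier feedback, not from gradient updates to the model.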
Shashwat Goel (@shashwatgoel7) 's Twitter Profile Photo

Confused about recent LLM RL results where models improve without any ground-truth signal? We were too. Until we looked at the reported numbers of the pre-RL models and realized they were severely underreported across papers. We compiled discrepancies in a blog below🧵👇

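One frequent source of such baseline discrepancies is how pass@1 is estimated from a small number of samples. As a reference point when re-checking reported numbers, the standard unbiased pass@k estimator from the code-generation evaluation literature is:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k: probability that at least one of k samples,
    drawn without replacement from n total (c of them correct), passes."""
    if n - c < k:
        return 1.0  # not enough failures to fill k draws
    return 1.0 - comb(n - c, k) / comb(n, k)
```

Estimating pass@1 from a single greedy sample, versus averaging over many temperature-sampled generations with this estimator, can easily move a baseline by several points.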
Pranjal Aggarwal (@pranjalaggarw16) 's Twitter Profile Photo

I will be at #ICML2025 this week. Reach out if you want to chat about LLM reasoning, computer-use agents, code-gen, or actually anything! (DMs are open) I will also be presenting AlphaVerus (self-improving verified code gen) this Thursday! alphaverus.github.io

Pranjal Aggarwal (@pranjalaggarw16) 's Twitter Profile Photo

Can LLMs self-improve on code generation? Check out our work AlphaVerus, where the model generates provably correct code and self-improves without any weight updates! At #ICML2025 today:

📆: 11:00 AM - 1:30 PM 
📷: Poster #East-2912

alphaverus.github.io

w/ Bryan, Sean Welleck
Sean Welleck (@wellecks) 's Twitter Profile Photo

Excited about CMU's new Institute for Computer-Aided Reasoning in Mathematics (ICARM), an NSF Mathematical Sciences Research Institute. I'm honored to serve as an Assistant Director, focusing on machine learning and mathematics.