Bhavya Agrawalla (@agrawallabhavya) Twitter Tweets • TwiCopy

Bhavya Agrawalla

@agrawallabhavya

+ Follow

Research Interests - Statistics, Deep Reinforcement Learning.
PhD student @CMU CS.
Prev - Math and CS undergrad at MIT (2021-24) and IISc Bangalore (2020-21).

ID: 1589510567030444033

linkhttps://agrawallabhavya.github.io/ calendar_today07-11-2022 06:50:39

9 Tweet

60 Takipçi

336 Takip Edilen

Demis Hassabis

@demishassabis

a year ago

Advanced mathematical reasoning is a critical capability for modern AI. Today we announce a major milestone in a longstanding grand challenge: our hybrid AI system attained the equivalent of a silver medal at this year’s International Math Olympiad!

thumb_up_off_alt3,3K

chat_bubble_outline166

repeat591

shareShare

Kush Tiwary

@ktiwary2

6 months ago

Yep ! We trained eyeballs from scratch, starting with just light-detecting photoreceptors. 🔬👁️ Why? To simulate vision evolution in-silico and understand why we perceive the world the way we do. 🌍✨ Check it out: eyes.mit.edu

thumb_up_off_alt10

chat_bubble_outline1

repeat3

shareShare

Alireza Mousavi @ ICLR 2025

@alirezamh_

5 months ago

With infinite compute, would it make a difference to use Transformers, RNNs, or even vanilla Feedforward nets? They’re all universal approximators after all. We prove that Yes! You end up with different sample complexity, no matter how much compute/memory you have.👇

thumb_up_off_alt570

chat_bubble_outline6

repeat77

shareShare

Max Sobol Mark

@maxsobolmark

4 months ago

I'll be presenting Policy-Agnostic RL: Fine-Tuning of Any Policy Class and Backbone at the Robot Learning (Sunday) and GenBot (Monday) workshops as Orals at #ICLR2025! Happy to chat or meet!

thumb_up_off_alt31

chat_bubble_outline0

repeat4

shareShare

Yuxiao Qu

@quyuxiao

4 months ago

I am excited to give an oral talk on our work about “Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning” at #ICLR2025 FM-Wild Workshop! 🚀 📍Hall 4 #6 🕚11:30AM, April 27th 🖥️Can’t be there in person, but chat with Ian Wu who’ll present our poster after the talk!

thumb_up_off_alt26

chat_bubble_outline0

repeat2

shareShare

Aviral Kumar

@aviral_kumar2

4 months ago

At #ICLR25 workshops, my students+collabs will give many orals talks on newer stuff (don't miss!): - robot VLA RL fine-tuning Max Sobol Mark - optimizing test-time compute Yuxiao Qu - why RL is crucial for test-time scaling Amrith Setlur - scaling laws for value-based RL

thumb_up_off_alt63

chat_bubble_outline1

repeat5

shareShare

Fahim Tajwar

@fahimtajwar10

3 months ago

RL with verifiable reward has shown impressive results in improving LLM reasoning, but what can we do when we do not have ground truth answers? Introducing Self-Rewarding Training (SRT): where language models provide their own reward for RL training! 🧵 1/n

thumb_up_off_alt819

chat_bubble_outline20

repeat136

shareShare

HBCSE

@hbcse_tifr

a month ago

Spectacular performance by the Indian team at the International Mathematical Olympiad 2025 held at Sunshine Coast, Australia from Jul 10-21, winning 3 Gold, 2 Silver and 1 Bronze! Congratulations Team India you make the nation proud!! #IMO2025, #Matholympiad.

thumb_up_off_alt178

chat_bubble_outline2

repeat53

shareShare