
Aviral Kumar
@aviral_kumar2
Assistant Professor of CS & ML at @CarnegieMellon. Part-time Research Scientist Google. PhD from UC Berkeley.
ID: 737487375648100352
http://aviralkumar2907.github.io 31-05-2016 03:34:27
294 Tweet
4,4K Followers
345 Following


Introducing e3 π₯ Best <2B model on math πͺ Are LLMs implementing algos βοΈ OR is thinking an illusion π©.? Is RL only sharpening the base LLM distrib. π€ OR discovering novel strategies outside base LLM π‘? We answer these β€΅οΈ π¨ arxiv.org/abs/2506.09026 π¨ matthewyryang.github.io/e3/
