
Paria Rashidinejad
@paria_rd
Incoming Assistant Professor of ECE @USC | Research Scientist at FAIR, GenAI @AIatMeta | PhD @Berkeley_EECS @berkeley_ai @CHAI_Berkeley
ID: 1634327660007669760
10-03-2023 22:57:55
10 Tweet
104 TakipΓ§i
191 Takip Edilen


Excited to share our work led by Yiping Wang RLVR with only ONE training example can boost 37% accuracy on MATH500.




π’ Our travel planner solver (arxiv.org/abs/2410.16456, published in EMNLP Demo Track'24, and arxiv.org/abs/2411.13904) is now open sourced in github.com/facebookresearβ¦ ππ In these works, we build LLM-equipped agent that can take user inputs in natural language, in either the

PPO vs. DPO? π€ Our new paper proves that it depends on whether your models can represent the optimal policy and/or reward. Paper: arxiv.org/abs/2505.19770 Led by Ruizhe Shi Minhak Song


Introducing e3 π₯ Best <2B model on math πͺ Are LLMs implementing algos βοΈ OR is thinking an illusion π©.? Is RL only sharpening the base LLM distrib. π€ OR discovering novel strategies outside base LLM π‘? We answer these β€΅οΈ π¨ arxiv.org/abs/2506.09026 π¨ matthewyryang.github.io/e3/





Excited to release AlgoTune!! It's a benchmark and coding agent for optimizing the runtime of numerical code π algotune.io π algotune.io/paper.pdf π€ github.com/oripress/AlgoTβ¦ with Ofir Press Ori Press Patrick Kidger Bartolomeo Stellato Arman Zharmagambetov & many others π§΅