
Yingru Li
@richardyrli
AI, RL, LLMs, Data Science | PhD@CUHK | ex-intern @MSFTResearch @TencentGlobal | On Job Market
ID: 2152232932
https://richardli.xyz 25-10-2013 07:28:08
549 Tweet
414 Followers
1,1K Following

Excited to share our work led by Yiping Wang RLVR with only ONE training example can boost 37% accuracy on MATH500.






I was really grateful to have the chance to speak at Cohere Labs and ML Collective last week. My goal was to make the most helpful talk that I could have seen as a first-year grad student interested in neural network optimization. Sharing some info about the talk here... (1/6)


Introducing AlphaEvolve: a Gemini-powered coding agent for algorithm discovery. Itās able to: š Design faster matrix multiplication algorithms š Find new solutions to open math problems š Make data centers, chip design and AI training more efficient across Google. š§µ



Padding a transformerās input with blank tokens (...) is a simple form of test-time compute. Can it increase the computational power of LLMs? š New work with Ashish Sabharwal addresses this with *exact characterizations* of the expressive power of transformers with padding š§µ


i'm teaming up with Kyle Corbitt from openpipe to teach a class about agents + RL :) we'll be teaching the class on Maven š starting june 16. as far as we know, this is the first course of its kind anywhere to bridge RL + LLM agents, and weāre really excited to share some of our





