Matthew Yang
@matthewyryang
MSML student @ CMU
ID: 1820131135478808576
04-08-2024 16:14:41
9 Tweet
5 Followers
76 Following
Oh my goodness. GPT-o1 got a perfect score on my Carnegie Mellon University undergraduate #math exam, taking less than a minute to solve each problem. I freshly design non-standard problems for all of my exams, and they are open-book, open-notes. (Problems included below, with links to
"We werenβt born to do jobs." Bill Gates says jobs are a relic of human scarcity. In a world without shortages, society will be able to produce enoughβfood, healthcare, servicesβwithout everyone working. The real shift wonβt be economic. Itβll be reprogramming how we think
Introducing e3 π₯ Best <2B model on math πͺ Are LLMs implementing algos βοΈ OR is thinking an illusion π©.? Is RL only sharpening the base LLM distrib. π€ OR discovering novel strategies outside base LLM π‘? We answer these β€΅οΈ π¨ arxiv.org/abs/2506.09026 π¨ matthewyryang.github.io/e3/
Our view on test-time scaling has been to train models to discover algos that enable them to solve harder problems. Amrith Setlur & Matthew Yang's new work e3 shows how RL done with this view produces best <2B LLM on math that extrapolates beyond training budget. π§΅β¬οΈ
πΒ Introducing Wan2.2: The World's First Open-Source MoE-Architecture Video Generation Model with Cinematic Control! π₯Β Key Innovations: κ· World's First Open-Source MoE Video Model:Β Our Mixture-of-Experts architecture scales model capacityΒ without increasing computational