Sahil Verma
@sahil1v
PhD student @uwcse. Robustness and Interpretability. Currently at @MSFTResearch. Former intern at @amazon, @itsArthurAI. Undergrad @IITKanpur
ID: 1896456847
https://vsahil.github.io 23-09-2013 06:55:38
548 Tweets
518 Followers
1.1K Following
"Vibe coding" is everywhere, but is it really care-free? We introduce an RL framework that trains LLMs with automated program analysis feedback, enabling "vibe coding" to be not just fast, but vulnerability-free & production-ready.
Code is live! Check out LoRe, a modular, lightweight codebase for personalized reward modeling from user preferences. Few-shot personalization. Benchmarks: TLDR, PRISM, PersonalLLM. github.com/facebookresear… Huge thanks to AI at Meta for open-sourcing this research.
Struggling with fine-tuning MoE? Meet DenseMixer, an MoE post-training method that offers a more precise router gradient, making MoE easier to train and better performing! Blog: fengyao.notion.site/moe-posttraini…
I will be at the Actionable Interpretability Workshop (ICML 2025, #ICML) presenting *SSAEs* in the East Ballroom A from 1-2pm. Drop by (or send a DM) to chat about (actionable) interpretability, (actionable) identifiability, and everything in between!
Transformers struggle with length generalization and long context. What can we do about it? Our new #TMLR paper with Roland Fernandez, Paul Smolensky and Jianfeng Gao shows how to handle the issue using a new attention mechanism called TRA. Curious? Read the 🧵 for more.
RFdiffusion3 generates all-atom bound conformations, making it significant for flexible targets like DNA. Excellent teamwork to achieve something that would have been impossible for any one of us, in just a few months. Jasper Butcher, Rohith Krishna biorxiv.org/content/10.110…
Today, we report a method for the design of active enzymes, RFdiffusion2, in Nature Methods. For the first time, we are able to design enzymes with native-range catalytic activity. We are also releasing the code for our next frontier model, RFdiffusion3.