Misha Laskin (@mishalaskin) 's Twitter Profile
Misha Laskin

@mishalaskin

Co-founder, CEO at @reflection_ai.
Prev: Staff Research Scientist @DeepMind.
Gemini RL team.

ID: 1674028088

linkhttps://www.reflection.ai/ calendar_today15-08-2013 20:43:53

729 Tweet

13,13K Followers

205 Following

Sequoia Capital (@sequoia) 's Twitter Profile Photo

.Ioannis Antonoglou helped create one of the most powerful agents built to date, AlphaGo. @mishalaskin led reward modeling for Google Gemini. At Reflection AI, they're building superintelligent coding systems that learn from their mistakes. 🔗 👇 sequoiacap.com/article/reflec…

Marta Jamrozik (@marta_jamrozik) 's Twitter Profile Photo

I’m non-technical and built the app I always wanted with AI. I like art and history, and wanted an easy way to learn more about both. I take a photo, wait a few seconds, and get a two-minute audio tour of what I’m seeing. Feels wild to go from non-technical to vibe coding an

Steven Lee (@stevenl) 's Twitter Profile Photo

I’m thrilled to announce the launch of my $40M pre-seed and seed-stage fund, Seven Stars, where I’ll be focused on partnering with visionary founders building enduring AI application companies across consumer and enterprise technology. Seven Stars is deeply personal. It’s

Kasey Zhang (@_weexiao) 's Twitter Profile Photo

It’s easy to fine-tune small models w/ RL to outperform foundation models on vertical tasks. We’re open sourcing Osmosis-Apply-1.7B: a small model that merges code (similar to Cursor’s instant apply) better than foundation models. Links to download and try out the model below!

Misha Laskin (@mishalaskin) 's Twitter Profile Photo

The biggest question in RL research has always been - what environment are you training on? It used to be video (Atari) and board (Go / Chess) games. But now that RL works with LLMs, there is only one environment that matters. And it is your product.

Quentin Anthony (@quentinanthon15) 's Twitter Profile Photo

I was one of the 16 devs in this study. I wanted to speak on my opinions about the causes and mitigation strategies for dev slowdown. I'll say as a "why listen to you?" hook that I experienced a -38% AI-speedup on my assigned issues. I think transparency helps the community.

I was one of the 16 devs in this study. I wanted to speak on my opinions about the causes and mitigation strategies for dev slowdown.

I'll say as a "why listen to you?" hook that I experienced a -38% AI-speedup on my assigned issues. I think transparency helps the community.
Misha Laskin (@mishalaskin) 's Twitter Profile Photo

Excited that Skild is finally showing some of the incredible research they've been up to The team has produced some of the most exciting advances I've seen in robotics