Misha Laskin (@mishalaskin) Twitter Tweets • TwiCopy

Misha Laskin

@mishalaskin

Co-founder, CEO at @reflection_ai.
Prev: Staff Research Scientist @DeepMind.
Gemini RL team.

ID: 1674028088

Link: https://www.reflection.ai/
Joined: 15-08-2013 20:43:53

729 Tweets

13,13K Followers

205 Following

Sequoia Capital

Ioannis Antonoglou helped create one of the most powerful agents built to date, AlphaGo. @mishalaskin led reward modeling for Google Gemini. At Reflection AI, they're building superintelligent coding systems that learn from their mistakes. 🔗 👇 sequoiacap.com/article/reflec…

Likes: 48 · Replies: 7 · Retweets: 10

Marta Jamrozik

@marta_jamrozik

3 months ago

I’m non-technical and built the app I always wanted with AI. I like art and history, and wanted an easy way to learn more about both. I take a photo, wait a few seconds, and get a two-minute audio tour of what I’m seeing. Feels wild to go from non-technical to vibe coding an

Likes: 2,2K · Replies: 224 · Retweets: 97

Steven Lee

@stevenl

2 months ago

I’m thrilled to announce the launch of my $40M pre-seed and seed-stage fund, Seven Stars, where I’ll be focused on partnering with visionary founders building enduring AI application companies across consumer and enterprise technology. Seven Stars is deeply personal. It’s

Likes: 490 · Replies: 100 · Retweets: 41

Kasey Zhang

@_weexiao

a month ago

It’s easy to fine-tune small models w/ RL to outperform foundation models on vertical tasks. We’re open sourcing Osmosis-Apply-1.7B: a small model that merges code (similar to Cursor’s instant apply) better than foundation models. Links to download and try out the model below!

Likes: 1,1K · Replies: 44 · Retweets: 134

Misha Laskin

@mishalaskin

a month ago

The biggest question in RL research has always been - what environment are you training on? It used to be video (Atari) and board (Go / Chess) games. But now that RL works with LLMs, there is only one environment that matters. And it is your product.

Likes: 113 · Replies: 2 · Retweets: 5

Quentin Anthony

@quentinanthon15

a month ago

I was one of the 16 devs in this study. I wanted to speak on my opinions about the causes and mitigation strategies for dev slowdown. I'll say as a "why listen to you?" hook that I experienced a -38% AI-speedup on my assigned issues. I think transparency helps the community.

Likes: 3,3K · Replies: 98 · Retweets: 423

Misha Laskin

@mishalaskin

10 days ago

Something big is going to happen tomorrow. These are some of the most talented researchers I know

Likes: 14 · Replies: 1 · Retweets: 0

Misha Laskin

@mishalaskin

9 days ago

Excited that Skild is finally showing some of the incredible research they've been up to. The team has produced some of the most exciting advances I've seen in robotics.

Likes: 74 · Replies: 2 · Retweets: 4
