aimilios (@aimihat) 's Twitter Profile
aimilios

@aimihat

.

ID: 1224401910775582726

linkhttp://hatzistamou.com calendar_today03-02-2020 18:39:11

208 Tweet

146 Followers

715 Following

Stella Li (@stellalisy) 's Twitter Profile Photo

🤯 We cracked RLVR with... Random Rewards?! Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by: - Random rewards: +21% - Incorrect rewards: +25% - (FYI) Ground-truth rewards: + 28.8% How could this even work⁉️ Here's why: 🧵 Blogpost: tinyurl.com/spurious-rewar…

🤯 We cracked RLVR with... Random Rewards?!
Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:
- Random rewards: +21%
- Incorrect rewards: +25%
- (FYI) Ground-truth rewards: + 28.8%
How could this even work⁉️ Here's why: 🧵
Blogpost: tinyurl.com/spurious-rewar…
aimilios (@aimihat) 's Twitter Profile Photo

One of the most underrated LLM use-cases: picking a movie with friends. Just open voice mode, let everyone share what vibe they’re feeling, and land on a great movie in 3 minutes instead of 30.

Jeff Dean (@jeffdean) 's Twitter Profile Photo

If you're ever in Athens, I highly recommend the Museum of Ancient Greek Technology! maps.app.goo.gl/sngwM98ueKkroo… Early humanoid robots to pour water+wine, demonstrations of heat+steam+water weight to "magically" open 500 kg temple doors, armor, ... (cont)

If you're ever in Athens, I highly recommend the Museum of Ancient Greek Technology! 

maps.app.goo.gl/sngwM98ueKkroo…

Early humanoid robots to pour water+wine, demonstrations of heat+steam+water weight to "magically" open 500 kg temple doors, armor, ...
(cont)
Jason, Yew Siang (@jason_ys) 's Twitter Profile Photo

Day 2. We got tired of Duolingo being slow + repetitive. So we’re building Gaijin Mode - a new way to actually talk about the things you love, even if you’ve never spoken Japanese before. 99 days to go. Final boss: Japan.

Greg Brockman (@gdb) 's Twitter Profile Photo

OpenAI for Greece — includes expanding access to high-quality AI tools in secondary education: openai.com/global-affairs…

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

I don't know what labs are doing to these poor LLMs during RL but they are mortally terrified of exceptions, in any infinitesimally likely case. Exceptions are a normal part of life and healthy dev process. Sign my LLM welfare petition for improved rewards in cases of exceptions.