vmoens (@vincentmoens)'s Twitter Profile
vmoens

@vincentmoens

TorchRL maintainer (@torchrl1) - PyTorch SWE @ Meta - London
Neuroscience PhD, ex-MD

vmoens on the butterfly platform

ID: 3373571519

Link: http://github.com/vmoens/

Joined: 13-07-2015 07:21:24

487 Tweets

1.1K Followers

677 Following

Brian Bartoldson (@bartoldson)

🚀 We fixed a major LLM post-training bottleneck! Our new method (TBA) combines trajectory balance with asynchronous training to speed up LLM RL 5-50x while improving results+scalability. For example, using VinePPO's GSM8K setup, we obtain +1.2% accuracy and 50x faster RL.

vmoens (@vincentmoens)

We worked a bit on non-tensor data within tensordict, it's looking pretty good! The reshaping feature is really joyful to play with


vmoens (@vincentmoens)

The true coding assistant will be able to understand when I say "Oh you know what I mean c'mon"
Now I still feel it's the assistant telling me "Oh you know what I mean"

torchrl (@torchrl1)

torchrl 🤝 gymnasium happy ever after
With the help of the Farama Foundation team, we managed to make TorchRL compatible with gymnasium v1.1 onward!

vmoens (@vincentmoens)

It’s mid 2025, we have autonomous cars, AI etc, and almost all the online forms I fill still mistakenly put a cap letter at the beginning of my email address