Morteza Mardani
@mardanimorteza
Principal research scientist @NVIDIA | visiting researcher @Stanford | bridging theory and practice of ML, generative learning, diffusion models
ID: 1301749487912714240
https://mortezamardani.github.io/mardani/ 04-09-2020 05:10:52
110 Tweet
1,1K Followers
1,1K Following
🚀 How far can RL scaling take LLMs? Drop ProRLv2! 🔥We keep expanding LLM’s reasoning boundaries through 3,000+ RL steps over 5 domains and set a new state-of-the-art ✨ among 1.5B reasoning models. 🔗Full blog: research.nvidia.com/labs/lpr/prorl… 🤗Open model: huggingface.co/nvidia/Nemotro…