Manan Roongta (@mananroongta)'s Twitter Profile
Manan Roongta

@mananroongta

Researcher @BerkeleySky | Building @rllm_project | EECS @UCBerkeley

ID: 973509024

http://www.linkedin.com/in/mananroongta
27-11-2012 08:39:19

4 Tweets

8 Followers

9 Following

Agentica Project (@agentica_)

✨RL magic is in the air! Introducing DeepScaleR-1.5B-Preview—a fully open-source, 1.5B-parameter model trained with RL to surpass o1-preview for general math reasoning.

📜Blog: pretty-radio-b75.notion.site/DeepScaleR-Sur…
💻Github: github.com/agentica-proje…

Snorkel AI (@snorkelai)

A 4B model > 235B on financial reasoning.

We partnered with rLLM (@rllm_project) to fine-tune Qwen3-4B-Instruct-2507, and it outperformed Qwen3-235B-A22B on expert-curated financial benchmarks.