Agentica Project (@agentica_)'s Twitter Profile
Agentica Project

@agentica_

Building generalist agents that scale @BerkeleySky

ID: 1884497281870929920

Link: http://www.agentica-project.com

Joined: 29-01-2025 07:02:25

55 Tweets

2.2K Followers

8 Following

Sijun Tan (@sijun_tan):

Check out Michael Luo's latest work, Autellix—an ultra-fast system for serving agentic workloads, achieving 4-15x speedups over vLLM/SGLang! At Agentica, we are committed to building efficient infra for serving/training of LLM agents, and Autellix is the first step towards it!

Agentica Project (@agentica_):

🚀 Just two weeks after we open-sourced our DeepScaleR model, the community has already reproduced our results, surpassing O1-preview by following our training recipe! 🎉

Our goal has always been to democratize RL training for LLMs, and by sharing everything—data, code, and …

Together AI (@togethercompute):

Announcing DeepCoder-14B – an o1 & o3-mini level coding reasoning model, fully open-sourced!

We’re releasing everything: dataset, code, and training recipe. 🔥

Built in collaboration with the Agentica Project team.

See how we created it. 🧵
𝚐𝔪𝟾𝚡𝚡𝟾 (@gm8xx8):

DeepCoder-14B is a code reasoning LLM fine-tuned from DeepSeek-R1-Distill-Qwen-14B using distributed RL with GRPO+ and iterative context lengthening. Trained on ~24K coding problems (TACO-Verified, PrimeIntellect SYNTHETIC-1, LCB v5), it improves Pass@1 on LiveCodeBench v5 to …
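
For intuition, the core of vanilla GRPO is a group-relative advantage: sample several completions per problem, score each with a verifiable reward (e.g. unit tests passing), and normalize rewards within the group. A minimal sketch under those assumptions (the reward values below are made up, and GRPO+'s specific modifications are not detailed in the tweet, so they are not shown):

```python
import numpy as np

def grpo_advantages(group_rewards: np.ndarray, eps: float = 1e-6) -> np.ndarray:
    """Normalize per-completion rewards within one prompt's sample group.

    This is the group-relative baseline of vanilla GRPO; GRPO+ layers
    further modifications on top that the tweet does not spell out.
    """
    return (group_rewards - group_rewards.mean()) / (group_rewards.std() + eps)

# Example: 8 sampled solutions for one coding problem, reward 1.0 iff all tests pass.
rewards = np.array([1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0])
print(grpo_advantages(rewards))  # passing samples get positive advantage
```
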
Michael Luo (@michaelzluo):

🚀 We introduce DeepCoder-14B-Preview, a fully open-sourced coding model that is on par with o3-mini and o1!

📷 We scaled our model with RL magic up to 32K context. Its performance scales to 64K context 🔥
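
"Scaling up to 32K" refers to the iterative context lengthening mentioned above: the RL run raises the maximum response length in stages rather than training at the full length from the start. A minimal sketch of such a schedule (the stage lengths and the `train_rl_stage` hook are hypothetical placeholders, chosen only to illustrate the idea):

```python
# Hypothetical staged RL schedule: each stage trains with a larger cap on
# response length, so the model learns short solutions before long ones.
def train_rl_stage(max_response_tokens: int) -> None:
    # Placeholder for one RL training phase at a fixed length cap.
    print(f"RL stage with responses capped at {max_response_tokens} tokens")

# Final training cap is 32K; the tweet reports performance still improving
# when evaluated with a longer 64K window.
for max_len in (8_192, 16_384, 32_768):
    train_rl_stage(max_len)
```
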
Sijun Tan (@sijun_tan):

Hey Sam Altman, we know you're planning to open-source your reasoning model—but we couldn’t wait. Introducing DeepCoder-14B-Preview: a fully open-source reasoning model that matches o1 and o3-mini on both coding and math. And yes, we’re releasing everything: model, data, code, and …

Roy Huang (@_royh021):

😱 14B Fully Open Source o3-mini level model?? 😱 DeepCoder is a small but mighty reasoning model, o3-mini and o1 level on coding and math, that runs at a tiny fraction of the cost. Another W by open source. Go check it out!!

Yuchen Jin (@yuchenj_uw):

UC Berkeley open-sourced a 14B model that rivals OpenAI o3-mini and o1 on coding!

They applied RL to DeepSeek-R1-Distill-Qwen-14B on 24K coding problems.

It only cost 32 H100s for 2.5 weeks (~$26,880)!

It's truly open-source. They released everything: the model, training …
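
The quoted cost checks out arithmetically: 32 GPUs × 2.5 weeks × 7 days × 24 hours = 13,440 GPU-hours, and $26,880 / 13,440 ≈ $2 per H100-hour (the per-hour rate is inferred from the tweet's numbers, not stated in it). A quick sanity check:

```python
# Sanity-check the quoted training cost; the $/hour rate is derived here,
# not something the tweet states directly.
gpus = 32
weeks = 2.5
gpu_hours = gpus * weeks * 7 * 24   # 13,440 GPU-hours
total_cost_usd = 26_880             # figure quoted in the tweet
print(gpu_hours)                    # 13440.0
print(total_cost_usd / gpu_hours)   # 2.0 (USD per H100-hour)
```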