Joan Cabezas (@josancamon19) Twitter Tweets • TwiCopy

Joan Cabezas

@josancamon19

+ Follow

Co-founder cifrato.ai (YC W25) | prev built @omedotme, tripplanner.ai

ID: 354608817

calendar_today14-08-2011 00:52:33

179 Tweet

203 Followers

240 Following

Joan Cabezas

@josancamon19

2 months ago

cute Huey

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

gsm8k in fact might be too easy or potentially contaminated, regarding less steps for bigger models, 8B seems to saturate at 12/16 steps, whereas 14B continues to get gains (marginal) up to 30 steps, and smaller models peak at 40/50 steps, on lr/hp, didn't see any meaningful

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Andrej Karpathy

@karpathy

2 months ago

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,

thumb_up_off_alt16,16K

chat_bubble_outline517

repeat2,2K

shareShare

Joan Cabezas

@josancamon19

2 months ago

next steps to get this right: 1. explore more complex (e.g. tool calling) RL behaviors, ditch gsm8k. 2. qwen contamination issues, use Gemma 1B 4B 12B 27B -pt. 3. use David Hall marin checkpoints on 8B to figure task X% = a*C-pt + b*C-RL, (a, b being the optimal ratios to %).

thumb_up_off_alt9

chat_bubble_outline2

repeat0

shareShare

Joan Cabezas

@josancamon19

2 months ago

been so long since I was on this side of the table

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Joan Cabezas

@josancamon19

2 months ago

interpretability folks should spend some time hiring cracked frontend engineers, tools available look like software from the 90's, what if it just looked like an interactive fMRI? so many ways to make this cool, cc Neel Nanda

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Devvrit

@devvrit_khatri

2 months ago

Wish to build scaling laws for RL but not sure how to scale? Or what scales? Or would RL even scale predictably? We introduce: The Art of Scaling Reinforcement Learning Compute for LLMs

thumb_up_off_alt549

chat_bubble_outline10

repeat103

shareShare

Belinda

@belindmo

2 months ago

KGGen now has a way to visually navigate generated knowledge graphs: with Stanford Trustworthy AI Research (STAIR) Lab

KGGen now has a way to visually navigate generated knowledge graphs:

with <a href="/stai_research/">Stanford Trustworthy AI Research (STAIR) Lab</a>

thumb_up_off_alt18

chat_bubble_outline1

repeat6

shareShare

Joan Cabezas

@josancamon19

2 months ago

"we estimate that your P(breaking flow) geometrically increases 10% every second that passes while you wait for agent response, with the exact threshold varying based on perceived complexity of the request. The arbitrary “flow window” we hold ourselves to is 5 seconds.". finally

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Felipe Chávez

@felipekiwi90

2 months ago

Six months ago, we introduced Robot.com Today, we launch it 🎉 Over the past few years, we’ve quietly scaled from 300K to 1.7 million+ robotic tasks. 500+ real robots. Doing real work every day — delivering, moving, inspecting, and more. Here's the lineup: 1.

thumb_up_off_alt1,1K

chat_bubble_outline281

repeat193

shareShare