Chase Blagden (@chaseblagden) 's Twitter Profile
Chase Blagden

@chaseblagden

scaling inference @synth_labs data science @caltech

ID: 1709399193624645632

04-10-2023 02:45:13

120 Tweets

97 Followers

429 Following

Anikait Singh (@anikait_singh_) 's Twitter Profile Photo

Scaling LLMs with more data is hitting its limits. To address more complex tasks, we need innovative approaches. Shifting from teaching models what to answer to how to solve problems, leveraging test-time compute and meta-RL, could be the solution. Check out Rafael's 🧵 below!

SynthLabs (@synth_labs) 's Twitter Profile Photo

Ever watched someone solve a hard math problem? Their first attempt is rarely perfect. They sketch ideas, cross things out, and try new angles. This process of exploration is key to human reasoning and our latest research formalizes this as Meta Chain-of-Thought (1/8) 🧵👇

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models "In this work, we present Big-Math, a dataset of over 250,000 high-quality math questions with verifiable answers, purposefully made for reinforcement learning (RL). To create

SynthLabs (@synth_labs) 's Twitter Profile Photo

Releasing Big-MATH—the first heavily curated & verifiable dataset designed specifically for large-scale RL training & LLM reasoning! 📝 250,000+ problems, 47k NEW Q's ✅ 10x larger than existing datasets like MATH 🧑‍⚖️ Verifiable—we eliminated 400k+ problems Details below! 🧵👇

SynthLabs (@synth_labs) 's Twitter Profile Photo

Start exploring Big-MATH today! 📄 Paper: arxiv.org/abs/2502.17387 💻 Code: github.com/SynthLabsAI/bi… 📂 Dataset: huggingface.co/datasets/Synth…

Alon Albalak (@albalakalon) 's Twitter Profile Photo

Happy to finally announce Big-MATH, the largest math reasoning dataset purposefully designed for large-scale RL! We worked tirelessly, cleaning and filtering math datasets so that you don't have to!

Nebius (@nebiusai) 's Twitter Profile Photo

Read how SynthLabs, a startup developing AI solutions tailored for logical reasoning, is advancing AI post-training with our @TractoAI: nebius.com/customer-stori… 🔹 Goal: Develop an ML system that empowers reasoning models to surpass pattern matching and implement sophisticated

Asher Trockman (@ashertrockman) 's Twitter Profile Photo

Are you a frontier lab investing untold sums in training? Are you trying to stay competitive? Are you finding that your competitors' models are ... thinking a bit too much like yours? Then antidistillation.com might be for you! Sam Altman Elon Musk

Benjamin Spiegel (@superspeeg) 's Twitter Profile Photo

Why did only humans invent graphical systems like writing? 🧠✍️ In our new paper at CogSci Society, we explore how agents learn to communicate using a model of pictographic signification similar to human proto-writing. 🧵👇

Bahareh Tolooshams (@btolooshams) 's Twitter Profile Photo

We have released VARS-fUSI: Variable sampling for fast and efficient functional ultrasound imaging (fUSI) using neural operators. The first deep learning fUSI method to allow for different sampling durations and rates during training and inference. biorxiv.org/content/10.110… 1/

Devin (@_chotzen) 's Twitter Profile Photo

Our first long-horizon agentic software engineering model is here! We've shipped a model that matches Claude on Cascade in a lot of ways. However, the most exciting thing about this release is the trajectory we're on. So much left to do... we're hiring!

nathan lile (@nathanthinks) 's Twitter Profile Photo

excellent work by Jason Weston & team, extending our "Generative Reward Models" work with RL (GRPO) to optimize LLM reasoning during judgment. scalable (synthetic) evaluation continues to be AI's key bottleneck!

nathan lile (@nathanthinks) 's Twitter Profile Photo

btw we have ongoing research on this front! we're open-science, pro-publication, and love collaboration. want to push this frontier forward? we're growing our SF team & always open to research partners—reach out, my DMs are open 📩