SynthLabs (@synth_labs) Twitter Tweets • TwiCopy

Daniel van Strien

10 months ago

Big-Math: Big-Math: Massive Math Dataset for RL Training - 10x larger than GSM8k/MATH - 3 core properties: uniquely verifiable, open-ended, closed-form - Human-validated 90%+ precision filters - Difficulty metrics for curriculum learning

thumb_up_off_alt133

chat_bubble_outline2

repeat26

shareShare

Nebius

@nebiusai

10 months ago

The final stop in our meetup series will be in San Francisco! 🌁 nebius.com/events/nebius-… Join us at Convene 100 Stockton near Union Square on Thursday, March 13, for a deep dive into our AI cloud. Our developers, AI R&D engineers and architects will share insights with the tech

thumb_up_off_alt31

chat_bubble_outline1

repeat4

shareShare

Alon Albalak

@albalakalon

10 months ago

Happy to finally announce Big-MATH, the largest math reasoning dataset purposefully designed for large-scale RL! We worked tirelessly, cleaning and filtering math datasets so that you don't have to!

thumb_up_off_alt126

chat_bubble_outline5

repeat18

shareShare

Alon Albalak

@albalakalon

10 months ago

🤯 Big-Math is the #3 most popular dataset on Hugging Face If you're using it, I'd love to see the results of your work🤩Please share with us

🤯 Big-Math is the #3 most popular dataset on <a href="/huggingface/">Hugging Face</a>

If you're using it, I'd love to see the results of your work🤩Please share with us

thumb_up_off_alt16

chat_bubble_outline2

repeat2

shareShare

nathan lile

@nathanthinks

10 months ago

thrilled to see Big-MATH climbing to #3️⃣ on Hugging Face—clear signal the community wants more high-quality, verifiable RL datasets. grateful to everyone who’s been liking, downloading, and supporting ❤️

thrilled to see Big-MATH climbing to #3️⃣ on <a href="/huggingface/">Hugging Face</a>—clear signal the community wants more high-quality, verifiable RL datasets.

grateful to everyone who’s been liking, downloading, and supporting ❤️

thumb_up_off_alt21

chat_bubble_outline3

repeat7

shareShare

nathan lile

@nathanthinks

10 months ago

📜 Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models: arxiv.org/abs/2502.17387 🤗 Hugging Face dataset huggingface.co/datasets/Synth…

thumb_up_off_alt11

chat_bubble_outline0

repeat1

shareShare

Rafael Rafailov @ NeurIPS

@rm_rafailov

10 months ago

This is the dataset we curated for our own reasoning experiments. There is a lot of reasoning data coming out now, but we spend extra time on this to make sure all the problems are high-quality and suitable for RL training!

thumb_up_off_alt52

chat_bubble_outline2

repeat10

shareShare

The AI Timeline

@theaitimeline

10 months ago

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models Author's Explanation: x.com/synth_labs/sta… Overview: Big-Math, a dataset of over 250,000 high-quality math questions with verifiable answers, is purposefully designed for

thumb_up_off_alt6

chat_bubble_outline1

repeat2

shareShare

nathan lile

@nathanthinks

10 months ago

still climbing 📈 Big-MATH just hit 🥈 on Hugging Face huggingface.co/datasets/Synth…

thumb_up_off_alt17

chat_bubble_outline1

repeat5

shareShare

nathan lile

@nathanthinks

10 months ago

Qwen+RL = dramatic, Aha! Llama+RL = quick plateau Same size. Same RL. Why? Qwen naturally exhibits cognitive behaviors that Llama doesn't Prime Llama with 4 synthetic reasoning patterns & it matched Qwen's self-improvement performance! We can engineer this into any model! 👇

thumb_up_off_alt369

chat_bubble_outline6

repeat51

shareShare

nathan lile

@nathanthinks

10 months ago

models primed with INCORRECT solutions but with RIGHT BEHAVIORS achieve identical performance to those trained on correct solutions? > optimize for behaviors & amplify with RL

thumb_up_off_alt42

chat_bubble_outline3

repeat4

shareShare

Michael Burkov

@xmikebur

10 months ago

Proud of my team TractoAI . Our platform makes it easy to validate datasets and LLMs at scale.

thumb_up_off_alt3

chat_bubble_outline0

repeat1

shareShare

Nebius

@nebiusai

10 months ago

Our TractoAI is a great choice for AI labs accelerating their research. Congrats on the release, Mitch Thomas!

thumb_up_off_alt41

chat_bubble_outline5

repeat11

shareShare

nathan lile

@nathanthinks

10 months ago

btw, random fun fact we pointed out months ago: the only MATH example OpenAI published with o1 announcement included an unsubstantiated assumption 😬

btw, random fun fact we pointed out months ago:

the only MATH example <a href="/OpenAI/">OpenAI</a> published with o1 announcement included an unsubstantiated assumption 😬

thumb_up_off_alt37

chat_bubble_outline1

repeat7

shareShare

Nebius

@nebiusai

9 months ago

Read how SynthLabs, a startup developing AI solutions tailored for logical reasoning, is advancing AI post-training with our @TractoAI: nebius.com/customer-stori… 🔹 Goal: Develop an ML system that empowers reasoning models to surpass pattern matching and implement sophisticated

Read how <a href="/synth_labs/">SynthLabs</a>, a startup developing AI solutions tailored for logical reasoning, is advancing AI post-training with our @TractoAI: nebius.com/customer-stori…

🔹 Goal:
Develop an ML system that empowers reasoning models to surpass pattern matching and implement sophisticated

thumb_up_off_alt59

chat_bubble_outline2

repeat14

shareShare

nathan lile

@nathanthinks

7 months ago

btw we have ongoing research on this front! we're open-science, pro-publication, and love collaboration. want to push this frontier forward? we're growing our SF team & always open to research partners—reach out, my DMs are open 📩

thumb_up_off_alt55

chat_bubble_outline16

repeat7

shareShare

nathan lile

@nathanthinks

7 months ago

Generative Reward Models impact compounds daily. way stronger interest now than when we published last fall 👇 many excellent recent extensions—cool seeing where researchers take GenRM

thumb_up_off_alt17

chat_bubble_outline1

repeat2

shareShare