Chen Qian (@chenmoneyq)'s Twitter Profile
Chen Qian

@chenmoneyq

AI Engineer @Databricks, maintainer of DSPy. I am passionate about open source and building AI products.

ID: 799385380810211328

Joined: 17-11-2016 22:55:01

46 Tweets

420 Followers

81 Following

Ludwig Schmidt (@lschmidt3)'s Twitter Profile Photo

Very excited to finally release our paper for OpenThoughts! After DataComp and DCLM, this is the third large open dataset my group has been building in collaboration with the DataComp community. This time, the focus is on post-training, specifically reasoning data.

Lysandre (@lysandrejik)'s Twitter Profile Photo

I have bittersweet news to share. Yesterday we merged a PR deprecating TensorFlow and Flax support in transformers. Going forward, we're focusing all our efforts on PyTorch to remove a lot of the bloating in the transformers library. Expect a simpler toolkit, across the board.

Chen Qian (@chenmoneyq)'s Twitter Profile Photo

Is LLaDA (github.com/ML-GSAI/LLaDA) doing something similar to Gemini Diffusion? Gemini Diffusion is doing pretty solid work on the tasks I tried, so I am wondering when we can have a standard text diffusion model in the open-source community.

Anmol Gulati (@anmol01gulati)'s Twitter Profile Photo

XBench just released an agentic eval leaderboard: xbench.org. It seems to capture both frontier capabilities and real-world jobs and use cases, ticking most of the boxes!

Chen Qian (@chenmoneyq)'s Twitter Profile Photo

We are adding more real-world examples to the DSPy tutorials: dspy.ai/tutorials/real…. Please check them out and let us know what else you want to see and learn! We welcome contributions: create a feature request describing what you want to ship, and we can discuss from there!

Mayee Chen (@mayeechen)'s Twitter Profile Photo

LLMs often generate correct answers but struggle to select them. Weaver tackles this by combining many weak verifiers (reward models, LM judges) into a stronger signal using statistical tools from Weak Supervision—matching o3-mini-level accuracy with much cheaper models! 📊

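The weak-supervision idea behind combining verifiers can be sketched with a simple (and much cruder) stand-in: a naive-Bayes-style weighted vote, where each verifier's vote is weighted by the log-odds of its estimated accuracy. This is an illustrative toy, not Weaver's actual algorithm; the function name and the accuracy estimates are made up for the example.

```python
# Toy sketch (NOT Weaver's method): fuse several weak binary verifiers into
# one stronger score. Each verifier's weight is the log-odds of its estimated
# accuracy, assuming verifiers err independently given the true label.
import math

def combine_verifiers(votes, accuracies):
    """votes: 0/1 judgments, one per verifier (1 = "answer looks correct").
    accuracies: each verifier's estimated accuracy, in (0.5, 1).
    Returns an estimated probability that the answer is correct."""
    score = 0.0
    for vote, acc in zip(votes, accuracies):
        weight = math.log(acc / (1 - acc))        # log-odds weight
        score += weight if vote == 1 else -weight # agreement adds, dissent subtracts
    return 1 / (1 + math.exp(-score))             # sigmoid -> probability

# Two verifiers say "correct", one says "incorrect":
p = combine_verifiers([1, 1, 0], [0.7, 0.65, 0.6])
```

A more accurate verifier gets a larger weight, so the combined score leans toward the verifiers most likely to be right; part of what makes the real weak-supervision machinery interesting is that it estimates those accuracies without labeled data.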
MLflow (@mlflow)'s Twitter Profile Photo

In this clip from Data+AI Summit, Chen Qian talks about the release of DSPy 3, which brings production-ready capabilities, seamless #MLflow integration, streaming and async support, and advanced optimizers like SIMBA. Chen also explains how DSPy 3 streamlines prompt engineering.

Chen Qian (@chenmoneyq)'s Twitter Profile Photo

I do feel myself becoming "lazy" these past few years. It doesn't necessarily mean my productivity is dropping, but I do feel less passionate about diving deep to learn a new framework/language/algorithm, because I can rely on LLMs to coach me.

Chen Qian (@chenmoneyq)'s Twitter Profile Photo

This is an underrated DSPy module in my opinion, and I am happy to see the community discovering it. It's also probably my bad for not providing concrete use cases for it...

alphaXiv (@askalphaxiv)'s Twitter Profile Photo

"How Many Instructions Can LLMs Follow at Once?" In this paper they found that leading LLMs can satisfy only about 68% of 500 concurrent instructions, showing a bias toward earlier instructions.

"How Many Instructions Can LLMs Follow at Once?"

In this paper they found that leading LLMs can satisfy only about 68% of 500 concurrent instructions, showing a bias toward earlier instructions.
Chen Qian (@chenmoneyq)'s Twitter Profile Photo

Cool and inspiring direction! I feel a bit strange about the evaluation part, though. For example, pushing HotPotQA to 27.72% accuracy is not very exciting... In addition, using GPT-2 as the baseline (Table 8) is odd.