Bhuwan Dhingra (@bhuwandhingra) Twitter Tweets • TwiCopy

Happy to share my first paper at Apple, led by Roy Xie. TL; DR: Interleaving <think> and <answer> blocks during reasoning reduces the time-to-first-token *and* improves accuracy.

thumb_up_off_alt14

chat_bubble_outline0

repeat0

shareShare

Bhuwan Dhingra

@bhuwandhingra

2 months ago

Looking forward to visiting Stanford this Thursday! Check out my talk at the NLP seminar if you’re around :)

thumb_up_off_alt37

chat_bubble_outline2

repeat3

shareShare

🚀 Introducing Mixture-of-Agents Alignment (MoAA), a new method to "distill" the collective intelligence of open-source LLMs into a single, efficient model. MoAA outperforms GPT-4o as a teacher, boosting smaller models like Llama3.1-8B to rival models 10x their size!

thumb_up_off_alt30

chat_bubble_outline2

repeat10

shareShare

Bhuwan Dhingra

@bhuwandhingra

2 months ago

Backtracking allows reasoning models to go back and correct mistakes in their solution attempts. What sorts of tasks benefit from this behavior? And can we boost it using SFT? Hongyi James Cai 's new preprint answers these questions and more -- check it out!

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Tian Li

@litian0331

2 months ago

Want to train LLMs with less cost? We introduce BiClip, a clipping-based method that `approximates' adaptive optimizers without maintaining expensive preconditioners

thumb_up_off_alt166

chat_bubble_outline4

repeat21

shareShare

Ruoming Pang

@ruomingpang

2 months ago

At WWDC we introduce a new generation of LLMs developed to enhance the Apple Intelligence features. We also introduce the new Foundation Models framework, which gives app developers direct access to the on-device foundation language model. machinelearning.apple.com/research/apple…

thumb_up_off_alt454

chat_bubble_outline9

repeat100

shareShare

Bhuwan Dhingra

@bhuwandhingra

21 days ago

The technical report for the second generation of Apple Foundation Models is out.. Its been a great year contributing to this effort and being part of an amazing team!

thumb_up_off_alt14

chat_bubble_outline0

repeat0

shareShare

Bhuwan Dhingra

Gate.io

Bhuwan Dhingra

Bhuwan Dhingra

Bhuwan Dhingra

Together AI

Bhuwan Dhingra

Tian Li

Ruoming Pang

Bhuwan Dhingra