Sebastien Bubeck(@SebastienBubeck) 's Twitter Profileg
Sebastien Bubeck

@SebastienBubeck

VP GenAI Research, Microsoft AI

ID:452384386

linkhttp://sbubeck.com calendar_today01-01-2012 19:44:13

1,4K Tweets

34,5K Followers

1,3K Following

Sebastien Bubeck(@SebastienBubeck) 's Twitter Profile Photo

Hmmm, I have a feeling this plot might need an overhaul rather soon🤣.

I guess phi-2 was the lower left part of the triangle. I wonder what those guys have been up to in the last 6 months? 🤔

account_circle
Mark Russinovich(@markrussinovich) 's Twitter Profile Photo

As part of our ongoing work on AI safety and security, we've discovered a powerful, yet simple LLM jailbreak that exploits an intrinsic LLM behavior we call 'crescendo' and have demonstrated it on dozens of tasks across major LLM models and services: …ndo-the-multiturn-jailbreak.github.io

account_circle
Sebastien Bubeck(@SebastienBubeck) 's Twitter Profile Photo

At a time where 314B parameters models are trending, come join me at to see what you can do with 1 or 2B parameters :-) (and coming soon, what can you do with 3B?!?)

At a time where 314B parameters models are trending, come join me at #NVIDIAGTC to see what you can do with 1 or 2B parameters :-) (and coming soon, what can you do with 3B?!?)
account_circle
Jelani Nelson(@minilek) 's Twitter Profile Photo

Elon Musk and Sam Altman may not agree on much of late, but do agree AI is built on strong math foundations, including algebra and calculus, applauding University of California for recent clarifications on math requirements for admission.

Many industry leaders signed:

mathmatters.ai

@elonmusk and @sama may not agree on much of late, but do agree AI is built on strong math foundations, including algebra and calculus, applauding @UofCalifornia for recent clarifications on math requirements for admission. Many industry leaders signed: mathmatters.ai
account_circle
Sebastien Bubeck(@SebastienBubeck) 's Twitter Profile Photo

Interestingly the unicorn test is much harder today than back in 2022, because there are now a lot more crappy tikz unicorns on the web ... I guess this has some broader significance ...

account_circle
DeepSpeed(@MSFTDeepSpeed) 's Twitter Profile Photo

Introducing Mixtral, Phi2, Falcon, and Qwen support in -FastGen!

- Up to 2.5x faster LLM inference
- Optimized SplitFuse and token sampling
- Exciting new features like RESTful API and more!

For more details: github.com/microsoft/Deep…

Introducing Mixtral, Phi2, Falcon, and Qwen support in #DeepSpeed-FastGen! - Up to 2.5x faster LLM inference - Optimized SplitFuse and token sampling - Exciting new features like RESTful API and more! For more details: github.com/microsoft/Deep… #DeepSpeeed #AI
account_circle
Boris Hanin(@BorisHanin) 's Twitter Profile Photo

🚨Princeton ML Theory Summer School🚨

Aug 6 - 15, 2024

Speakers:

* F. Krzakala/L. Zdeborova Krzakala Florent Lenka Zdeborova
* S. Rakhlin
* B. Lourerio Bruno Loureiro
* M. Austern
* D. Krotov Dmitry Krotov
* J. Altschuler

Details: mlschool.princeton.edu

Sponsors:…

🚨Princeton ML Theory Summer School🚨 Aug 6 - 15, 2024 Speakers: * F. Krzakala/L. Zdeborova @KrzakalaF @zdeborova * S. Rakhlin * B. Lourerio @_brloureiro * M. Austern * D. Krotov @DimaKrotov * J. Altschuler Details: mlschool.princeton.edu Sponsors:…
account_circle
anton(@abacaj) 's Twitter Profile Photo

I did multiple fine tuning runs over tinyllama-3T and phi-2 using the same dataset (which I think is actually a good dataset of multiturn conversations)

Results? Tinyllama can't properly follow multiturns. The model gets confused and brings up completely unrelated completions

I did multiple fine tuning runs over tinyllama-3T and phi-2 using the same dataset (which I think is actually a good dataset of multiturn conversations) Results? Tinyllama can't properly follow multiturns. The model gets confused and brings up completely unrelated completions
account_circle
Nisheeth Vishnoi(@NisheethVishnoi) 's Twitter Profile Photo

Looking for a and would like to work at Yale on emerging problems in the areas of foundations of AI/responsible AI?

Have your CV, statement, and 3 letters emailed by Jan 15, 2023
cs.yale.edu/homes/vishnoi/…

Strong math and empirical background required

Please fwd/rt!

account_circle
Sebastien Bubeck(@SebastienBubeck) 's Twitter Profile Photo

Check out this short video for a brief discussion of the phi series with Microsoft CTO Kevin Scott , including why 'textbooks' in 'Textbooks Are All You Need' might not be exactly what you have in mind.

youtu.be/O-DjHgZt-Uk?si…

account_circle
Peter Lee(@peteratmsr) 's Twitter Profile Photo

This video of a chat between Microsoft CTO Kevin Scott and MSR AI researcher Sebastien Bubeck is full of interesting insights about the phi family of large language models, and AI in general. Only 12 minutes, definitely worth watching. youtube.com/watch?v=O-DjHg…

account_circle
Peter Lee(@peteratmsr) 's Twitter Profile Photo

2023 has been the most consequential year in Microsoft Research's 33-year history. In addition to countless internal (to Microsoft, OpenAI, etc) contributions, I bet that again our labs contributed more per researcher than any others to open research. Read more microsoft.com/en-us/research…

account_circle
Sebastien Bubeck(@SebastienBubeck) 's Twitter Profile Photo

We're so pumped to see phi-2 at the top of trending models on Hugging Face ! It's sibling phi-1.5 has already half a million downloads. Can't wait to see the mechanistic interpretability works that will come out of this & their impact on all the important LLM research questions!

We're so pumped to see phi-2 at the top of trending models on @huggingface ! It's sibling phi-1.5 has already half a million downloads. Can't wait to see the mechanistic interpretability works that will come out of this & their impact on all the important LLM research questions!
account_circle
Sebastien Bubeck(@SebastienBubeck) 's Twitter Profile Photo

Check out this short video for a brief discussion of the phi series with Microsoft CTO Kevin Scott , including why 'textbooks' in 'Textbooks Are All You Need' might not be exactly what you have in mind.

youtu.be/O-DjHgZt-Uk?si…

account_circle
Sebastien Bubeck(@SebastienBubeck) 's Twitter Profile Photo

This is a really excellent video, including an insightful discussion of benchmarks like MMLU: youtube.com/watch?v=nPgs8T…

account_circle
Kevin Scott(@kevin_scott) 's Twitter Profile Photo

I'm immensely proud of the work that my colleagues in Microsoft Research showcased at this week’s NeurIPS Conference regarding foundation models of all sizes, including Phi-2, which matches or outperforms models up to 25x larger.

account_circle