Sebastien Bubeck (@sebastienbubeck) 's Twitter Profile
Sebastien Bubeck

@sebastienbubeck

VP Applied Research, Microsoft AI

ID: 452384386

linkhttp://sbubeck.com calendar_today01-01-2012 19:44:13

1,1K Tweet

36,36K Followers

1,1K Following

Wenhu Chen (@wenhuchen) 's Twitter Profile Photo

Our MMLU-Pro paper is out. It's a more difficult, robust and reasoning-driven benchmark to measure expert-level intelligence. We have gradually included 50+ models in our leaderboard: huggingface.co/spaces/TIGER-L…. GPT-4o, Gemini-1.5-Pro, Claude-3-Opus are the current top-3 models.

Our MMLU-Pro paper is out. It's a more difficult, robust and reasoning-driven benchmark to measure expert-level intelligence.

We have gradually included 50+ models in our leaderboard:
huggingface.co/spaces/TIGER-L….
GPT-4o, Gemini-1.5-Pro, Claude-3-Opus are the current top-3 models.
.txt (@dottxtai) 's Twitter Profile Photo

👉 Structured generation beats GPT-4 Using structured generation, phi-3 achieves 95.5% accuracy when it only achieves 86% without structured generation. More importantly, it beats GPT-4 (93.5%) by a whopping 2 percentage point. 🔥🔥🔥

👉 Structured generation beats GPT-4

Using structured generation, phi-3 achieves 95.5% accuracy when it only achieves 86% without structured generation.

More importantly, it beats GPT-4 (93.5%) by a whopping 2 percentage point. 🔥🔥🔥
Sebastien Bubeck (@sebastienbubeck) 's Twitter Profile Photo

Every day I witness the AI revolution in action, and every day I see 1 or 2 questions that would deserve an entire PhD thesis to explore fully ... Honestly, how lucky we are to do research in that era!! *Even if* there is no more magical leap like from gpt2 to 3, or 3 to 4,

Sebastien Bubeck (@sebastienbubeck) 's Twitter Profile Photo

Just to state the obvious, AGI might not happen with current techniques. At a very basic level the argument for AGI is: if you compress sufficiently complex material, like the web, then you end up creating a "mind" that has in it all the "operations" necessary to create the web,

Sebastien Bubeck (@sebastienbubeck) 's Twitter Profile Photo

Some of you might find this short interview interesting. Quite a journey with Ronen Eldan since the entropic barrier 🤣, and it's still just the beginning! turingpost.com/p/phi3

Zeyuan Allen-Zhu (@zeyuanallenzhu) 's Twitter Profile Photo

If you're attending ICML 2024, join my 2-hour tutorial on Monday July 22 to explore the Physics of Language Model - all 6 parts. Visit: physics.allen-zhu.com and it will be live-streamed on Zoom. BONUS: this is the premiere of Part 2.1 + 2.2, don't miss out! #ICML2024 #MetaAI

If you're attending ICML 2024, join my 2-hour tutorial on Monday July 22 to explore the Physics of Language Model - all 6 parts. Visit: physics.allen-zhu.com and it will be live-streamed on Zoom. BONUS: this is the premiere of Part 2.1 + 2.2, don't miss out!  #ICML2024 #MetaAI
Sebastien Bubeck (@sebastienbubeck) 's Twitter Profile Photo

Second Ironman 70.3 in the bag 😁. Quite happy to have finished this one, it demanded some sacrifices during a busy year in AI... To the youngsters out there, don't forget to get your miles in even when work gets busy 😅

Second Ironman 70.3 in the bag 😁. Quite happy to have finished this one, it demanded some sacrifices during a busy year in AI... To the youngsters out there, don't forget to get your miles in even when work gets busy 😅
Sebastien Bubeck (@sebastienbubeck) 's Twitter Profile Photo

Check out this podcast about SLMs, it's a bit more product/business oriented than what I usually talk about but hopefully should be interesting to some of you! podcasts.apple.com/us/podcast/mic…

Csaba Szepesvari (@csabaszepesvari) 's Twitter Profile Photo

amathr.org/prizes/aiprize/ The Association for Mathematic Research announces "Prize in the Mathematics of Artificial Intelligence". I'm in the selection committee. The goal is to inspire young people to work on the intersection of AI and maths. Nominations to [email protected]

amathr.org/prizes/aiprize/
The Association for Mathematic Research announces "Prize in the Mathematics of Artificial Intelligence". I'm in the selection committee. The goal is to inspire young people to work on the intersection of AI and maths. Nominations to aiprize@amathr.org
Weizhu Chen (@weizhuchen) 's Twitter Profile Photo

We released phi 3.5: mini+MoE+vision A better mini model with multilingual support: huggingface.co/microsoft/Phi-… A new MoE model:huggingface.co/microsoft/Phi-… A new vision model supporting multiple images: huggingface.co/microsoft/Phi-…

Sebastien Bubeck (@sebastienbubeck) 's Twitter Profile Photo

This is the coolest "application" of LLMs I have seen in a while: instead of using an LLM to solve a problem set, flip the table and ask the student to TEACH the LLM how to solve the problem set. Really lovely. Thx to Jordan Hoffmann for sharing this :-). youtube.com/watch?v=om7VpI…

Sebastien Bubeck (@sebastienbubeck) 's Twitter Profile Photo

Lots of progress on bandit convex optimization recently arxiv.org/abs/2406.18672 arxiv.org/abs/2406.06506 arxiv.org/abs/2302.05371, I wish I could follow it more closely ... looks like Conjecture 1 from arxiv.org/abs/1607.03084 is going to be resolved soon!!!