Mehrdad Farajtabar (@mfarajtabar)'s Twitter Profile
Mehrdad Farajtabar

@mfarajtabar

Research Scientist at @Apple, prev @DeepMind, prev @GeorgiaTech

ID: 1346668532210176000

Link: https://sites.google.com/view/mehrdad · Joined: 06-01-2021 04:03:06

147 Tweets

6.6K Followers

179 Following

Ruoming Pang (@ruomingpang)'s Twitter Profile Photo

In this report we describe the 2025 Apple Foundation Models ("AFM"). We also introduce the new Foundation Models framework, which gives app developers direct access to the on-device AFM model. machinelearning.apple.com/research/apple…

𝚐𝔪𝟾𝚡𝚡𝟾 (@gm8xx8)'s Twitter Profile Photo

Your LLM Knows the Future: Uncovering Its Multi-Token Prediction Potential

Autoregressive LMs already know future tokens; this work makes that usable:
- Append <mask> tokens → jointly predict k+1 future tokens
- Gated LoRA → updates only for MTP tokens, preserving NTP behavior
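
To make the two bullet points above concrete, here is a minimal, hypothetical PyTorch sketch (GatedLoRALinear, append_mask_tokens, and the exact shapes are my illustration, not the paper's implementation): appended <mask> tokens give the model extra positions to predict jointly, and a gated low-rank update is applied only at those positions, so the frozen model's next-token-prediction (NTP) behavior is untouched everywhere else.

```python
# Illustrative sketch only (assumed names; not the paper's actual code).
import torch
import torch.nn as nn


class GatedLoRALinear(nn.Module):
    """Frozen base linear layer plus a low-rank update that is gated on
    only at multi-token-prediction (MTP) positions."""

    def __init__(self, base: nn.Linear, rank: int = 8):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False              # keep pretrained weights frozen
        self.lora_a = nn.Linear(base.in_features, rank, bias=False)
        self.lora_b = nn.Linear(rank, base.out_features, bias=False)
        nn.init.zeros_(self.lora_b.weight)       # LoRA path starts as a no-op

    def forward(self, x: torch.Tensor, mtp_gate: torch.Tensor) -> torch.Tensor:
        # mtp_gate: (batch, seq, 1), 1.0 at appended <mask> positions, 0.0 elsewhere.
        return self.base(x) + mtp_gate * self.lora_b(self.lora_a(x))


def append_mask_tokens(input_ids: torch.Tensor, mask_id: int, k: int):
    """Append k <mask> tokens so one forward pass can jointly predict
    the next token plus k further tokens (k+1 in total)."""
    batch = input_ids.shape[0]
    masks = torch.full((batch, k), mask_id, dtype=input_ids.dtype)
    extended = torch.cat([input_ids, masks], dim=1)
    gate = torch.zeros(batch, extended.shape[1], 1)
    gate[:, -k:, :] = 1.0                        # LoRA is active only at MTP positions
    return extended, gate
```

During fine-tuning the gate is zero at regular positions, so only the appended MTP positions train the LoRA parameters; that is the sense in which NTP behavior is preserved.
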
Mehrdad Farajtabar (@mfarajtabar)'s Twitter Profile Photo

It’s great to be excited about AI’s #IMO performance, while also recognizing the true source of its power. I came across this paragraph today during my reading of A Thousand Brains: A New Theory of #Intelligence, 2021, Jeff Hawkins!

Jackson Atkins (@jacksonatkinsx)'s Twitter Profile Photo

Apple research just revealed a way to make LLMs 5.35x faster. 🤯

That’s not a typo. They've found a method to get a >500% speedup for code & math tasks, with ZERO quality loss.

Here's how they're unlocking AI models' "latent potential": 🧵
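
As a rough back-of-the-envelope illustration (my arithmetic, not the thread's): if each decoding step yields several accepted tokens instead of one, the wall-clock speedup is roughly the average number of tokens produced per step divided by the relative cost of that step.

```python
def decoding_speedup(avg_tokens_per_step: float, relative_step_cost: float = 1.0) -> float:
    """Rough wall-clock speedup over one-token-at-a-time decoding:
    tokens produced per forward pass, divided by how much that pass
    costs relative to a standard next-token step. Illustrative only;
    the 5.35x figure quoted above is the paper's measured result,
    not an output of this formula."""
    return avg_tokens_per_step / relative_step_cost


# e.g. averaging ~5.35 accepted tokens per step at roughly standard
# per-step cost corresponds to roughly a 5.35x faster decode
print(decoding_speedup(5.35))  # 5.35
```
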
Mehrdad Farajtabar (@mfarajtabar)'s Twitter Profile Photo

I noticed the same thing! Engaging in conversations, replies, or DMs with #DeepMind folks always feels safe and welcoming. Their culture is truly remarkable. Thanks to leaders like Samy Bengio, Devi Krishna, Daphne Luong, JG, and many others who've joined Apple, this incredible

Fartash Faghri (@fartashfg)'s Twitter Profile Photo

📢Submissions are now open for #NeurIPS2025 CCFM workshop. Submission deadline: August 22, 2025, AoE. Website: sites.google.com/view/ccfm-neur… Call for papers: sites.google.com/view/ccfm-neur… Submission Link: openreview.net/group?id=NeurI…

Edward Frenkel (@edfrenkel)'s Twitter Profile Photo

This is an unwise statement that can only make people confused about what LLMs can or cannot do. Let me tell you something: Math is NOT about solving this kind of ad hoc optimization problem. Yeah, by scraping available data and then clustering it, LLMs can sometimes solve some