Jason Ramapuram (@jramapuram)'s Twitter Profile
Jason Ramapuram

@jramapuram

ML Research Scientist  MLR | Formerly: DeepMind, Qualcomm, Viasat, Rockwell Collins | Swiss-minted PhD in ML | Barista alumnus ☕ @ Starbucks | 🇺🇸🇮🇳🇱🇻🇮🇹

ID: 66173851

Link: https://jramapuram.github.io · Joined: 16-08-2009 19:22:59

247 Tweets

1.1K Followers

526 Following

Jason Ramapuram (@jramapuram):

Ever wish you could have a simple pipeline to extract parameters from augmentations for an auxiliary task (e.g., self-supervised learning)? Well now you can! Check out Parameterized Transforms, great work from Eeshan Gunesh Dhekane at Apple MLR.
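The idea, as described, is that an augmentation returns its sampled parameters alongside the transformed input, so the parameters can serve as targets for an auxiliary task. A minimal sketch (function name and structure are hypothetical, not the library's actual API):

```python
import random

def random_rotation(image, max_deg=30.0):
    # draw the augmentation parameter explicitly so it can be returned
    angle = random.uniform(-max_deg, max_deg)
    augmented = (image, angle)  # stand-in for an actually rotated image
    return augmented, {"angle": angle}

# auxiliary task: predict params["angle"] back from the augmented input
augmented, params = random_rotation("img_0")
```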

Edward Milsom (@edward_milsom):

Our paper "Function-Space Learning Rates" is on arXiv! We give an efficient way to estimate the magnitude of changes to NN outputs caused by a particular weight update. We analyse optimiser dynamics in function space, and enable hyperparameter transfer with our scheme FLeRM! 🧵👇
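The core quantity described here, the size of the output change induced by a particular weight update, can be sketched with a finite-difference probe on a batch. This is a minimal toy, not the paper's FLeRM scheme; the network and batch below are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(0)

def f(x, w):
    # toy one-layer "network": outputs tanh(x @ w)
    return np.tanh(x @ w)

# measure the function-space magnitude of a weight update dw as the
# RMS over a batch of ||f(x; w + dw) - f(x; w)||
x = rng.normal(size=(128, 16))
w = rng.normal(size=(16, 4))
dw = 1e-2 * rng.normal(size=w.shape)

delta = f(x, w + dw) - f(x, w)
func_space_change = np.sqrt(np.mean(np.sum(delta**2, axis=1)))
```

A small parameter-space step can produce very different function-space steps depending on where `w` sits, which is why tracking this quantity (rather than the raw learning rate) is useful.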

Inception Labs (@inceptionailabs):

We are excited to introduce Mercury, the first commercial-grade diffusion large language model (dLLM)! dLLMs push the frontier of intelligence and speed with parallel, coarse-to-fine text generation.
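The coarse-to-fine, parallel generation mentioned here can be illustrated with a toy unmasking loop. Purely illustrative: `toy_denoiser` cheats by reading a fixed target, standing in for a real diffusion denoiser:

```python
import random

random.seed(0)
MASK = "<mask>"
target = ["the", "quick", "brown", "fox", "jumps"]

def toy_denoiser(seq):
    # stand-in for one dLLM denoising pass: propose a token and a
    # confidence for every masked position (cheats by reading `target`)
    return {i: (target[i], random.random())
            for i, tok in enumerate(seq) if tok == MASK}

seq = [MASK] * len(target)
steps = 0
while MASK in seq:
    proposals = toy_denoiser(seq)
    # coarse-to-fine: commit only the most confident half each round
    k = max(1, len(proposals) // 2)
    best = sorted(proposals.items(), key=lambda kv: -kv[1][1])[:k]
    for i, (tok, _conf) in best:
        seq[i] = tok
    steps += 1
```

Because several positions are committed per round, the number of passes is smaller than the sequence length, in contrast to one-token-per-step autoregressive decoding.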

Rin Metcalf Susa (@rinmetcalfsusa):

🚀 We're hiring an ML Researcher! 🚀 If you're an expert in LLM alignment & personalization and want to work on a world-class research team, apply here 👉 lnkd.in/gU9yeivi Know someone who’d be a great fit? Tag them! #MachineLearning #AI #Apple

Aayush Karan (@aakaran31):

Can machine learning models predict their own errors 🤯? In a new preprint w/ Apple collaborators Aravind Gollakota, Parikshit Gopalan, Charlotte Peale, and Udi Wieder, we present a theory of loss prediction and show an equivalence with algorithmic fairness! A thread (1/n):
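The notion of a model predicting its own errors can be sketched as a second model regressed onto the first model's per-example losses. A toy setup with hypothetical feature choices, not the paper's construction:

```python
import numpy as np

rng = np.random.default_rng(1)

# base model: linear regression on data with input-dependent noise,
# so some examples are predictably harder than others
X = rng.normal(size=(500, 3))
noise = 0.3 * np.abs(X[:, 0]) * rng.normal(size=500)
y = X @ np.array([1.0, -2.0, 0.5]) + noise

w, *_ = np.linalg.lstsq(X, y, rcond=None)
per_example_loss = (X @ w - y) ** 2

# loss predictor: regress realized losses on simple input features
# (this feature choice is an arbitrary illustration)
feats = np.column_stack([np.ones(500), X**2])
v, *_ = np.linalg.lstsq(feats, per_example_loss, rcond=None)
predicted_loss = feats @ v
corr = np.corrcoef(predicted_loss, per_example_loss)[0, 1]
```

Here the noise scale depends on `|x_0|`, so the loss predictor can pick up a positive correlation with the realized losses; when no such structure exists, loss prediction degenerates, which is where the connection to fairness-style guarantees becomes interesting.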

Andy Keller (@t_andy_keller):

In the physical world, almost all information is transmitted through traveling waves -- why should it be any different in your neural network? Super excited to share recent work with the brilliant Mozes Jacobs: "Traveling Waves Integrate Spatial Information Through Time" 1/14
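A minimal picture of information traveling as a wave: a leapfrog integration of the 1D wave equation on a ring, where a bump injected at one site propagates outward over time. Illustrative physics only, not the paper's architecture:

```python
import numpy as np

# discrete wave equation u_tt = c^2 u_xx on a ring of n sites,
# integrated with leapfrog steps (Courant number c*dt = 0.5, stable)
n, c, dt = 64, 1.0, 0.5
u_prev = np.zeros(n)
u = np.zeros(n)
u[5] = 1.0       # inject "information" at one spatial location
u_prev[5] = 1.0  # zero initial velocity

for _ in range(40):
    lap = np.roll(u, 1) - 2 * u + np.roll(u, -1)
    u_next = 2 * u - u_prev + (c * dt) ** 2 * lap
    u_prev, u = u, u_next
```

After 40 steps the initial bump has split into left- and right-moving pulses well away from site 5: spatial information at one location has been transported across the domain purely through local updates over time.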

Yizhe Zhang @ ICLR 2025 🇸🇬 (@yizhezhangnlp):

Excited to share our new paper on "Reversal Blessing" - where thinking BACKWARDS makes language models smarter on some multiple-choice questions! We found that right-to-left (R2L) models consistently outperform traditional left-to-right (L2R) models on certain reasoning tasks.🧵
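The L2R-vs-R2L comparison can be sketched with toy bigram models trained in each direction and used to score multiple-choice candidates. This is a cartoon of the mechanism with a made-up corpus, not the paper's setup:

```python
from collections import Counter

corpus = ["the cat sat", "the dog ran", "a cat ran"]

def train_bigram(sentences, reverse=False):
    # count bigrams, optionally reading each sentence right-to-left
    counts, ctx = Counter(), Counter()
    for s in sentences:
        toks = s.split()
        if reverse:
            toks = toks[::-1]
        toks = ["<s>"] + toks
        for a, b in zip(toks, toks[1:]):
            counts[(a, b)] += 1
            ctx[a] += 1
    return counts, ctx

def score(model, sentence, reverse=False):
    counts, ctx = model
    toks = sentence.split()
    if reverse:
        toks = toks[::-1]
    toks = ["<s>"] + toks
    p = 1.0
    for a, b in zip(toks, toks[1:]):
        p *= (counts[(a, b)] + 1) / (ctx[a] + 10)  # add-one smoothing
    return p

l2r = train_bigram(corpus)
r2l = train_bigram(corpus, reverse=True)
choices = ["the cat sat", "the cat the"]
l2r_pick = max(choices, key=lambda c: score(l2r, c))
r2l_pick = max(choices, key=lambda c: score(r2l, c, reverse=True))
```

An exact model would assign identical likelihoods under either factorization (chain rule), so any systematic L2R/R2L gap in practice comes from what each learned model captures, which is what makes the "reversal blessing" finding non-trivial.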

Kevin Patrick Murphy (@sirbayes):

I'm happy to announce that v2 of my RL tutorial is now online. I added a new chapter on multi-agent RL, improved the sections on 'RL as inference' and 'RL+LLMs' (although the latter is still WIP), and fixed some typos. arxiv.org/abs/2412.05265…

Martin Klissarov (@martinklissarov):

Here is an RL perspective on understanding LLMs for decision making. Are LLMs best used as policies, rewards, or transition functions? How do you fine-tune them? Can LLMs explore/exploit? 🧵 Join us down this rabbit hole... (ICLR 2025 paper, done at ML Research)

Pau Rodríguez (@prlz77):

Our work on fine-grained control of LLMs and diffusion models via Activation Transport will be presented at ICLR 2025 as a spotlight ✨ Check out our new blog post: machinelearning.apple.com/research/trans…

Mustafa Shukor (@mustafashukor1):


We release a large-scale study to answer the following:
- Is late fusion inherently better than early fusion for multimodal models?
- How do native multimodal models scale compared to LLMs?
- How can sparsity (MoEs) play a detrimental role in handling heterogeneous modalities? 🧵
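The early-vs-late fusion distinction in the first question can be sketched in a few lines (toy embeddings and random weights, purely illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)
img = rng.normal(size=(8,))   # toy image embedding
txt = rng.normal(size=(8,))   # toy text embedding

def mlp(x, w1, w2):
    # two-layer perceptron with ReLU
    return np.maximum(x @ w1, 0) @ w2

# late fusion: a separate encoder per modality, combined at the end
w_img1, w_img2 = rng.normal(size=(8, 16)), rng.normal(size=(16, 4))
w_txt1, w_txt2 = rng.normal(size=(8, 16)), rng.normal(size=(16, 4))
late = mlp(img, w_img1, w_img2) + mlp(txt, w_txt1, w_txt2)

# early fusion: modalities concatenated and processed by one shared trunk
w_e1, w_e2 = rng.normal(size=(16, 16)), rng.normal(size=(16, 4))
early = mlp(np.concatenate([img, txt]), w_e1, w_e2)
```

In the late-fusion path the modalities never interact until the final sum; in the early-fusion path every hidden unit can mix image and text features, which is the design axis the study compares at scale.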
Shuangfei Zhai (@zhaisf):

Proud to report that TarFlow is accepted to #ICML2025 as a Spotlight 🎉 I’m really looking forward to new ideas and applications enabled by powerful Normalizing Flow models 🚀
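For readers new to normalizing flows: the exact log-likelihood comes from the change-of-variables formula, shown here for a single affine-coupling step in 2D. A generic flow sketch, not TarFlow itself:

```python
import numpy as np

rng = np.random.default_rng(3)

# one affine-coupling step: z1 = x1, z2 = x2 * exp(s(x1)) + t(x1)
def s(x1): return 0.5 * np.tanh(x1)
def t(x1): return 0.1 * x1

def forward(x):
    x1, x2 = x[:, 0], x[:, 1]
    z = np.stack([x1, x2 * np.exp(s(x1)) + t(x1)], axis=1)
    log_det = s(x1)  # log|det J| of this triangular Jacobian
    return z, log_det

# change of variables: log p(x) = log N(z; 0, I) + log|det J|
x = rng.normal(size=(100, 2))
z, log_det = forward(x)
log_px = -0.5 * np.sum(z**2 + np.log(2 * np.pi), axis=1) + log_det
```

The coupling structure makes both the inverse and the Jacobian determinant cheap, which is what lets flows be trained by exact maximum likelihood.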

Vaibhav (VB) Srivastav (@reach_vb):

Let's goo! Starting today you can access 5000+ LLMs powered by MLX directly from Hugging Face Hub! 🔥 All you need to do is click `Use this model` from any compatible model \o/ That's it, all you need to get blazingly fast intelligence right at your terminal! What would you

Google DeepMind (@googledeepmind):

Video, meet audio. 🎥🤝🔊 With Veo 3, our new state-of-the-art generative video model, you can add soundtracks to clips you make. Create talking characters, include sound effects, and more while developing videos in a range of cinematic styles. 🧵

Maureen de Seyssel (@maureendss):

Now that INTERSPEECH 2025 registration is open, it's time for some shameless promo! Sign up and join our Interspeech tutorial: Speech Technology Meets Early Language Acquisition: How Interdisciplinary Efforts Benefit Both Fields. 🗣️👶 interspeech2025.org/tutorials ⬇️ (1/2)