Jonas Pfeiffer (@pfeiffjo) 's Twitter Profile
Jonas Pfeiffer

@pfeiffjo

Research Scientist @GoogleDeepMind | @AdapterHub | previously @nyuniversity @TUDarmstadt @UKPLab @MetaAI @spotify | pfeiffer.ai | (he/him)

ID: 795620317532143616

Joined: 07-11-2016 13:34:00

648 Tweets

3.3K Followers

686 Following

Marktechpost AI Research News ⚡ (@marktechpost) 's Twitter Profile Photo

Google DeepMind Introduces Differentiable Cache Augmentation: A Coprocessor-Enhanced Approach to Boost LLM Reasoning and Efficiency

Researchers from Google DeepMind have introduced a method called Differentiable Cache Augmentation. This technique uses a trained coprocessor to
Raj Dabre (@prajdabre1) 's Twitter Profile Photo

Paper #2: Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization
Link: aclanthology.org/2024.mrl-1.7/

Ever wondered how we can do LLM weight arithmetic to enable models to handle tasks in languages in zero-shot style? The authors have a solution.
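The tweet only hints at how the weight arithmetic works. As a loose illustration (not the paper's actual recipe), parameter arithmetic with language and task experts can be sketched as adding each expert's delta from a shared base back onto the base weights; all names and coefficients below are hypothetical:

```python
import numpy as np

def merge_experts(base, lang_expert, task_expert, lam_lang=1.0, lam_task=1.0):
    """Illustrative task arithmetic: compute each expert's delta from the
    shared base weights and add the scaled deltas onto the base."""
    merged = {}
    for name, w in base.items():
        lang_delta = lang_expert[name] - w   # what language training changed
        task_delta = task_expert[name] - w   # what task training changed
        merged[name] = w + lam_lang * lang_delta + lam_task * task_delta
    return merged

# Toy example with a single 2x2 weight matrix
base = {"layer.w": np.zeros((2, 2))}
lang = {"layer.w": np.ones((2, 2))}        # delta = +1 everywhere
task = {"layer.w": np.full((2, 2), 2.0)}   # delta = +2 everywhere
out = merge_experts(base, lang, task)
print(out["layer.w"])  # each entry is 0 + 1*1 + 1*2 = 3
```

The idea is that the language delta and the task delta are learned separately (e.g. in parameter-efficient layers), so combining them at inference time can yield zero-shot behavior for a task/language pair never seen together in training.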
Raj Dabre (@prajdabre1) 's Twitter Profile Photo

I've left out several technical details, so please read the paper; it contains some interesting nuggets of information. Overall, another cool work by: Alexandra Chronopoulou, Jonas Pfeiffer, Joshua Maynez, Xinyi Wang (Cindy), Sebastian Ruder, and Priyanka Agrawal. Thanks for the cool paper!

Douwe Kiela (@douwekiela) 's Twitter Profile Photo

I’m really sad that my dear friend Felix Hill is no longer with us. He had many friends and colleagues all over the world - to try to ensure we reach them, his family have asked to share this webpage for the celebration of his life: pp.events/felix

Leshem Choshen C U @ ICLR 🤖🤗 (@lchoshen) 's Twitter Profile Photo

Thanks to our invited speakers Jonas Pfeiffer and Alisa Liu, who delivered inspiring talks on Modular Deep Learning and on decoding-time experts for language model adaptation. A heartfelt thank you to our sponsors Hugging Face, Sakana AI, and Arcee.ai for making the competition possible!

Arthur Douillard (@ar_douillard) 's Twitter Profile Photo

Workshop alert 🚨

We'll host at ICLR 2025 a workshop on modularity, encompassing collaborative + decentralized + continual learning.

Those topics are on the critical path to building better AIs.

Interested? Submit a paper and join us in Singapore!

sites.google.com/corp/view/mcdc…
AdapterHub (@adapterhub) 's Twitter Profile Photo

🎁 A new update of the Adapters library is out! Check out all the novelties, changes & fixes here: github.com/adapter-hub/ad…

Aishwarya Kamath (@ashkamath20) 's Twitter Profile Photo

Super excited to announce what I’ve been working on for the past few months 💃

GEMMA 3 is out today! It supports 140+ languages, has a context length of 128k tokens, and the best part? It’s natively multimodal! 📸
JB Alayrac (@jalayrac) 's Twitter Profile Photo

Congratulations to the whole Gemma team for the launch, and especially Aishwarya Kamath, who did an amazing job pushing the MM capability of the model 🚀. Give the model a try 🔥

Aishwarya Kamath (@ashkamath20) 's Twitter Profile Photo

Put together a small demo with some fun examples of how you can use Gemma3’s new vision capability with multilinguality and reasoning!

Jonas Pfeiffer (@pfeiffjo) 's Twitter Profile Photo

I am hiring a Student Researcher for our Modularity team at the Google DeepMind office in Zurich 🇨🇭. Please fill out the interest form if you would like to work with us! The role would start mid/end 2025 and would be in person in Zurich, at 80-100% at GDM. forms.gle/N94ViTmKHCCAcv…

Alexandre Ramé (@ramealexandre) 's Twitter Profile Photo

Hiring two student researchers for the Gemma post-training team at Google DeepMind Paris! The first topic is about diversity in RL for LLMs (merging, generalization, exploration & creativity); the second is about distillation (with Nino Vieillard). Ideal if you're finishing your PhD. DMs open!

Leon Engländer (@leonenglaender) 's Twitter Profile Photo

Thrilled about our new Adapters release! 🎉 I had a blast working on this version, especially contributing to the new plugin interface (like adding ModernBERT) and helping with the VeRA adapter method. Have a look at the full thread for all the awesome updates from our team 👇