Bo Liu (@cranialxix)'s Twitter Profile
Bo Liu

@cranialxix

Research Scientist @Meta FAIR | CS PhD @UT Austin | Former Research Intern @DeepMind, @Nvidia, @Baidu

ID: 953831169807675395

Website: https://cranial-xix.github.io/ | Joined: 18-01-2018 03:27:27

30 Tweets

329 Followers

206 Following

Konstantin Mishchenko (@konstmish)'s Twitter Profile Photo

Constrained optimization perspective on what Lion optimizer is doing. They also generalize Lion to operations other than sign in the update.
Paper: arxiv.org/abs/2310.05898
It seems highly related to dual space preconditioning, which is somehow not cited: arxiv.org/abs/1902.02257
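For context on the update being generalized: the Lion optimizer's step is the sign of an interpolated momentum with decoupled weight decay. A minimal NumPy sketch of that rule (variable names are my own; the paper above reinterprets the `sign` as one choice of dual-space map):

```python
import numpy as np

def lion_step(w, g, m, lr=1e-4, beta1=0.9, beta2=0.99, wd=0.0):
    """One Lion step: the update direction is sign(interpolated momentum).
    It is this sign() that the constrained-optimization view generalizes."""
    update = np.sign(beta1 * m + (1 - beta1) * g)  # sign of interpolated momentum
    w = w - lr * (update + wd * w)                 # decoupled weight decay
    m = beta2 * m + (1 - beta2) * g                # momentum tracking of the gradient
    return w, m
```

Replacing `np.sign` with another monotone map is the kind of generalization the paper studies.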
RL_Conference (@rl_conference)'s Twitter Profile Photo

Thrilled to announce the first annual Reinforcement Learning Conference RL_Conference, which will be held at UMass Amherst August 9-12! RLC is the first strongly peer-reviewed RL venue with proceedings, and our call for papers is now available: rl-conference.cc.

AK (@_akhaliq)'s Twitter Profile Photo

Google DeepMind presents Asynchronous Local-SGD Training for Language Modeling

paper page: huggingface.co/papers/2401.09…

Local stochastic gradient descent (Local-SGD), also referred to as federated averaging, is an approach to distributed optimization where each device performs more
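To make the Local-SGD idea concrete, here is a toy sketch (illustrative only, not the paper's code): each worker takes several local SGD steps from the shared parameters, then the results are averaged, which is the federated-averaging scheme the tweet refers to.

```python
import numpy as np

def local_sgd_round(w, shards, grad_fn, lr=0.1, local_steps=4):
    """One communication round: every worker starts from the shared w,
    runs `local_steps` of SGD on its own data shard, then results are averaged."""
    replicas = []
    for shard in shards:
        w_local = w.copy()
        for _ in range(local_steps):
            w_local -= lr * grad_fn(w_local, shard)  # local SGD step
        replicas.append(w_local)
    return np.mean(replicas, axis=0)  # federated averaging
```

Communication happens once per round rather than once per step, which is the efficiency win; the asynchronous variant in the paper relaxes the requirement that all workers finish a round together.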
Arthur Douillard (@ar_douillard)'s Twitter Profile Photo

We release the async extension of DiLoCo shared in November, led by our amazing intern Bo Liu! πŸ‘€ TL;DR: we do distributed data-parallelism of a language model across the world, synchronized every 10-100 steps, AND using heterogeneous devices 🧡 below
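My reading of the DiLoCo-style synchronization step, sketched below for illustration (not the released code): each worker's parameter delta after its local steps is treated as a pseudo-gradient, which the server applies with Nesterov momentum; in the async variant, deltas arrive one worker at a time rather than all at once.

```python
import numpy as np

def outer_step(w_server, w_worker, velocity, outer_lr=0.7, momentum=0.9):
    """Apply one worker's delta as a pseudo-gradient with Nesterov momentum."""
    pseudo_grad = w_server - w_worker                       # negated direction the worker moved
    velocity = momentum * velocity + pseudo_grad            # momentum buffer update
    w_server = w_server - outer_lr * (momentum * velocity + pseudo_grad)  # Nesterov step
    return w_server, velocity
```

Synchronizing only these deltas every 10-100 local steps is what makes training across slow, heterogeneous links feasible.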

Bo Liu (@cranialxix)'s Twitter Profile Photo

Interested in the continual adaptation of large AI models? Join us by submitting your work to our NeurIPS workshop :) This is a great opportunity to engage with experts and advance the dialogue on how foundation models can be dynamically updated. Deadline is Sept 9th AoE.

Kaizhao Liang (@kyleliang5)'s Twitter Profile Photo

SVD in GaLore is an OVERKILL! Lyapunov analysis says any reasonable projection matrix works. Here comes Online Subspace Descent, a new family of memory-efficient optimizers for LLMs. πŸ––
πŸ“œ: arxiv.org/abs/2408.12857
πŸ§‘β€πŸ’»: github.com/kyleliang919/O…
πŸ€—: huggingface.co/papers/2408.12…

Work done
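A toy illustration of the subspace-descent idea (my sketch, not the linked code): optimizer state lives in a low-rank subspace defined by a projection matrix P, and the claim above is that P need not come from an SVD of the gradient as in GaLore; here a random orthonormal P stands in for "any reasonable projection".

```python
import numpy as np

def projected_momentum_step(W, G, P, M, lr=1e-2, beta=0.9):
    """W: (m, n) weights, G: (m, n) gradient, P: (m, r) projection,
    M: (r, n) momentum state kept only in the r-dim subspace."""
    Gp = P.T @ G                 # project the gradient into the subspace
    M = beta * M + (1 - beta) * Gp
    W = W - lr * (P @ M)         # map the low-rank update back to full space
    return W, M

def random_projection(m, r, seed=0):
    """Random orthonormal projection, used here instead of GaLore's SVD."""
    rng = np.random.default_rng(seed)
    Q, _ = np.linalg.qr(rng.standard_normal((m, r)))
    return Q
```

The memory saving comes from M being (r, n) instead of (m, n); the paper additionally updates P online.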
Yu Zhang πŸ³πŸ™‡ (@yzhang_cs)'s Twitter Profile Photo

πŸΎπŸΎπŸΎπ™€π™­π™˜π™žπ™©π™šπ™™ 𝙩𝙀 π™žπ™£π™©π™§π™€π™™π™ͺπ™˜π™š 𝙀π™ͺ𝙧 π™‘π™–π™©π™šπ™¨π™© 𝙬𝙀𝙧𝙠: π™‚π™–π™©π™šπ™™ π™Žπ™‘π™€π™© π˜Όπ™©π™©π™šπ™£π™©π™žπ™€π™£ (π™‚π™Žπ˜Ό), a new linear attention model inspired by ABC Hao Peng and GLA Songlin Yang Bailin Wang. Paper link: arxiv.org/abs/2409.07146 huggingface.co/papers/2409.07…

Bo Liu (@cranialxix)'s Twitter Profile Photo

RWKV-7's update is pretty similar to the Longhorn model's update (arxiv.org/pdf/2407.14207), which is derived explicitly from solving online associative recall in closed form. The Householder transform used in RWKV-7, (diag(w) - \alpha^\top \beta), stems from optimizing a
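To unpack the connection: both updates can be read as variants of the classic delta rule, which takes one step of online regression toward storing the pair (k_t -> v_t) in a matrix state S. A generic sketch of that rule (not either model's exact parameterization, which adds decay/gating terms):

```python
import numpy as np

def delta_rule_step(S, k, v, beta=0.5):
    """S: (dv, dk) associative memory; write the pair (k -> v) with step size beta.
    Algebraically equivalent form: S <- S (I - beta k k^T) + beta v k^T."""
    err = v - S @ k                    # what S currently recalls for k, vs. target v
    return S + beta * np.outer(err, k) # move the recall toward v
```

With beta = 1 / (k @ k) a single step stores the pair exactly; RWKV-7's diag(w) factor replaces the identity with a learned decay.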

Jiaheng Hu (@jiahenghu1)'s Twitter Profile Photo

πŸš€ Despite efforts to scale up Behavior Cloning for Robots, large-scale BC has yet to live up to its promise. How can we break through the performance plateau? Introducing πŸ”₯FLaRe: fine-tuning large-scale robot policies with Reinforcement Learning.
robot-flare.github.io 🧡
Bo Liu (@cranialxix)'s Twitter Profile Photo

One line of code for improved training by ensuring the update aligns with the gradient. Note that there is no need to tune hyperparameters; just use those from AdamW or Lion.
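The "one line" in question (this is the Cautious Optimizers idea) masks out update coordinates whose sign disagrees with the current gradient, then rescales. A NumPy sketch of that masking applied to a generic base-optimizer update u (function name is my own):

```python
import numpy as np

def cautious(u, g, eps=1e-8):
    """Zero out coordinates of update u that point against gradient g,
    rescaling so the surviving coordinates keep the average update magnitude."""
    mask = (u * g > 0).astype(u.dtype)       # 1 where update and gradient agree in sign
    return u * mask / max(mask.mean(), eps)  # rescale by the fraction kept
```

Wrapped around AdamW or Lion, this leaves their hyperparameters untouched, which is why no retuning is needed.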

Ross Wightman (@wightmanr)'s Twitter Profile Photo

I was going to publish a new timm release yesterday with significant Optimizer updates: Adopt, Big Vision Adafactor, MARS, and LaProp, along with numerous improvements to the factory, typing, etc. And then this popped up in my feed, dang, scope creep. Cautious LAMB runs from the

Ross Wightman (@wightmanr)'s Twitter Profile Photo

One of the last minute papers I added support for that delayed this release was 'Cautious Optimizers' As I promised, I pushed some sets of experiments at huggingface.co/rwightman/timm…. Consider me impressed, this boost appears more consistent than some of the new optimizers -- it's a

Bo Liu (@cranialxix)'s Twitter Profile Photo

For imitation learning in robotics: as cheap as behavioral cloning, as expressive as diffusion policy. From the original group that designed the rectified flow.

Bo Liu (@cranialxix)'s Twitter Profile Photo

If you are interested in learning/using flow/diffusion models, please check this thread from the original author of rectified flow (RF). It contains: 1. a tutorial blog (to quickly get a sense of what RF is and some interesting findings we had lately) 2. a codebase (a minimal
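A minimal sketch of the rectified-flow idea itself (my toy illustration, not the linked tutorial or codebase): training regresses a velocity field toward the constant straight-line velocity between noise x0 and data x1, and sampling integrates that field with Euler steps.

```python
import numpy as np

def rf_training_pair(x0, x1, rng):
    """Sample one (time, input, target) triple for the velocity-matching loss."""
    t = rng.random()                 # time uniform in [0, 1]
    x_t = (1 - t) * x0 + t * x1      # point on the straight path
    v_target = x1 - x0               # constant straight-line velocity
    return t, x_t, v_target

def rf_sample(v_fn, x0, steps=10):
    """Euler integration of dx/dt = v(x, t) from x0 at t=0 to t=1."""
    x, dt = x0.copy(), 1.0 / steps
    for i in range(steps):
        x = x + dt * v_fn(x, i * dt)
    return x
```

The straight paths are what make few-step (even one-step, after reflow) sampling work well.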

Association for Computing Machinery (@theofficialacm)'s Twitter Profile Photo

πŸ™Œ Meet the 2024 ACM Technical Awards Recipients!
We’re proud to honor this year’s innovators in autonomous systems, cryptography, and software for parallel computers:

πŸ† Peter Stone – ACM-AAAI Allen Newell Award
For significant contributions to the theory and practice of
Qi Wang (@qiwang067)'s Twitter Profile Photo

πŸš€ Excited to announce our workshop β€œEmbodied World Models for Decision Making” at #NeurIPS2025! πŸŽ‰

Keynote speakers, panelists, and content are now live! Check out:
πŸ‘‰ embodied-world-models.github.io
#WorldModels #RL #NeurIPS #NeurIPS2025 #neuripsworkshop #workshop