
Bo Liu
@cranialxix
Research Scientist @Meta FAIR | CS PhD @UT Austin | Former Research Intern @DeepMind, @Nvidia, @Baidu
ID: 953831169807675395
https://cranial-xix.github.io/ 18-01-2018 03:27:27
30 Tweets
329 Followers
206 Following


Thrilled to announce the first annual Reinforcement Learning Conference (@RL_Conference), which will be held at UMass Amherst August 9-12! RLC is the first strongly peer-reviewed RL venue with proceedings, and our call for papers is now available: rl-conference.cc.




SVD in GaLore is an OVERKILL! Lyapunov analysis says any reasonable projection matrix works. Here comes Online Subspace Descent, a new family of memory-efficient optimizers for LLMs. Paper: arxiv.org/abs/2408.12857 🧑‍💻: github.com/kyleliang919/O… 🤗: huggingface.co/papers/2408.12… Work done


Excited to introduce our latest work: Gated Slot Attention (GSA), a new linear attention model inspired by ABC (Hao Peng) and GLA (Songlin Yang, Bailin Wang). Paper link: arxiv.org/abs/2409.07146 huggingface.co/papers/2409.07…





One of the last-minute papers I added support for that delayed this release was 'Cautious Optimizers'. As I promised, I pushed some sets of experiments at huggingface.co/rwightman/timm…. Consider me impressed, this boost appears more consistent than some of the new optimizers -- it's a



