
Matt Henderson
@matthen2
maths, visualisations, conversational AI.
VP Research @polyaivoice
prev: @RekaAILabs, @Apple AI/ML, @GoogleAI, PhD @Cambridge_Eng
ID: 115190692
https://www.matthen.com/ 17-02-2010 22:23:19
6,6K Tweet
79,79K Takipçi
2,2K Takip Edilen


what simple tricks are there to improve vanilla LoRA training for LLMs (especially in DPO)? LoRA+ by Soufiane Hayou is easy to try- scale up the B gradients. What about simple ways to better initialize the matrices? PiSSA looks easy to try (bad name..)



