Matt Henderson (@matthen2)'s Twitter Profile
Matt Henderson

@matthen2

maths, visualisations, conversational AI.
VP Research @polyaivoice
prev: @RekaAILabs, @Apple AI/ML, @GoogleAI, PhD @Cambridge_Eng

ID: 115190692

Link: https://www.matthen.com/

Joined: 17-02-2010 22:23:19

6.6K Tweets

79.79K Followers

2.2K Following

Artificial Analysis (@artificialanlys)

Reka AI has launched Reka Flash 3, a new open source 21B parameter reasoning model - the highest scoring model of its size

Key details:

➤ Artificial Analysis Intelligence Index of 47, beating almost all non-reasoning models with just 21B total parameters
➤ Stronger than all

Matt Henderson (@matthen2)

what simple tricks are there to improve vanilla LoRA training for LLMs (especially in DPO)? LoRA+ by Soufiane Hayou is easy to try: scale up the B gradients. What about simple ways to better initialize the matrices? PiSSA looks easy to try (bad name..)
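
For readers unfamiliar with the two tricks named in the tweet, here is a rough NumPy sketch, not taken from the tweet or from either method's reference code. It assumes the usual LoRA convention of a weight update `B @ A`, treats PiSSA as initializing the adapter from the top singular triplets of the pretrained weight while freezing the remainder, and treats LoRA+ as giving `B` a larger learning rate than `A` (which matches "scaling up the B gradients" only for plain SGD). The function names, the toy matrix shape, and the ratio of 16 are illustrative assumptions.

```python
import numpy as np


def pissa_init(W, rank):
    """PiSSA-style init (sketch): adapter from the top-`rank` singular triplets.

    Returns (W_res, B, A) with W_res + B @ A == W, where W_res would be
    frozen and only B and A would be trained.
    """
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    sqrt_s = np.sqrt(S[:rank])
    B = U[:, :rank] * sqrt_s           # shape (out_features, rank)
    A = sqrt_s[:, None] * Vt[:rank]    # shape (rank, in_features)
    W_res = W - B @ A                  # residual carrying the smaller singular values
    return W_res, B, A


def loraplus_learning_rates(base_lr, ratio=16.0):
    """LoRA+-style learning rates (sketch): B learns `ratio` times faster than A.

    The default ratio is an assumed starting point, not a tuned value.
    """
    return {"A": base_lr, "B": base_lr * ratio}


# Toy example on a random matrix standing in for a pretrained weight.
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 6))
W_res, B, A = pissa_init(W, rank=2)
lrs = loraplus_learning_rates(1e-4)
```

In a real training setup the two learning rates would typically be passed to the optimizer as separate parameter groups for the `A` and `B` matrices; how well either trick carries over to DPO is the open question the tweet is asking.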