Matt Henderson (@matthen2) Twitter Tweets • TwiCopy

Matt Henderson

@matthen2

+ Follow

maths, visualisations, conversational AI.
VP Research @polyaivoice
prev: @RekaAILabs, @Apple AI/ML, @GoogleAI, PhD @Cambridge_Eng

ID: 115190692

linkhttps://www.matthen.com/ calendar_today17-02-2010 22:23:19

6,6K Tweet

79,79K Takipçi

2,2K Takip Edilen

Artificial Analysis

@artificialanlys

9 months ago

Reka AI has launched Reka Flash 3, a new open source 21B parameter reasoning model - the highest scoring model of its size Key details: ➤ Artificial Analysis Intelligence Index of 47, beating almost all non-reasoning models with just 21B total parameters ➤ Stronger than all

thumb_up_off_alt182

chat_bubble_outline2

repeat34

shareShare

Matt Henderson

@matthen2

8 months ago

what simple tricks are there to improve vanilla LoRA training for LLMs (especially in DPO)? LoRA+ by Soufiane Hayou is easy to try- scale up the B gradients. What about simple ways to better initialize the matrices? PiSSA looks easy to try (bad name..)

thumb_up_off_alt1

chat_bubble_outline1

repeat0

shareShare