Arya (@aryagxr) Twitter Tweets • TwiCopy

Arya

@aryagxr

+ Follow

22 | engineering | 📚🧎‍♀️

ID: 818438092222709762

linkhttp://aryagxr.com calendar_today09-01-2017 12:43:41

72 Tweet

19 Takipçi

173 Takip Edilen

Arya

@aryagxr

4 months ago

here's an update on RLing wordle, my RFT run is all setup for bootstrapping from an initial set of wordle prompts and feedback history. what's next is to intuit how different config params improve reasoning + speedups, (topK, temperature, beta, etc) but for now I have code that

thumb_up_off_alt44

chat_bubble_outline1

repeat1

shareShare

tokenbender

@tokenbender

4 months ago

link - tokenbender.com/post.html?id=a…

thumb_up_off_alt77

chat_bubble_outline3

repeat6

shareShare

Arya

@aryagxr

4 months ago

+1 for this you can really tell that a lot of work has gone into turning off sycophancy in gpt5, and I’m really liking it. ( i noticed emojis have reduced too ! )

thumb_up_off_alt4

chat_bubble_outline1

repeat0

shareShare

Arya

@aryagxr

4 months ago

<experimenting> the purple run is what loose clipping and dense rewards are doing for me, clip=1.0, lr=1e-4, weight_decay=1.0 and thanks to Anish for some generous brev compute credits!

<experimenting>
the purple run is what loose clipping and dense rewards are doing for me,
clip=1.0, lr=1e-4, weight_decay=1.0

and thanks to <a href="/athreesh/">Anish</a> for some generous brev compute credits!

thumb_up_off_alt11

chat_bubble_outline0

repeat0

shareShare

Arya

@aryagxr

4 months ago

really fun read we need evals that probe language models to output “what it sees”, not “what it thinks it sees” also very cool interp approach to visualize learned world model

thumb_up_off_alt10

chat_bubble_outline0

repeat0

shareShare