xjdr (@_xjdr) 's Twitter Profile
xjdr

@_xjdr

hot takes, linear Algebra, JAX apologist, Raconteur

ID: 1731404872871006208

calendar_today03-12-2023 20:07:57

3,3K Tweet

8,8K Followers

353 Following

xjdr (@_xjdr) 's Twitter Profile Photo

wow, this is kind of wild to wake up to. Thank you to everyone who stared entropix! Update: There are so many things i want to try right now but i am limiting myself to working on adding more diverse prompts to the repo for testing and evals so we can move beyond vibes to

wow, this is kind of wild to wake up to. Thank you to everyone who stared entropix!

Update: 
There are so many things i want to try right now but i am limiting myself to working on adding more diverse prompts to the repo for testing and evals so we can move beyond vibes to
xjdr (@_xjdr) 's Twitter Profile Photo

to double down on this, the specific original goal was to see what we could accomplish with a vanilla OSS model without touching the weights or the architecture at all. This is a series of inference time compute experiments that essentially use the model outputs as as read only

xjdr (@_xjdr) 's Twitter Profile Photo

While i'm waiting for some evals to finish, i decided to impl this idea, we have like 9 axis now, so the colors make less sense without a legend. We're also making some very good progress on the sampler still. Answers and process is getting better every day.

While i'm waiting for some evals to finish, i decided to impl this idea, we have like 9 axis now, so the colors make less sense without a legend.

We're also making some very good progress on the sampler still. Answers and process is getting better every day.
xjdr (@_xjdr) 's Twitter Profile Photo

i test entropix on all sorts of prompts, i just post the 9.9 vs 9.11 one because its short. We are doing all sorts of crazy things with this 1B model

i test entropix on all sorts of prompts, i just post the 9.9 vs 9.11 one because its short. We are doing all sorts of crazy things with this 1B model
xjdr (@_xjdr) 's Twitter Profile Photo

the important part here other than the correct reasoning and answer is it put the '...' pause token in exactly the right place where it usually gets the answer wrong. I am starting to get more and more consistent results on logical reasoning problems now ...

xjdr (@_xjdr) 's Twitter Profile Photo

I'm not sure most of you are ready for this. The GIL is very useful and heavily relied on in many unexpected ways. I am personally looking forward to all the new and exciting race conditions we are about to find in core libraries

xjdr (@_xjdr) 's Twitter Profile Photo

my 4090 died over night. I actually think i managed to work it to death. RIP king, you've served the cause with distinction and honor

xjdr (@_xjdr) 's Twitter Profile Photo

in the mean time, what's everyone's favorite service to rent a big GPU dev box? Probably a few H100s? I'll probably try it for a few days and then succumb to what i know is right and just going back to doing primary development work on TPUs, but this might be fun

xjdr (@_xjdr) 's Twitter Profile Photo

independent entropix result confirmation on the latest and greatest update, this time with torch! i think we may have made something good