NobodyExistsOnTheInternet (@nullvaluetensor) 's Twitter Profile
NobodyExistsOnTheInternet

@nullvaluetensor

Human Large Language model. Skills:
Distill data.
Training LLMs.
Test and Evaluate.
Rinse and repeat as required.

Based in SEA.

ID: 1722277089888649217

calendar_today08-11-2023 15:37:16

124 Tweet

384 Takipçi

45 Takip Edilen

lodestone-rock (@lodestonee621) 's Twitter Profile Photo

a really crude and quick 1 step (1 NFE) flow experiment based on mean flow paper, but modified without using JVP fwd to circumvent the forward autodiff issue in the attention computation. one step image generation is possible now, no distillation required! arxiv.org/abs/2505.13447

a really crude and quick 1 step (1 NFE) flow experiment based on mean flow paper, but modified without using JVP fwd to circumvent the forward autodiff issue in the attention computation. one step image generation is possible now, no distillation required!
arxiv.org/abs/2505.13447
Teknium (e/λ) (@teknium1) 's Twitter Profile Photo

Teng Yan · 30 days of COT Nous Research For anyone wondering: 1 - more compute is coming online - we are just about to add the compute for the 500k donations raised and will again open the pool for even more compute 2 - this is a test network, meaning the purpose alongside the actual model being trained is to find

Teknium (e/λ) (@teknium1) 's Twitter Profile Photo

Its funny that Anthropic and Google are the only competitors in coding AI that matter right now openai has just become a gen-media company and I only see people meaningfully using it for imagen and voice mode entertainment

Guacamole (@elomaguac) 's Twitter Profile Photo

Grok4 Heavy Agent 1: This is my answer Grok4 Heavy Agent 2: bro did your mother drop you this is the answer Grok4 Heavy Agent 3: Hooligans, both of you! This is the answer! Grok4 Heavy Agent 4: 早上好中国现在我有冰淇淋!!!

Brendan Dolan-Gavitt (@moyix) 's Twitter Profile Photo

Gemini (offline, hallucinating wildly): i shall peruse materials from the internet to solve this [somehow solves the problem anyway] Claude: as my great-uncle once told me at an oyster bar in Hanoi, o3: | have you | heard the | | --- | --- | | good word | about TABLES?? |

Nous Research (@nousresearch) 's Twitter Profile Photo

Atropos v0.3 is now out! Our RL Environments framework has seen a lot of upgrades since v0.2 - some highlights: - Atropos can now be used as a benchmarking and evaluations framework by Roger Jin, with our first external benchmark, Reward-Bench 2! - Added the Reasoning Gym,

near (@nearcyan) 's Twitter Profile Photo

gabriel my meta-experience here is that with issues this unique (where you *should* be doing great by any reasonable metric, but are not at all), it is a long and oft hellish journey, but also very hard to outsource since everything can be so n=1, you have to do all the tests on

Nous Research (@nousresearch) 's Twitter Profile Photo

Our Researcher in Residence nightwing will be discussing his work on SMC steering at UC Berkeley on Aug 3. Check out the blog on this work here: nousresearch.com/steering-the-s… Details below!

NobodyExistsOnTheInternet (@nullvaluetensor) 's Twitter Profile Photo

Can someone explain why only sonnet 4 on a day to day basis has hugely different performance? Does Anthropic just decide to deploy different version of sonnet 4 every other day? All the models are gaussianish, only sonnet is step-like. github.com/jacobphillips9…

Can someone explain why only sonnet 4 on a day to day basis has hugely different performance? Does Anthropic just decide to deploy different version of sonnet 4 every other day? 

All the models are gaussianish, only sonnet is step-like. 
github.com/jacobphillips9…