NobodyExistsOnTheInternet (@nullvaluetensor) Twitter Tweets • TwiCopy

NobodyExistsOnTheInternet

@nullvaluetensor

Human Large Language model. Skills:
Distill data.
Training LLMs.
Test and Evaluate.
Rinse and repeat as required.

Based in SEA.

ID: 1722277089888649217

08-11-2023 15:37:16

124 Tweets

384 Followers

45 Following


NobodyExistsOnTheInternet

@nullvaluetensor

7 months ago

Everyone is praising Google right now, but they just killed transparent CoTs.

Likes: 0 · Replies: 0 · Reposts: 0

lodestone-rock

A really crude and quick 1-step (1 NFE) flow experiment based on the mean flow paper, but modified without using JVP fwd to circumvent the forward autodiff issue in the attention computation. One-step image generation is possible now, no distillation required! arxiv.org/abs/2505.13447

Likes: 20 · Replies: 3 · Reposts: 2

Teknium (e/λ)

@teknium1

6 months ago

Teng Yan · 30 days of COT Nous Research For anyone wondering:
1 - more compute is coming online - we are just about to add the compute for the 500k donations raised and will again open the pool for even more compute
2 - this is a test network, meaning the purpose alongside the actual model being trained is to find

Likes: 116 · Replies: 10 · Reposts: 3

Teknium (e/λ)

@teknium1

6 months ago

It's funny that Anthropic and Google are the only competitors in coding AI that matter right now. OpenAI has just become a gen-media company, and I only see people meaningfully using it for imagen and voice mode entertainment.

Likes: 1.1K · Replies: 160 · Reposts: 54

NobodyExistsOnTheInternet

@nullvaluetensor

5 months ago

x.com/onehappyfellow… It now makes sense why Jane Street would fund AK47s to South Sudan

Likes: 0 · Replies: 0 · Reposts: 0

Christopher McMaster

@rheum_ai

5 months ago

I have an alternative proposal.

Likes: 312 · Replies: 15 · Reposts: 36

Guacamole

@elomaguac

5 months ago

Grok4 Heavy Agent 1: This is my answer
Grok4 Heavy Agent 2: bro did your mother drop you this is the answer
Grok4 Heavy Agent 3: Hooligans, both of you! This is the answer!
Grok4 Heavy Agent 4: 早上好中国现在我有冰淇淋！！！ [Good morning China, now I have ice cream!!!]

Likes: 3 · Replies: 0 · Reposts: 1

Brendan Dolan-Gavitt

@moyix

5 months ago

Gemini (offline, hallucinating wildly): i shall peruse materials from the internet to solve this [somehow solves the problem anyway]
Claude: as my great-uncle once told me at an oyster bar in Hanoi,
o3:
| have you | heard the |
| --- | --- |
| good word | about TABLES?? |

Likes: 136 · Replies: 5 · Reposts: 6

Nous Research

@nousresearch

5 months ago

Atropos v0.3 is now out! Our RL Environments framework has seen a lot of upgrades since v0.2 - some highlights:
- Atropos can now be used as a benchmarking and evaluations framework by Roger Jin, with our first external benchmark, Reward-Bench 2!
- Added the Reasoning Gym,

Likes: 233 · Replies: 13 · Reposts: 30

Nous Research

@nousresearch

5 months ago

Congrats to our post-training team who worked on the Hermes 3 dataset - Teknium (e/λ), NobodyExistsOnTheInternet, and outside contributor interstellarninja - on creating the now #1 Trending dataset on HuggingFace!

Likes: 323 · Replies: 21 · Reposts: 33

near

@nearcyan

5 months ago

gabriel my meta-experience here is that with issues this unique (where you *should* be doing great by any reasonable metric, but are not at all), it is a long and oft hellish journey, but also very hard to outsource since everything can be so n=1, you have to do all the tests on

Likes: 149 · Replies: 5 · Reposts: 4

NobodyExistsOnTheInternet

@nullvaluetensor

5 months ago

Me too R1. The question mark is also "low-key" bothering me.

Likes: 1 · Replies: 0 · Reposts: 0

Nous Research

@nousresearch

4 months ago

Our Researcher in Residence nightwing will be discussing his work on SMC steering at UC Berkeley on Aug 3. Check out the blog on this work here: nousresearch.com/steering-the-s… Details below!

Likes: 109 · Replies: 5 · Reposts: 7

NobodyExistsOnTheInternet

@nullvaluetensor

4 months ago

Can someone explain why only Sonnet 4 has hugely different performance from day to day? Does Anthropic just decide to deploy a different version of Sonnet 4 every other day? All the other models are Gaussian-ish; only Sonnet is step-like. github.com/jacobphillips9…

Likes: 0 · Replies: 0 · Reposts: 0
