BertrandRussell simp (@brussellsimp) 's Twitter Profile
BertrandRussell simp

@brussellsimp

axiomatic order

ID: 1740519460770045952

Joined: 28-12-2023 23:46:07

1.1K Tweets

218 Followers

2.2K Following

BertrandRussell simp (@brussellsimp) 's Twitter Profile Photo

Nvidia is now stressing LLM size reduction. Nvidia's FP4 quantization (1/2 the size) of DeepSeek's R1-0528 comes with 1% degradation. TensorRT optimizer is the latest step after the NAS (neural architecture search) implementation in Nemotron Ultra, which successively pruned Llama 405B.

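As a rough back-of-envelope illustration of the size claim (my own sketch, not NVIDIA's tooling), the following assumes a 671B parameter count for R1-0528 and counts only weight storage:

```python
# Back-of-envelope sketch: approximate weight-memory footprint at different
# precisions. The 671B parameter count is an assumption for illustration;
# FP4 means 4 bits per weight, ignoring scales, activations, and KV cache.

def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in gigabytes."""
    return num_params * bits_per_weight / 8 / 1e9

params = 671e9  # assumed parameter count
for name, bits in [("FP16", 16), ("FP8", 8), ("FP4", 4)]:
    print(f"{name}: ~{weight_memory_gb(params, bits):,.0f} GB")
# FP4 is half the size of an FP8 baseline and a quarter of FP16, which is
# where the "1/2 size" figure comes from.
```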
Jerry Wei (@jerryweiai) 's Twitter Profile Photo

Today marks my one-year anniversary at Anthropic, and I've been reflecting on some of the most impactful lessons I've learned during this incredible journey. One of the most striking realizations has been just how much a small, talent-dense team can accomplish. When I first

Sakana AI (@sakanaailabs) 's Twitter Profile Photo

We’re excited to introduce Text-to-LoRA: a Hypernetwork that generates task-specific LLM adapters (LoRAs) based on a text description of the task. Catch our presentation at #ICML2025! Paper: arxiv.org/abs/2506.06105 Code: github.com/SakanaAI/Text-… Biological systems are capable of
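A minimal sketch of the idea, assuming PyTorch; the module shapes, names, and scaling factor below are illustrative assumptions, not Sakana AI's implementation:

```python
# Sketch of the Text-to-LoRA idea: a hypernetwork maps a task-description
# embedding to the low-rank A/B matrices of a LoRA adapter for one target
# weight matrix. All sizes and names here are illustrative assumptions.
import torch
import torch.nn as nn

class TextToLoRA(nn.Module):
    def __init__(self, text_dim=768, hidden=1024, d_model=4096, rank=8):
        super().__init__()
        self.rank, self.d_model = rank, d_model
        self.hyper = nn.Sequential(
            nn.Linear(text_dim, hidden),
            nn.GELU(),
            nn.Linear(hidden, 2 * d_model * rank),  # flattened A and B
        )

    def forward(self, task_embedding: torch.Tensor):
        """task_embedding: (text_dim,) -> LoRA factors A (d_model, r), B (r, d_model)."""
        flat = self.hyper(task_embedding)
        a, b = flat.split(self.d_model * self.rank)
        A = a.view(self.d_model, self.rank)
        B = b.view(self.rank, self.d_model)
        return A, B

# Usage: generate an adapter from a (placeholder) task embedding and apply it
# as a low-rank update to a frozen base weight W.
hypernet = TextToLoRA()
task_emb = torch.randn(768)        # stand-in for an encoded task description
A, B = hypernet(task_emb)
W = torch.randn(4096, 4096)        # frozen base weight (illustrative)
W_adapted = W + (A @ B) * (1 / 8)  # LoRA-style update with scale alpha/r
```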

Morph (@morph_labs) 's Twitter Profile Photo

We are excited to announce Trinity, an autoformalization system for verified superintelligence that we have developed at Morph. We have used it to automatically formalize in Lean a classical result of de Bruijn that the abc conjecture is true almost always.

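As a loose illustration only (not Trinity's output), this is roughly the flavor of statement such a formalization deals with in Lean 4 with Mathlib; the definitions and names are my own assumptions:

```lean
-- Illustrative only: the kind of abc-style statement that an
-- autoformalization system would produce and prove facts about.
import Mathlib

/-- Radical of `n`: the product of its distinct prime factors. -/
def rad (n : ℕ) : ℕ := n.primeFactors.prod id

/-- One concrete instance of the abc inequality for a coprime triple
    `a + b = c`: `c ≤ rad (a * b * c) ^ 2` (exponent `2` standing in
    for `1 + ε`). "Almost always" results concern how rarely this fails. -/
def abcHolds (a b c : ℕ) : Prop :=
  a + b = c ∧ Nat.Coprime a b ∧ c ≤ rad (a * b * c) ^ 2
```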
wh (@nrehiew_) 's Twitter Profile Photo

This result that "reasoning" features learnt by SAEs can be transferred **as is** across MODELS and datasets is super cool and similar in spirit to Mistral's finding that there exists a low-dim reasoning direction

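A toy sketch of the underlying idea under stated assumptions (a difference-of-means direction and a shared hidden size), not the paper's actual method:

```python
# Toy sketch: derive a low-dimensional "reasoning" direction from one model's
# residual activations and apply it as a steering vector in another model
# with the same hidden size. Shapes and data here are stand-ins.
import torch

d_model = 4096  # assumed shared hidden size across source and target models

# Stand-ins for residual-stream activations collected from the SOURCE model
# on reasoning-heavy vs. plain prompts, shape (n_samples, d_model).
acts_reasoning = torch.randn(512, d_model) + 0.1
acts_plain     = torch.randn(512, d_model)

# Difference-of-means direction, normalized: a 1-D "reasoning" feature.
direction = acts_reasoning.mean(0) - acts_plain.mean(0)
direction = direction / direction.norm()

def steer(residual: torch.Tensor, alpha: float = 4.0) -> torch.Tensor:
    """Add the transferred direction to the TARGET model's residual stream."""
    return residual + alpha * direction

# In practice this would be registered as a forward hook on a chosen layer of
# the target model; here we just apply it to a dummy activation batch.
target_resid = torch.randn(8, d_model)
steered = steer(target_resid)
```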
rohan anil (@_arohan_) 's Twitter Profile Photo

Last day today at AI at Meta. Reflecting on the last several months, I wanted to highlight a few things I enjoyed working on: building new algorithms for on-policy distillation with Dat Huynh; the science of end-to-end thinking models with Rishabh Agarwal and many others; a working prototype of

Tongzhou Wang (@ssnl_tz) 's Twitter Profile Photo

such a nice & clear articulation of the big question by Seohong Park! also thanks for mentioning Quasimetric RL. now I just need to show people this post instead of explaining why I am excited by QRL :)

BertrandRussell simp (@brussellsimp) 's Twitter Profile Photo

Reinforcement learning suits distributed training much better than pre-training does. Rewards are sparse, so the important advantage and policy updates are delayed, and the effects of errors in distributed training are lessened; pre-training, in contrast, demands close clustering.
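To make the staleness argument concrete, here is a toy policy-gradient sketch (my own illustration, not a real distributed setup) in which rollouts are collected with a policy snapshot that lags the learner by several updates and learning still converges:

```python
# Toy illustration: REINFORCE on a 2-armed bandit still converges when actors
# sample with a stale policy snapshot, loosely mimicking delayed parameter
# sync in distributed RL. All numbers are illustrative.
import numpy as np

rng = np.random.default_rng(0)
true_reward = np.array([0.2, 0.8])  # arm 1 pays off more often
theta = np.zeros(2)                  # learner's policy logits
stale_theta = theta.copy()           # actors' snapshot of the policy
STALENESS = 10                       # actors refresh only every 10 updates

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

baseline = 0.0
for step in range(2000):
    # Actors sample actions with the (possibly stale) snapshot.
    probs_actor = softmax(stale_theta)
    a = rng.choice(2, p=probs_actor)
    r = rng.binomial(1, true_reward[a])

    # Learner applies a REINFORCE update under its CURRENT policy, with an
    # importance weight correcting for the stale behaviour policy.
    probs_learner = softmax(theta)
    iw = probs_learner[a] / probs_actor[a]
    grad = -probs_learner
    grad[a] += 1.0                   # grad of log pi(a) for a softmax policy
    theta += 0.05 * iw * (r - baseline) * grad
    baseline = 0.99 * baseline + 0.01 * r

    if step % STALENESS == 0:
        stale_theta = theta.copy()   # delayed parameter sync

print("learned policy:", softmax(theta))  # should come to prefer arm 1
```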

Jasper (@zjasper666) 's Twitter Profile Photo

The famous Fields Medalist Mathematician Terence Tao shared his predictions on when AI could become a collaborator capable of producing Fields Medal–level mathematical proofs: > By 2026: AI will become a helpful assistant to mathematicians — a trustworthy partner in mathematical

Yuchen Jin (@yuchenj_uw) 's Twitter Profile Photo

Many PhDs (my past self included) fall into the trap of thinking that publishing in top-tier conferences is the ultimate goal. But publishing ≠ impact. Muon was just a blog post. It got Keller into OpenAI; he might be training GPT-5 with it now. I'm grateful he listed me as

Ashish Vaswani (@ashvaswani) 's Twitter Profile Photo

Check out our latest research on data. We're releasing 24T tokens of richly labelled web data. We found it very useful for our internal data curation efforts. Excited to see what you build using Essential-Web v1.0!

BertrandRussell simp (@brussellsimp) 's Twitter Profile Photo

Energy, compute, and metal alloys are time invariants. Whether an intelligence explosion takes place or not, the expansion and existence of any form of higher intelligence is ensured by them.

Jerry Tworek (@millionint) 's Twitter Profile Photo

To summarize this week:
- we released a general-purpose computer-using agent
- got beaten by a single human in the AtCoder heuristics competition
- solved 5/6 new IMO problems with natural-language proofs
All of those are based on the same single reinforcement learning system

BertrandRussell simp (@brussellsimp) 's Twitter Profile Photo

All breakthroughs
1) Scaling transformers
2) Reasoning at inference via RL
3) Reasoning at inference with tool use
4) (now) Reasoning on hard and tough-to-verify domains
have the OpenAI tag. Even when it looked bleak not so long ago, they keep delivering.

Junyang Lin (@justinlin610) 's Twitter Profile Photo

this is what is not small! the boys spent so much time building Qwen3-Coder after Qwen2.5-Coder. it is much bigger, but based on MoE, and way stronger and smarter than before! not sure we can say it's competitive with Claude Sonnet 4, but it is for sure a really good coding agent.