Daniel Feygin (@innov8r)'s Twitter Profile
Daniel Feygin

@innov8r

Reimagining the future of software delivery with AI platform engineering.

ID: 7104842

Joined: 27-06-2007 08:29:34

620 Tweets

232 Followers

2.2K Following

Yufan Zhuang (@yufan_zhuang)'s Twitter Profile Photo

Can LLMs reason beyond context limits? 🤔 

Introducing Knowledge Flow, a training-free method that helped gpt-oss-120b & Qwen3-235B achieve 100% on the AIME-25, no tools.

How? Like human deliberation, for LLMs.

📝 Blog: yufanzhuang.notion.site/knowledge-flow
💻 Code: github.com/EvanZhuang/kno…
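The thread doesn't spell out the algorithm, so the sketch below is only one plausible reading of a training-free, deliberation-style loop that carries knowledge across bounded context windows; the function names and prompt formats are hypothetical, not from the paper:

```python
def knowledge_flow(problem: str, llm, rounds: int = 4, note_limit: int = 500) -> str:
    """Hypothetical sketch: carry a compact 'knowledge' note across
    independent LLM calls so total deliberation can exceed any single
    context window. Not the paper's actual procedure."""
    knowledge = ""
    for _ in range(rounds):
        # Each call sees only the problem plus the distilled note,
        # never the full transcript of earlier attempts.
        attempt = llm(f"Problem: {problem}\nKnown so far: {knowledge}\nSolve:")
        # Compress the attempt into a bounded note that 'flows' forward.
        knowledge = llm(f"Distill the key insights:\n{attempt}")[:note_limit]
    return llm(f"Problem: {problem}\nInsights: {knowledge}\nFinal answer:")
```

The point of the loop is that total "thinking" grows with `rounds` while each individual call stays inside the model's context limit.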
Chris Dixon (@cdixon)'s Twitter Profile Photo

We’re excited to share our 2025 State of Crypto report.

This year’s story: the maturation of the crypto industry — with growing institutional adoption, the rise of stablecoins, better infrastructure, new consumer experiences, and long-awaited regulatory clarity.

Read the full report.
Andrew White 🐦‍⬛ (@andrewwhite01)'s Twitter Profile Photo

After two years of work, we’ve made an AI Scientist that runs for days and makes genuine discoveries. Working with external collaborators, we report seven externally validated discoveries across multiple fields. It is available right now for anyone to use. 1/5
chastronomic (@chastronomic)'s Twitter Profile Photo

If you’re an "ML Engineer" and you think “Transformer” just means stacking encoder–decoder blocks and calling it a day, you’re missing the actual mechanism that makes modern AI work. Concept 16: The Transformer Is a Math Engine, Not a “Model Architecture "Most people can

Simon Willison (@simonw)'s Twitter Profile Photo

OpenAI aren't talking about it yet, but it turns out they've adopted Anthropic's brilliant "skills" mechanism in a big way. Skills are now live in both ChatGPT and their Codex CLI tool; I wrote up some detailed notes on how they work so far here: simonwillison.net/2025/Dec/12/op…
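For context, Anthropic's skills mechanism is, per their public documentation, a folder containing a `SKILL.md` file whose YAML frontmatter (`name`, `description`) tells the model when to load the full instructions into context. The file below is an illustrative example, not one shipped by either vendor:

```markdown
---
name: changelog-writer
description: Use when the user asks to draft or update a CHANGELOG entry.
---

# Changelog writer

1. Read the diff or PR description the user provides.
2. Group changes under Added / Changed / Fixed headings.
3. Keep each bullet to one line, imperative mood.
```

Only the frontmatter is always in context; the body is loaded on demand, which is what keeps the mechanism cheap.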

Andrej Karpathy (@karpathy)'s Twitter Profile Photo

I love the expression “food for thought” as a concrete, mysterious cognitive capability humans experience but LLMs have no equivalent for. Definition: “something worth thinking about or considering, like a mental meal that nourishes your mind with ideas, insights, or issues that

Boris Cherny (@bcherny)'s Twitter Profile Photo

Andrej Karpathy I feel this way most weeks tbh. Sometimes I start approaching a problem manually, and have to remind myself “claude can probably do this”. Recently we were debugging a memory leak in Claude Code, and I started approaching it the old-fashioned way: connecting a profiler, using the

Engineering (@xeng)'s Twitter Profile Photo

We have open-sourced our new 𝕏 algorithm, powered by the same transformer architecture as xAI's Grok model. Check it out here: github.com/xai-org/x-algo…

Mikel Jollett (@mikel_jollett)'s Twitter Profile Photo

As someone who has studied cults (wrote a bestseller about it), let me tell you something: The lies Trump, Vance and Miller say are not meant to be believed by their followers. They are meant to be REPEATED. The repetition of the lie is the test of loyalty to the cult.

Google Research (@googleresearch)'s Twitter Profile Photo

A common heuristic in LLM agent design—"more agents is better"—might be wrong.

Across 180 configurations, we find multi-agent coordination is task-contingent: +81% on parallelizable tasks (finance), but -70% on sequential ones (planning). Architecture-task alignment matters more
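The contrast the thread draws can be sketched in a few lines; `agent` below is a stand-in for a real LLM call, and the two pipelines are illustrative shapes, not the paper's actual configurations:

```python
from concurrent.futures import ThreadPoolExecutor

def agent(task: str, context: str = "") -> str:
    """Stand-in for a single LLM agent call."""
    return f"result({task}|{context})"

def parallel_pipeline(subtasks: list[str]) -> list[str]:
    # Parallelizable work (e.g. scoring independent assets): subtasks
    # share no state, so extra agents fan out and add throughput.
    with ThreadPoolExecutor() as pool:
        return list(pool.map(agent, subtasks))

def sequential_pipeline(steps: list[str]) -> str:
    # Sequential work (e.g. planning): each step consumes the previous
    # agent's output, so agents form a chain where coordination overhead
    # and errors compound rather than cancel.
    context = ""
    for step in steps:
        context = agent(step, context)
    return context
```

The structural point: adding agents widens the first shape but only lengthens the second, which is one way an architecture–task mismatch could turn "more agents" into a regression.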
Alex Zhang (@a1zhang)'s Twitter Profile Photo

We just updated the RLM paper with some new stuff.

First, we just released RLM-Qwen3-8B, the first natively recursive language model (at tiny scale!).

We post-trained Qwen3-8B using only ~1000 RLM trajectories from domains unrelated to our evaluation benchmarks.

RLM-Qwen3-8B
Leonie (@helloiamleonie)'s Twitter Profile Photo

i'm clearly biased but this is the most interesting take on agent memory i've seen so far.  
(yes, forget the "filesystem vs database" discussion)

a few weeks back i had a nice chat with vintro from Plastic Labs and their approach is:

memory is not a retrieval problem.
Robert Youssef (@rryssf_)'s Twitter Profile Photo

Holy shit… this paper from MIT quietly explains how models can teach themselves to reason when they’re completely stuck 🤯

The core idea is deceptively simple:

Reasoning fails because learning has nothing to latch onto.

When a model’s success rate drops to near zero,
Arvind Narayanan (@random_walker)'s Twitter Profile Photo

Why do coding agents work so well and what would it take to replicate their success in other domains? One important and under-appreciated reason is that agentic coding is a type of neurosymbolic AI. The main weakness of LLMs is that they are statistical machines and struggle at

Greg Brockman (@gdb)'s Twitter Profile Photo

Software development is undergoing a renaissance in front of our eyes. If you haven't used the tools recently, you likely are underestimating what you're missing. Since December, there's been a step function improvement in what tools like Codex can do. Some great engineers at

Omar Khattab (@lateinteraction)'s Twitter Profile Photo

Nope. My lab is making 3 algorithmic bets. One of them is on recursion, RLMs being step 1. Another one is on the power of late interaction retrieval. Conventional single-vector retrieval was always a bottleneck, even back in 2019 when starting ColBERT. So if you're wondering if

Physics In History (@physinhistory)'s Twitter Profile Photo

In 2015, physicists at the University of Rochester discovered the classic 17th-century Wallis formula for π hidden within quantum mechanical calculations of the hydrogen atom's energy levels. It was a purely mathematical relationship found to be baked into the fabric of physical
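For reference, the 17th-century Wallis product that the Rochester calculation recovered:

```latex
\frac{\pi}{2}
  = \prod_{n=1}^{\infty} \frac{2n}{2n-1}\cdot\frac{2n}{2n+1}
  = \frac{2}{1}\cdot\frac{2}{3}\cdot\frac{4}{3}\cdot\frac{4}{5}\cdot\frac{6}{5}\cdot\frac{6}{7}\cdots
```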