Yi Ding -- prod/acc (@yi_ding)'s Twitter Profile
Yi Ding -- prod/acc

@yi_ding

🎗️🕊️ Prev LITS and Partnerships @llama_index, Messaging Apps @Apple, HFT @ GETCO, @Citadel

ID: 14716791

Joined: 09-05-2008 19:07:43

3.3K Tweets

3.3K Followers

2.2K Following

Bob McGrew (@bobmcgrewai)

Don't be disappointed that GPT-4.5 isn't smarter than o1. Scaling up pretraining improves responses across the board. Scaling up reasoning improves responses a lot if they benefit from thinking time and not much otherwise. Wait to find out how the improvements stack together.

Jerry Liu (@jerryjliu0)

Today I’m excited to announce our Series A fundraise by Norwest 🔥 Agents have the potential to automate the majority of knowledge work - whether it’s financial due diligence, support resolution, PRD generation, contract review. Building these agents requires both data and

Jerry Liu (@jerryjliu0)

Mistral OCR is nice and fast, but other models outperform it on document processing. We ran a comprehensive benchmark on Mistral OCR and compared it against a broad set of LLM/LVM-powered parsing techniques, including direct parsing using Gemini

Yi Ding -- prod/acc (@yi_ding)

Every programmer I interviewed was allowed to use ChatGPT/Cursor/Copilot. I expect that to be the norm in a few years. At the present moment it's relatively straightforward to tell the difference between someone who actually understands the code and someone who doesn't. That

Yi Ding -- prod/acc (@yi_ding)

Packaging a library for NPM that works on multiple runtimes is way more challenging than it should be. The Gemini team seems to have taken one of the cleaner approaches I've seen to date. Will need to give it a shot in a future project.
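
The tweet doesn't spell out what the Gemini team actually did, so the sketch below is just one common pattern for multi-runtime npm packages, not their approach: a conditional `exports` map in `package.json` that routes Node's CommonJS `require`, ESM `import`, and browser bundlers to separate builds. The package name and file paths are made up for illustration.

```json
{
  "name": "my-multi-runtime-lib",
  "version": "0.1.0",
  "type": "module",
  "main": "./dist/index.cjs",
  "module": "./dist/index.mjs",
  "types": "./dist/index.d.ts",
  "exports": {
    ".": {
      "types": "./dist/index.d.ts",
      "browser": "./dist/index.browser.mjs",
      "import": "./dist/index.mjs",
      "require": "./dist/index.cjs",
      "default": "./dist/index.mjs"
    }
  },
  "files": ["dist"]
}
```

Node, Bun, and Deno's npm compatibility all resolve through the `exports` map, so keeping the source free of runtime-specific globals and letting the conditions pick the right build is usually most of the battle.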

Simon Willison (@simonw)

Alex Albert Anthropic Am I right in understanding that there's no additional execution of any separate code here at all? You tell Claude "put your thoughts in the think tool if you need to" but it's effectively a no-op - a prompting hack that encourages Claude to "think out loud"
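
For context, here is a minimal sketch of how such a no-op "think" tool could be declared with the Anthropic Messages API in TypeScript. The schema, model name, and prompt are illustrative rather than the exact ones from Anthropic's post; the point is that no handler code exists for the tool.

```typescript
import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic(); // reads ANTHROPIC_API_KEY from the environment

const response = await client.messages.create({
  model: "claude-3-7-sonnet-latest", // illustrative model name
  max_tokens: 1024,
  tools: [
    {
      // Declared like any other tool, but nothing on our side ever executes it.
      // It only gives Claude a sanctioned place to write intermediate reasoning.
      name: "think",
      description:
        "Use this tool to think about something. It will not obtain new " +
        "information or change anything; it just records the thought.",
      input_schema: {
        type: "object",
        properties: {
          thought: { type: "string", description: "A thought to record." },
        },
        required: ["thought"],
      },
    },
  ],
  messages: [
    { role: "user", content: "Check the refund policy, then answer the customer." },
  ],
});

// If Claude emits a tool_use block for "think", no code runs for it; at most we
// send back an empty tool_result and let the conversation continue.
console.log(JSON.stringify(response.content, null, 2));
```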

Yi Ding -- prod/acc (@yi_ding)

One of the reasons why LLM hallucinations are so hard to deal with is that the models generally output responses in a confident tone of voice. AI "gaslighting" or evaluating underlying model certainties may be a way to uncover actual confidence vs. good bluffs.
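
One crude way to probe "underlying model certainty" is to look at token-level log probabilities where an API exposes them. Treating the mean token probability as a confidence score is an assumption made here for illustration, not an established calibration method. A sketch with the OpenAI Node SDK:

```typescript
import OpenAI from "openai";

const client = new OpenAI(); // reads OPENAI_API_KEY from the environment

const completion = await client.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "What year was the Eiffel Tower completed?" }],
  logprobs: true,   // return per-token log probabilities
  top_logprobs: 3,  // plus the top alternatives at each position
});

const tokens = completion.choices[0].logprobs?.content ?? [];

// Rough, hand-rolled "confidence": average probability of the emitted tokens.
// A fluent answer produced at low probability may be one of those "good bluffs".
const avgProb =
  tokens.reduce((sum, t) => sum + Math.exp(t.logprob), 0) / Math.max(tokens.length, 1);

console.log(`answer: ${completion.choices[0].message.content}`);
console.log(`mean token probability: ${avgProb.toFixed(3)}`);
```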

Yi Ding -- prod/acc (@yi_ding)

The "inputs only" reinforcement learning mechanism from Databricks is an interesting idea. databricks.com/blog/tao-using… Looks like the crux of it is a custom-developed proprietery reward model, and not LLM as a judge though.

Yi Ding -- prod/acc (@yi_ding)

This Claude interpretability blog is one of the most interesting ones I've read so far this year. We all know what LLMs output, but how do they choose their output? With billions of parameters it might seem impossible. But the result is human-like? anthropic.com/research/traci…

Yi Ding -- prod/acc (@yi_ding)

When chess computers first started getting good, Anand and others pushed for a variant where humans and computers would work together on a team. That variant gradually fell out of favor because the computers got so good that the humans weren't adding anything. We're not ready.