Gytis Daujotas (@gytdau)'s Twitter Profile
Gytis Daujotas

@gytdau

making computers do my bidding.

ID: 3133298386

http://gytis.co · Joined 04-04-2015 10:34:46

464 Tweets

1.1K Followers

314 Following

Matthew Siu (@matthewwsiu)'s Twitter Profile Photo

📣 Excited to share Latticework, a text-editing environment aimed at helping synthesize freeform, unstructured documents ✍️

Made in collaboration w/ Andy Matuschak
Gopal (@gopalkraman)'s Twitter Profile Photo

Gytis Daujotas takes us from a foundational understanding of SAEs and a brief history of mech interp, all the way up to using features as interfaces for expressive control and steering. Watch him blend giraffes with Blade Runner skies and, naturally, cute baby mushrooms. (3/11)

Noa Nabeshima (@noanabeshima)'s Twitter Profile Photo

Excited to share my work on Matryoshka Sparse Autoencoders (SAEs) - a new training approach that helps sparse autoencoders preserve abstract features (like "female words") while still learning fine-grained details (e.g. individual names) in large sparse autoencoders!
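
The tweet doesn't spell out the mechanism, but the name suggests nested dictionaries trained together. Here is a minimal sketch of that idea, assuming the loss sums reconstructions from nested prefixes of the latent vector; the prefix sizes and L1 coefficient are illustrative guesses, not the paper's hyperparameters:

import torch
import torch.nn as nn
import torch.nn.functional as F

class MatryoshkaSAE(nn.Module):
    """Sketch: reconstruct from several nested prefixes of the latents,
    so early latents are pushed toward coarse, abstract features and
    later ones fill in fine-grained detail."""

    def __init__(self, d_model=512, n_latents=4096,
                 prefix_sizes=(256, 1024, 4096), l1_coeff=1e-3):
        super().__init__()
        self.enc = nn.Linear(d_model, n_latents)
        self.dec = nn.Linear(n_latents, d_model)
        self.prefix_sizes = prefix_sizes
        self.l1_coeff = l1_coeff

    def forward(self, x):
        z = F.relu(self.enc(x))                        # sparse latent activations
        loss = self.l1_coeff * z.abs().sum(-1).mean()  # sparsity penalty
        for k in self.prefix_sizes:
            # Decode using only the first k latents (and the matching
            # decoder columns); each prefix must reconstruct x on its own.
            x_hat = F.linear(z[..., :k], self.dec.weight[:, :k], self.dec.bias)
            loss = loss + F.mse_loss(x_hat, x)
        return loss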

Gytis Daujotas (@gytdau)'s Twitter Profile Photo

Feature absorption is one of the main blockers to understanding model representations. This is one of the most promising paths so far. Excited to try it out — and had a lot of fun playing with the interactives! (Still somehow underused as qualitative evidence IMO.)

Max Tegmark (@tegmark)'s Twitter Profile Photo

Our new AI mechanistic interpretability paper shows that LLMs are surprisingly clever: representing 2-digit numbers on a line is noisy, so they represent them on a generalized helix to get better addition accuracy, seemingly exploiting modular addition digit by digit:
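
To see why a helix helps, here is a toy worked example, not the paper's actual construction: embed n as one linear coordinate plus cos/sin pairs at a few periods (10 and 100 below are illustrative choices), and addition becomes exact rotation of each circular component via the angle-addition identities, so the per-digit modular information survives even when the linear part is noisy.

import numpy as np

PERIODS = (10, 100)  # illustrative: one circle per "digit" scale

def helix(n):
    feats = [float(n)]  # the noise-prone linear coordinate
    for T in PERIODS:
        theta = 2 * np.pi * n / T
        feats += [np.cos(theta), np.sin(theta)]
    return np.array(feats)

def helix_add(ha, hb):
    out = [ha[0] + hb[0]]  # linear parts simply add
    for i in range(len(PERIODS)):
        ca, sa = ha[1 + 2 * i], ha[2 + 2 * i]
        cb, sb = hb[1 + 2 * i], hb[2 + 2 * i]
        # Rotate a's point by b's angle (angle-addition identities):
        # exact modular addition on each circle.
        out += [ca * cb - sa * sb, sa * cb + ca * sb]
    return np.array(out)

a, b = 37, 48
assert np.allclose(helix_add(helix(a), helix(b)), helix(a + b))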

marmik @ ICLR (@marmikch)'s Twitter Profile Photo

reasoning traces are very weird. we (w/ Gytis Daujotas) ran a small experiment to intervene on the reasoning trace by prefilling it with a random tangent that has nothing to do with the input question, then ending the reasoning at the prefill tokens without letting the model recover.
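
To make the setup concrete, here is a rough sketch of that kind of intervention; the <think>/</think> delimiters, the model name, and the prompt format are assumptions, since reasoning models differ:

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "some-reasoning-model"  # hypothetical placeholder
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

question = "What is 17 * 24?"
tangent = "I wonder what medieval shipbuilders ate for breakfast."

# Prefill the trace with an unrelated tangent and close the reasoning
# block immediately, so the model must answer without recovering.
prompt = f"{question}\n<think>\n{tangent}\n</think>\n"

inputs = tok(prompt, return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=64)
print(tok.decode(out[0][inputs["input_ids"].shape[1]:]))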

Gytis Daujotas (@gytdau)'s Twitter Profile Photo

Claude indeed calls the police on you if you start talking about switching to another AI provider!

Great for revenue growth I suppose. (thanks Tom McCarthy!)
Gytis Daujotas (@gytdau)'s Twitter Profile Photo

This is addictively good! Paint by wielding the generative model's representations as brushes. Super fast response time from Turbo and surprisingly good adherence really make this one shine. Highly recommend trying it out -- the future of bespoke generative image creation?

Chris Beiser (@ctbeiser)'s Twitter Profile Photo

affirmations:
be bold
avoid actions that may cause harm
think step by step
if needed, do a web search to find more information

Keara Sullivan (@superkeara)'s Twitter Profile Photo

When someone has “Do Not Disturb” on it’s like oh ok I didn’t realize the great philosopher was in their hour of seclusion pardon me for even daring to enter their precious mind palace

METR (@metr_evals)'s Twitter Profile Photo

We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers.

The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.
Gytis Daujotas (@gytdau)'s Twitter Profile Photo

Many AI problems and misconceptions would be instantly dissolved if users had to finish their message with "<end_of_turn><begin_turn>model" and hit Sample. We do users a disservice by hiding the magic in the background.
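
The point is easy to demo: a chat UI is just string assembly plus a sampler. A minimal sketch using the tweet's delimiter names (real models vary; Gemma, for instance, uses <start_of_turn>/<end_of_turn>):

def to_prompt(messages):
    """Flatten a chat into the raw string the model actually sees."""
    parts = []
    for role, text in messages:
        parts.append(f"<begin_turn>{role}\n{text}<end_of_turn>")
    # The hidden magic: the UI appends the model-turn header for you,
    # and "the assistant replying" is just sampling a continuation.
    parts.append("<begin_turn>model\n")
    return "".join(parts)

print(to_prompt([("user", "Why is the sky blue?")]))
# -> <begin_turn>user\nWhy is the sky blue?<end_of_turn><begin_turn>model\n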

Geoffrey Litt (@geoffreylitt)'s Twitter Profile Photo

"interaction is an essentially negative aspect of information software" One of the greatest hot takes ever. I revisit Magic Ink all the time, and you should absolutely read it if you're serious about design: worrydream.com/MagicInk/

"interaction is an essentially negative aspect of information software"

One of the greatest hot takes ever.

I revisit Magic Ink all the time, and you should absolutely read it if you're serious about design:

worrydream.com/MagicInk/