Edoardo Pona (@edoardopona)'s Twitter Profile
Edoardo Pona

@edoardopona

ID: 3028907819

Joined: 10-02-2015 22:16:51

96 Tweets

56 Followers

611 Following

Eric Todd (@ericwtodd)

LLMs represent words as vector embeddings. Do they represent *functions* as vectors too?

Yes! This has implications for how we think about “reasoning” in language models. New preprint w/ Millicent Li, Arnab Sen Sharma, Aaron Mueller, byron wallace, David Bau:
functions.baulab.info
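
A rough, hedged sketch of what a "function vector" experiment could look like under a plain reading of the tweet, not the authors' code: average a mid-layer hidden state over in-context examples of one task, then add that vector back at the same layer while running a new, zero-shot prompt. GPT-2, the layer index, and the prompts are arbitrary illustration choices.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tok = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()
LAYER = 8  # arbitrary mid-layer choice

# A handful of in-context prompts that all demonstrate one task
# (country -> capital); the candidate "function vector" is the mean
# residual-stream state at the last token of these prompts.
icl_prompts = [
    "France: Paris\nJapan: Tokyo\nItaly:",
    "Spain: Madrid\nGermany: Berlin\nEgypt:",
]

states = []
for p in icl_prompts:
    ids = tok(p, return_tensors="pt").input_ids
    with torch.no_grad():
        hs = model(ids, output_hidden_states=True).hidden_states
    states.append(hs[LAYER + 1][0, -1, :])  # output of block LAYER, last token
function_vector = torch.stack(states).mean(dim=0)

# Patch the vector into the same layer while running a zero-shot prompt.
def add_fv(module, inputs, output):
    hidden = output[0].clone()
    hidden[:, -1, :] += function_vector
    return (hidden,) + output[1:]

handle = model.transformer.h[LAYER].register_forward_hook(add_fv)
ids = tok("Poland:", return_tensors="pt").input_ids
with torch.no_grad():
    top_token = model(ids).logits[0, -1].argmax().item()
handle.remove()

print(tok.decode([top_token]))
```

If the vector really carries the task, the top completion should shift toward the task's answer (here, something like " Warsaw") relative to running the bare prompt without the hook.
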
Matt Shumer (@mattshumer_)

Here is a powerful Claude 3 prompt for engineers.

Use it to automatically refactor, comment, and improve your code:

---
<prompt_explanation>
You are a skilled software engineer with deep expertise in code refactoring and optimization across multiple programming languages. Your
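
As an aside, here is one way a prompt along these lines could be wired into the Anthropic Python SDK. The system prompt below is a placeholder continuation of the truncated text above, not Matt Shumer's full prompt, and the file path is made up for illustration.

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

# Placeholder system prompt continuing the idea above; NOT the original prompt.
system_prompt = (
    "You are a skilled software engineer with deep expertise in code "
    "refactoring and optimization across multiple programming languages. "
    "Refactor, comment, and improve the code the user provides, and explain "
    "each change briefly."
)

code_to_refactor = open("my_module.py").read()  # hypothetical input file

message = client.messages.create(
    model="claude-3-opus-20240229",
    max_tokens=4096,
    system=system_prompt,
    messages=[{"role": "user", "content": f"<code>\n{code_to_refactor}\n</code>"}],
)
print(message.content[0].text)
```
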
Linus (@thesephist)

By end of 2024, steering foundation models in latent/activation space will outperform steering in token space ("prompt eng") in several large production deployments. I felt skeptical about this in summer '23, felt vaguely positive in Jan, and now think it's more likely than not,
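
For context on what steering in activation space means in practice, here is a minimal, hedged sketch of one common recipe: add a contrastive activation direction at a chosen layer during generation. GPT-2, the layer, the scale, and the contrast prompts are placeholder choices, not a setup from the tweet.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tok = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()
LAYER, SCALE = 6, 4.0  # placeholder layer and steering strength

def last_token_state(text):
    ids = tok(text, return_tensors="pt").input_ids
    with torch.no_grad():
        hs = model(ids, output_hidden_states=True).hidden_states
    return hs[LAYER + 1][0, -1, :]

# Steering direction: difference of activations on a contrast pair of prompts.
steer = (last_token_state("I am feeling extremely happy and excited")
         - last_token_state("I am feeling extremely sad and gloomy"))

def add_steering(module, inputs, output):
    # Nudge every position's hidden state along the steering direction.
    return (output[0] + SCALE * steer,) + output[1:]

handle = model.transformer.h[LAYER].register_forward_hook(add_steering)
prompt = tok("Today I went to the park and", return_tensors="pt").input_ids
out = model.generate(prompt, max_new_tokens=20, do_sample=False)
handle.remove()
print(tok.decode(out[0]))
```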

Nicolas Yax (@nicolas__yax)

🚨New preprint🚨
Can LLM finetuning relationships be inferred only through model outputs?
We found that adapting phylogenetic algorithms 🧬 to language models 🤖 helps identify families of models and can even predict their performances, with Stefano Palminteri (@stepalminteri.bsky.social) and Pierre-Yves Oudeyer! 1/9
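
The thread does not spell out the algorithm, so the following is only a hedged sketch of the general recipe of inferring relatedness from outputs alone: fingerprint each model by its next-token distributions on shared probe prompts, then cluster the fingerprints. Model names and probes are arbitrary, and comparing raw distributions only makes sense across models that share a tokenizer.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from scipy.cluster.hierarchy import linkage, dendrogram
from scipy.spatial.distance import pdist
import matplotlib.pyplot as plt

# Arbitrary example models; all three share the GPT-2 tokenizer.
model_names = ["gpt2", "distilgpt2", "gpt2-medium"]
probes = ["The capital of France is", "2 + 2 =", "Once upon a time"]

def fingerprint(name):
    tok = AutoTokenizer.from_pretrained(name)
    model = AutoModelForCausalLM.from_pretrained(name).eval()
    parts = []
    for p in probes:
        ids = tok(p, return_tensors="pt").input_ids
        with torch.no_grad():
            logits = model(ids).logits[0, -1]
        parts.append(torch.softmax(logits, dim=-1))
    return torch.cat(parts).numpy()

fingerprints = [fingerprint(n) for n in model_names]

# Hierarchical clustering over pairwise cosine distances between fingerprints;
# the resulting tree is a (very crude) stand-in for a phylogeny.
Z = linkage(pdist(fingerprints, metric="cosine"), method="average")
dendrogram(Z, labels=model_names)
plt.show()
```
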
Richard Ngo (@richardmcngo)

Instead of analyzing whether AI takeoff will be “fast” or “slow”, I now prefer to think about the spectrum from concentrated takeoff (within one organization in one country) to distributed takeoff (involving many organizations and countries).

Dean W. Ball (@deanwball)

My basic reaction to AI today is, “jeez, o1 performs in the top 1% of humans at math, yet fails routinely at basic logic tasks. I guess intelligence is a high-dimensional space, and that probably means, like most high-dimensional things, it behaves counterintuitively.”

Peyman Milanfar (@docmilanfar)

Strange but true - A wobbly table on any reasonable floor can be made steady by just turning it.

Moral of the story: Before dining out, always ask if their floor is Lipschitz continuous.
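
For the curious, the reason turning works (and why the floor needs some continuity at all) is the standard intermediate value theorem argument; a very informal sketch:

```latex
% Informal sketch, not a careful proof. Square table, continuous floor.
% Rotate the table about its centre; keep three legs touching the floor and
% let g(\theta) be the signed height of the remaining leg above the floor.
% A symmetry argument shows g(0) and g(\pi/2) cannot both have the same
% strict sign (a quarter turn exchanges the roles of the legs), and a
% continuous floor makes g continuous, so the intermediate value theorem gives
\[
  \exists\,\theta^{*} \in \left[0, \tfrac{\pi}{2}\right]
  \quad\text{with}\quad g\!\left(\theta^{*}\right) = 0 ,
\]
% i.e. an orientation in which all four legs touch the ground.
```
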
Functor Fact (@functorfact)

'The purpose of abstraction is not to be vague, but to create a new semantic level in which one can be absolutely precise' - Edsger Dijkstra

Anthropic (@anthropicai)

New Anthropic research: Alignment faking in large language models.

In a series of experiments with Redwood Research, we found that Claude often pretends to have different views during training, while actually maintaining its original preferences.
wh (@nrehiew_)

This is living rent free in my head. It is not obvious to me why this works. Models should not have any meta-understanding of the data they were trained on - which is why we shouldn't trust the answer we get from asking “who are you”. It's a logical extension of why it doesn't make sense to

Nora Belrose (@norabelrose)

What are the chances you'd get a fully functional language model by randomly guessing the weights?

We crunched the numbers and here's the answer:
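
The thread does not say how the numbers were crunched, so nothing below is that method. It is a naive Monte Carlo framing on a toy model, mainly to illustrate why the direct approach cannot work: for anything realistic the probability is far too small to hit by sampling, which is presumably why a more careful estimate was needed.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Toy next-token task: token t is always followed by (t + 1) mod vocab,
# so a trained model can reach near-zero loss.
vocab, hidden = 16, 32
tokens = torch.arange(512) % vocab
x = F.one_hot(tokens[:-1], vocab).float()
y = tokens[1:]

def make_model():
    return nn.Sequential(nn.Linear(vocab, hidden), nn.ReLU(), nn.Linear(hidden, vocab))

loss_fn = nn.CrossEntropyLoss()

def loss_of(model):
    with torch.no_grad():
        return loss_fn(model(x), y).item()

# Train a reference model so "fully functional" has a concrete meaning here.
ref = make_model()
opt = torch.optim.Adam(ref.parameters(), lr=1e-2)
for _ in range(500):
    opt.zero_grad()
    loss_fn(ref(x), y).backward()
    opt.step()
target = loss_of(ref)

# Redraw all weights i.i.d. N(0, 1) and count how often the loss comes
# within 2x of the trained model's loss.
hits, trials = 0, 2000
for _ in range(trials):
    m = make_model()
    for p in m.parameters():
        nn.init.normal_(p)
    if loss_of(m) <= 2.0 * target:
        hits += 1

print(f"trained loss: {target:.4f}")
print(f"random-weight hits: {hits}/{trials}")  # ~0 even for this tiny model
```
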
Tomáš Daniš (@tmdanis)

Can humans reason? In this paper we show evidence many humans simply apply heuristics they've been exposed to over the course of their lives without deeper consideration. In conclusion, humans don't seem to reason and only copy reasoning patterns from their training data.

Alex Turner (@turn_trout)

I’m worried that “doom” speculation will make doom more likely. Specifically, AIs conform to our expectations of them, as communicated by their training data. This “self-fulfilling misalignment data” may be poisoning training already. 🧵