Haining Wang (@haining_wang_) 's Twitter Profile
Haining Wang

@haining_wang_

ID: 1028467732923146241

Link: http://hainingwang.org · Joined: 12-08-2018 02:26:30

91 Tweets

133 Followers

1.1K Following

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

# RLHF is just barely RL

Reinforcement Learning from Human Feedback (RLHF) is the third (and last) major stage of training an LLM, after pretraining and supervised finetuning (SFT). My rant on RLHF is that it is just barely RL, in a way that I think is not too widely
Michael Cohen (@michael05156007) 's Twitter Profile Photo

New paper! Over-optimization in RL is well-known, but it even occurs when KL(policy || base model) is constrained fairly tightly. Why? And can we fix it? 🧵

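For context, the KL constraint the tweet refers to is usually the standard KL-regularized policy optimization objective (a textbook formulation, not taken from the paper itself):

```latex
% Standard KL-constrained RL objective (illustrative):
\max_{\pi}\;
\mathbb{E}_{x \sim \mathcal{D},\; y \sim \pi(\cdot \mid x)}\!\left[ r(x, y) \right]
\quad \text{subject to} \quad
\mathrm{KL}\!\left( \pi(\cdot \mid x) \,\middle\|\, \pi_{\mathrm{base}}(\cdot \mid x) \right) \le \epsilon
```

Here $r$ is the reward model, $\pi$ the tuned policy, and $\pi_{\mathrm{base}}$ the base model; the tweet's point is that over-optimization of $r$ can occur even when $\epsilon$ is small.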
Ben Lee (@lee_bcg) 's Twitter Profile Photo

I’m recruiting Ph.D. students to join the Lab for Computing Cultural Heritage at the UW iSchool! If you’re interested in working with me and pursuing a Ph.D. at the intersection of AI, LIS, and the digital humanities, you can find more information here: bcglee.com/lcch.html

Haining Wang (@haining_wang_) 's Twitter Profile Photo

New paper alert! We simplify scholarly abstracts from a postgraduate 🧑‍🎓 to a high school 🧑‍🏫 reading level without sacrificing faithfulness or quality. All of this is done by RL-tuning a Gemma-2B. Check it out: arxiv.org/abs/2410.17088

Itai Yanai (@itaiyanai) 's Twitter Profile Photo

Science has a hypothesis-testing part, but it has an equally crucial ‘night science’ mode, where new ideas are improvised for the first time.

Science of Science (@mishateplitskiy) 's Twitter Profile Photo

People usually think replication attempts in science are rare. Journals don't publish replications, so scientists don't do them.

In reality there are countless replication attempts (and failures), it's just PhD students assume they did something wrong
journals.plos.org/plosone/articl…
Melanie Walsh (@mellymeldubs) 's Twitter Profile Photo

I'm recruiting a PhD student to join my group at the UW iSchool in 2025-26. If you like the mountains and interdisciplinary research that blends data and culture, this could be a good fit! PhD apps due Dec 2: ischool.uw.edu/programs/phd/a… More info about my group: melaniewalsh.org/mentorship

Yann LeCun (@ylecun) 's Twitter Profile Photo

I don't wanna say "I told you so", but I told you so. Quote: "Ilya Sutskever, co-founder of AI labs Safe Superintelligence (SSI) and OpenAI, told Reuters recently that results from scaling up pre-training - the phase of training an AI model that uses a vast amount of unlabeled

Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

LLMs learn smarter by replacing fixed residual connections with dynamic, self-adjusting neural pathways.

Hyper-connections, proposed in this paper, offer a flexible alternative to residual connections in neural networks.

Original Problem 🚧:

Residual connections in neural
Guilherme Penedo (@gui_penedo) 's Twitter Profile Photo

We've just updated 🍷FineWeb and 📚 FineWeb-Edu with data from all the remaining 2024 CommonCrawl dumps, covering up to December.

🍷FineWeb now has a little over 17 trillion tokens.

Fresh data = more useful models. We'll keep it coming.
Daniel Han (@danielhanchen) 's Twitter Profile Photo

Phi-4 bug fixes:
1. EOS should be <|im_end|> not <|endoftext|>
2. Pad token EOS should be <|dummy_87|>
3. Chat template shouldn't default add "assistant"

& Llama-fied Phi-4 & split QKV to increase accuracy for fine-tuning & made dynamic 4bit quants!

Details:
1. The EOS should
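The first two fixes above amount to editing the tokenizer's special-token configuration. A minimal sketch of what that change looks like, assuming a typical Hugging Face `tokenizer_config.json` layout (the token names come from the thread; this is an illustration, not the official patch):

```python
import json

# Hypothetical pre-fix config mirroring the bugs described in the thread.
config = {
    "eos_token": "<|endoftext|>",  # bug 1: generation won't stop at <|im_end|>
    "pad_token": "<|endoftext|>",  # bug 2: pad == EOS, which can mask EOS during fine-tuning
}

# Fix 1: EOS should be the chat-turn terminator <|im_end|>.
config["eos_token"] = "<|im_end|>"
# Fix 2: pad should be a distinct dummy token, not the EOS token.
config["pad_token"] = "<|dummy_87|>"

print(json.dumps(config, indent=2))
```

With the pad and EOS tokens distinct, padding positions can be masked out of the loss without also masking the end-of-sequence signal the model needs to learn.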
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

I wrote a quick new post on "Digital Hygiene".

Basically there are some no-brainer decisions you can make in your life to dramatically improve the privacy and security of your computing and this post goes over some of them. Blog post link in the reply, but copy pasting below
Anthropic (@anthropicai) 's Twitter Profile Photo

New Anthropic research: Tracing the thoughts of a large language model. We built a "microscope" to inspect what happens inside AI models and use it to understand Claude’s (often complex and surprising) internal mechanisms.

Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile Photo

AI2 presents

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Finds and shows verbatim matches between segments of language model output and documents in the training text corpora
A.I.Warper (@aiwarper) 's Twitter Profile Photo

ChatGPT prompted 74 times: "Create the exact replica of this image, do not change a thing."

This is why I say you need to start a new chat after each edit 🤣

Arvind Narayanan (@random_walker) 's Twitter Profile Photo

I tell students on the first day of class that if you're truly learning it's supposed to feel uncomfortable. The reason is that real learning is not simply the accumulation of facts; it is deep understanding, building mental models of the world, and other higher-level abilities.

Arvind Narayanan (@random_walker) 's Twitter Profile Photo

I find the story of AI and radiology fascinating. Of course, Hinton's prediction was wrong* and tech advances don't automatically and straightforwardly cause job replacement — that's not the interesting part.

Radiology has embraced AI enthusiastically, and the labor force is