Haochen Zhang (@jhaochenz)'s Twitter Profile
Haochen Zhang

@jhaochenz

Member of Technical Staff @AnthropicAI | prev. CS PhD @StanfordAILab

ID: 1704090745

Website: https://cs.stanford.edu/~jhaochen/ | Joined: 27-08-2013 08:06:18

26 Tweets

743 Followers

259 Following

Haochen Zhang (@jhaochenz)'s Twitter Profile Photo

Glad to share this new work! We theoretically show that parameter-dependent noise provides a bias toward parameters where the noise itself is small. This is a potentially stronger effect than simply escaping sharp local minima. Code for our project is publicly available!
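
As a rough illustration of the effect described in the tweet (my own toy construction, not taken from the paper): on a perfectly flat loss there is no gradient signal at all, yet noise whose scale depends on the parameter still drives the iterates toward the region where that noise is small.

```python
import numpy as np

# Toy illustration (not from the paper): "gradient" descent on a perfectly flat loss,
# perturbed by noise whose standard deviation is proportional to |theta|.
# The expectation E[theta] is conserved, but the typical trajectory collapses
# toward theta = 0 -- the point where the noise itself vanishes.
rng = np.random.default_rng(0)

eta, steps, n_runs = 0.1, 5000, 200
theta = np.ones(n_runs)                 # all runs start at theta = 1

for _ in range(steps):
    xi = rng.standard_normal(n_runs)
    theta = theta - eta * np.abs(theta) * xi   # zero gradient + parameter-dependent noise

print("median |theta| after training:", np.median(np.abs(theta)))  # ~0: biased toward low noise
```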

Roger Grosse (@rogergrosse)'s Twitter Profile Photo

With the right regularizer, linear autoencoders recover the ordered principal components, but very slowly. Interesting model system for representation learning since we know the optimal representation. New work w/ Jenny (Xuchan) Bao, James Lucas, and Sushant Sachdeva. arxiv.org/abs/2007.06731

Tengyu Ma (@tengyuma)'s Twitter Profile Photo

Why does contrastive learning magically produce linearly separable features? We leverage spectral graph theory to analyze it under realistic settings. (In contrast, many prior works require that positive pairs are independent conditioned on the label.) arxiv.org/abs/2106.04156
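
For readers curious what objective the analysis studies, the paper's "spectral contrastive loss" has a simple population form, roughly -2·E[f(x)ᵀf(x⁺)] + E[(f(x)ᵀf(x'))²]. Below is a minimal, unofficial PyTorch sketch of a batch estimator of that loss; the official implementation may differ in details such as how negatives are drawn within the batch.

```python
import torch

def spectral_contrastive_loss(z1: torch.Tensor, z2: torch.Tensor) -> torch.Tensor:
    # z1, z2: [batch, dim] embeddings of two augmented views of the same inputs.
    # Batch estimate of  -2 E[f(x)^T f(x+)]  +  E[(f(x)^T f(x'))^2].
    pos = -2.0 * (z1 * z2).sum(dim=-1).mean()   # attract positive pairs
    gram = z1 @ z2.T                            # inner products between all pairs in the batch
    neg = (gram ** 2).mean()                    # push pairs toward small inner products
    return pos + neg

# toy usage: random embeddings standing in for an encoder's outputs
z1, z2 = torch.randn(256, 128), torch.randn(256, 128)
print(spectral_contrastive_loss(z1, z2))
```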

Tengyu Ma (@tengyuma)'s Twitter Profile Photo

Thinking of applying self-supervised learning (SSL) on your uncurated, imbalanced datasets? Good news: we found SSL representations are more robust to long-tailed data than supervised representations. We also present theoretical and empirical analyses and an improved algorithm. arxiv.org/abs/2110.05025

Tengyu Ma (@tengyuma)'s Twitter Profile Photo

Pretraining is ≈SoTA for domain adaptation: just do contrastive learning on *all* unlabeled data + finetune on source labels. Features are NOT domain-invariant, but disentangle class & domain info to enable transfer. Theory & exps: arxiv.org/abs/2204.00570 arxiv.org/abs/2204.02683
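
A minimal sketch of that recipe (my own simplification, not the authors' code, using a generic InfoNCE objective and random tensors as stand-ins for real data): contrastive pretraining on the pooled unlabeled data from both domains, then fitting a classification head on labeled source examples only and reusing it on the target domain.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def info_nce(z1, z2, temperature=0.1):
    # generic contrastive objective over a batch of positive pairs
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    logits = z1 @ z2.T / temperature
    return F.cross_entropy(logits, torch.arange(len(z1)))

encoder = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 16))
opt = torch.optim.Adam(encoder.parameters(), lr=1e-3)

# Step 1: contrastive pretraining on *all* unlabeled data (source + target pooled together)
for _ in range(200):
    x = torch.randn(256, 32)                 # unlabeled batch mixing both domains
    v1 = x + 0.1 * torch.randn_like(x)       # two "augmented" views of each example
    v2 = x + 0.1 * torch.randn_like(x)
    loss = info_nce(encoder(v1), encoder(v2))
    opt.zero_grad(); loss.backward(); opt.step()

# Step 2: finetune (here, only a linear head) on labeled *source* data
head = nn.Linear(16, 10)
head_opt = torch.optim.Adam(head.parameters(), lr=1e-2)
xs, ys = torch.randn(512, 32), torch.randint(0, 10, (512,))   # labeled source set
for _ in range(200):
    loss = F.cross_entropy(head(encoder(xs).detach()), ys)
    head_opt.zero_grad(); loss.backward(); head_opt.step()
```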

Stanford AI Lab (@stanfordailab)'s Twitter Profile Photo

Curious about why contrastive learning produces representations useful for downstream tasks? Check out Haochen Zhang, Colin Wei, and Tengyu Ma's theoretical explanation in our latest blog post: ai.stanford.edu/blog/understan… Based on the NeurIPS 2021 oral paper: arxiv.org/abs/2106.04156

Haochen Zhang (@jhaochenz)'s Twitter Profile Photo

How does model architecture influence contrastive representations? Check out our paper "A theoretical study of inductive biases in contrastive learning" at #ICLR2023. Virtual poster: iclr.cc/virtual/2023/p…. Joint work with Tengyu Ma.

Tengyu Ma (@tengyuma)'s Twitter Profile Photo

📢 Introducing Voyage AI (@Voyage_AI_)! Founded by a talented team of leading AI researchers and me 🚀🚀. We build state-of-the-art embedding models (e.g., better than OpenAI 😜). We also offer custom models that deliver 🎯 +10-20% accuracy gains in your LLM products. 🧵

Alex Albert (@alexalbert__)'s Twitter Profile Photo

Artifacts pro tip: If you are running into unsupported library errors with NPM modules, just ask Claude to use the cdnjs link instead and it should work just fine.

Mike Krieger (@mikeyk)'s Twitter Profile Photo

New today: organize your claude[dot]ai chats in Projects, and add files, context, and 💫 custom instructions 💫 that are shared across all project chats. And on the Claude Team plan, you can discover great uses of Claude within your team using a new activity feed & shared chats.

Sara Price (@sprice354_)'s Twitter Profile Photo

We've made Claude Opus 4 and Claude Sonnet 4 significantly better at avoiding reward hacking behaviors (like hard-coding and special-casing in code settings) that we frequently saw in Claude Sonnet 3.7.

Haochen Zhang (@jhaochenz)'s Twitter Profile Photo

Glad to see what we've been working on finally get out! Lots of cool ideas and hard work have gone into this model, and there are still so many open research questions ahead around solving alignment and scaling up RL. Looking forward to what we'll learn next.