Hu Xu (@hu_hsu)'s Twitter Profile
Hu Xu

@hu_hsu

Facebook AI Research, MetaCLIP, Data Research

ID: 2874803443

Link: https://howardhsu.github.io/ · Joined: 24-10-2014 07:43:07

157 Tweets

510 Followers

596 Following

Hu Xu (@hu_hsu)

Great to see the MetaCLIP algorithm (arxiv.org/abs/2309.16671) desaturate the SSL training distribution as SSL 2.0. What's next in SSL or pre-training? From our data research perspective, it's likely about how to automatically desaturate a training distribution.

Hu Xu (@hu_hsu)

Heading to #ICML2025 (first time). Excited to meet new friends and old friends and chat about foundational data research and co-design with training (MetaCLIP), SelfCite arxiv.org/abs/2502.09604 with Yung-Sung Chuang, and LongVU arxiv.org/abs/2410.17434 with Xiaoqian Shen.

Hu Xu (@hu_hsu)

Thanks for the invited talk; happy to share our industrial insights on "scaling data alignment" from MetaCLIP (its wide adoption and what's next) at the DataWorld workshop at #ICML2025. Happy to chat offline about data research.

Zeyuan Allen-Zhu, Sc.D. (@zeyuanallenzhu)

Phase 1 of Physics of Language Models code release
✅ Our Part 3.1 + 4.1 = all you need to pretrain a strong 8B base model in 42k GPU-hours
✅ Canon layers = strong, scalable gains
✅ Real open source (data/train/weights)
✅ Apache 2.0 license (commercial OK!)
🔗 github.com/facebookresear…

Hu Xu (@hu_hsu)

Great to see an intern project grow into a major project that is making a broad impact. Thanks for the hard work along the way.

Hu Xu (@hu_hsu)

As LLM research transitions into large-scale production and intense competition, momentum in areas less directly related to LLMs (like CLIP) has slowed and lost focus. We hope these fields can endure and prove essential for long-term scientific progress.

Hu Xu (@hu_hsu)

Genie 3 by Google DeepMind looks impressive. Extending Sora/Veo-style text-to-video generation with multi-round 'camera prompts' is an exciting direction. I believe the action space in world models goes far beyond human interaction through camera prompts—it should encompass much

Tim Rocktäschel (@_rockt)

"We don't just passively perceive the world; we actively generate it. The real world drives our perceptions, but the brain is always making best guesses. Perception is a controlled hallucination, constrained by sensory signals from the outside world" — Anil Seth

Hu Xu (@hu_hsu)

Truly appreciate the authors of Molmo (from Ai2 and the University of Washington) for promoting open research and adopting MetaCLIP. There are many forms of openness today, such as open APIs, open weights, and open source for reproducibility. I view MetaCLIP and Molmo's research
