Yonglong Tian (@yonglongt) 's Twitter Profile
Yonglong Tian

@yonglongt

Research Scientist @OpenAI. Previously @GoogleDeepMind, @MIT. Opinions are my own.

ID: 1139739755510243328

Joined: 15-06-2019 03:41:47

91 Tweets

2.2K Followers

239 Following

Yonglong Tian (@yonglongt) 's Twitter Profile Photo

Today marks the official ending of my PhD life at MIT. So grateful for this journey.

Coincidentally, we put a paper on arXiv today: arxiv.org/abs/2306.00984. It shows the potential of learning from synthetic data.

This coincidence nicely concludes my PhD life in an academic manner.
Dilip Krishnan (@dilipkay) 's Twitter Profile Photo

New paper!! We show that pre-training language-image models *solely* on synthetic images from Stable Diffusion can outperform training on real images!!

Work done with Yonglong Tian (Google), Huiwen Chang (Google), Phillip Isola (MIT) and Lijie Fan (MIT)!!
Jing Shao (@amanda_jshao) 's Twitter Profile Photo

🎉(1/6) Exciting News: 🐑LAMM is online!

⭐️Features:
① 200k 2D/3D instruction-tuning dataset
② Benchmark on 14 high-level 2D/3D vision tasks
③ A preliminary but promising framework, trainable on only 4×A100s

📚Paper: arxiv.org/pdf/2306.06687…
⌨️Code: github.com/OpenLAMM/LAMM
Sangnie Bhardwaj (@sangnie) 's Twitter Profile Photo

Join us at the WiML Un-Workshop breakout session on "Role of Mentorship and Networking"! Don't miss the chance to talk with leading researchers Samy Bengio, Susan Zhang, Hugo Larochelle, Sharon Y. Li, Pablo Samuel Castro, John Langford, and others! #ICML2023 WiML

Tongzhou Wang (@ssnl_tz) 's Twitter Profile Photo

Quasimetric RL code is now on GitHub: github.com/quasimetric-le… Instead of deleting 80% of the dev repo, I rewrote the algorithm in a hopefully cleaner way. But going through the old repo is fun. So many half-explored interesting ideas in the remaining 80%. RL=geometry

Lijie Fan (@lijie_fan) 's Twitter Profile Photo

🚀 Is the future of vision models synthetic? Introducing SynCLR: our new pipeline leveraging LLMs & text-to-image models to train vision models with only synthetic data!
🔥 Outperforming SOTAs like DINOv2 & CLIP on real images! SynCLR excels in fine-grained classification &
Yonglong Tian (@yonglongt) 's Twitter Profile Photo

HNY! Excited to share SynCLR, which rivals CLIP and DINOv2 but uses purely synthetic data.

The interesting part: it can outperform models (e.g. CLIP) trained directly on LAION-2B, the dataset used to train SD 1.5, which in turn we used to generate our images.
arxiv.org/abs/2312.17742
Phillip Isola (@phillip_isola) 's Twitter Profile Photo

Our computer vision textbook is released!

Foundations of Computer Vision
with Antonio Torralba and Bill Freeman
mitpress.mit.edu/9780262048972/…

It’s been in the works for >10 years. Covers everything from linear filters and camera optics to diffusion models and radiance fields.

1/4
Jiawei Yang (@jiaweiyang118) 's Twitter Profile Photo

Very excited to get this out: “DVT: Denoising Vision Transformers”. We've identified and combated those annoying positional patterns in many ViTs. Our approach denoises them, achieving SOTA results and stunning visualizations! Learn more on our website: jiawei-yang.github.io/DenoisingViT/

Lijie Fan (@lijie_fan) 's Twitter Profile Photo

🚀 Excited to share our latest work Fluid!

We've developed a scalable autoregressive text-to-image model without VQ. We trained the model up to 10B parameters, achieving state-of-the-art COCO FID and GenEval scores. 🔥
Check it out: arxiv.org/pdf/2410.13863

🙏 Shout out to
Shobhita Sundaram (@shobsund) 's Twitter Profile Photo

Personal vision tasks, like detecting *your mug*, are hard; they're data-scarce and fine-grained.

In our new paper, we show you can adapt general-purpose vision models to these tasks from just three photos!

📝: arxiv.org/abs/2412.16156
💻: github.com/ssundaram21/pe…

(1/n)
Yonglong Tian (@yonglongt) 's Twitter Profile Photo

GPT-5 dropped!

For *multimodal*, the nice thing is that it uses tools far more efficiently than o3 (much better than the accuracy numbers shown here suggest), making it both better and faster. Ji Lin's efforts are baked in.