Junting Pan (@junting9) 's Twitter Profile
Junting Pan

@junting9

PhD @ MMlab CUHK. | Prev: Research Scientist Intern @AIatMeta (FAIR) and @samsungresearch. Working on Multimodal Learning and Video Understanding.

ID: 2243906599

Link: http://junting.github.io | Joined: 13-12-2013 13:19:30

129 Tweets

496 Followers

453 Following

Xin Eric Wang @ ICLR 2025 (@xwang_lk) 's Twitter Profile Photo

Muffin or Chihuahua in a multipanel image? Most people can tell, but GPT-4V struggles! Contrary to popular belief that only experts can outperform (Multimodal) LLMs, average humans often prove to be more intelligent. Our Multipanel VQA study reveals this gap, where human accuracy

Mengwei Ren (@mengweir) 's Twitter Profile Photo

Glad to share that our project Relightful Harmonization: Lighting-aware portrait background replacement has been accepted to #CVPR2024. 🧵

Project page: mengweiren.com/research/relig… 
Preprint: arxiv.org/abs/2312.06886
Xiaohua Zhai (@xiaohuazhai) 's Twitter Profile Photo

📢📢 I am looking for a student researcher to work with me and my colleagues at Google DeepMind Zürich on vision-language research. 

It will be a 100% 24 weeks onsite position in Switzerland. Reach out to me (xzhai@google.com) if interested. 

Bonus: amazing view🏔️👇
Yann LeCun (@ylecun) 's Twitter Profile Photo

🥁 Llama3 is out 🥁
8B and 70B models available today.
8k context length.
Trained with 15 trillion tokens on a custom-built 24k GPU cluster.
Great performance on various benchmarks, with Llama3-8B doing better than Llama2-70B in some cases.
More versions are coming over the next
Miguel Angel Bautista (@itsbautistam) 's Twitter Profile Photo

I am looking for strong PhD interns to join Apple MLR in late 2024 or early 2025! Topics will broadly be around training large-scale diffusion/flow-matching models, and you'll be in the Bay Area (Cupertino/SF). Apply here: jobs.apple.com/en-us/details/…. [1/5]

Mengwei Ren (@mengweir) 's Twitter Profile Photo

I am presenting at #AdobeMAX next week! Get a sneak peek at our latest research on image composition and relighting on Oct 15th at the MAX Sneaks session (5.30 to 7 pm EST). Online registration (free): max.adobe.com/max-online/

Haoxuan You (@xyouh) 's Twitter Profile Photo

Looking for a 2025 summer research intern for the Foundation Model Team at Apple AI/ML, with a focus on Multimodal LLMs / Vision-Language. PhD preferred. Apply through jobs.apple.com/en-us/details/… Also, email your resume to [email protected]! 😊

Nikhila Ravi (@nikhilaravi) 's Twitter Profile Photo

🌟Thrilled to share that SAM 2 was awarded a Best Paper Honourable Mention Award at #ICLR2025, one of 6 papers recognized out of 11000+ submissions! 

👏This project was the result of amazing work by an exceptional team at <a href="/AIatMeta/">AI at Meta</a> FAIR: <a href="/vgabeur/">Valentin Gabeur</a>, <a href="/YuanTingHu1/">Yuan-Ting Hu</a>, <a href="/RonghangHu/">Ronghang Hu</a>,
Phillip Isola (@phillip_isola) 's Twitter Profile Photo

Our computer vision textbook is now available for free online here: visionbook.mit.edu. We are working on adding some interactive components like search and (beta) integration with LLMs. Hope this is useful, and feel free to submit GitHub issues to help us improve the text!

Ruoming Pang (@ruomingpang) 's Twitter Profile Photo

In this report we describe the 2025 Apple Foundation Models ("AFM"). We also introduce the new Foundation Models framework, which gives app developers direct access to the on-device AFM model. machinelearning.apple.com/research/apple…

Alaa El-Nouby (@alaa_nouby) 's Twitter Profile Photo

Last year at Apple MLR, we published a number of interesting papers like AIM, AIMv2, and scaling laws for sparsity, native multimodal models, and data mixing. Today the team has open-sourced the training codebase we used for conducting this research! github.com/apple/ml-l3m

Junting Pan (@junting9) 's Twitter Profile Photo

The Foundation Model Team @🍎Apple AI/ML is looking for a Research Intern (flexible start date) to work on Multimodal LLMs and Vision-Language. Interested? DM me to learn more!

Saining Xie (@sainingxie) 's Twitter Profile Photo

Introducing Cambrian-S: it's a position, a dataset, a benchmark, and a model, but above all, it represents our first steps toward exploring spatial supersensing in video. 🧶