Xiaoming Zhao (@xmzhao_) Twitter Tweets • TwiCopy

Philipp Henzler

a year ago

IllumiNeRF lets you relight objects in 3D. Instead of the classical inverse rendering approach — disentangling the object geometry, materials, and lighting — we use a relighting diffusion model to relight each input image and distill the relit samples into 3D by optimizing a

46 likes, 0 replies, 6 reposts

Noah Snavely

@jimantha

a year ago

This work led by Haian Jin is really nice. It takes text-to-image models and teases out their capability to light objects in a controllable way, much like Zero123 does for camera viewpoint. I'm really surprised that conditioning on environment maps can work this well!

29 likes, 1 reply, 5 reposts

Noah Snavely

@jimantha

a year ago

There is a lot happening on the lighting front these days! Another very nice recent paper on relighting is called IllumiNeRF. x.com/xmzhao_/status…

6 likes, 0 replies, 1 repost

Oncel Tuzel

@onceltuzel

a year ago

Our Machine Learning Research (MLR) team at #Apple is seeking a passionate AI resident to conduct research on multi-modal generative models (vision, 3D, language, audio) and to explore effective control mechanisms for these models. Application details: jobs.apple.com/en-us/details/…

317 likes, 3 replies, 40 reposts

Unnat Jain

@unnatjain2010

a year ago

Excited to share that I'll be joining University of California at Irvine as a CS faculty in '25!🌟 Faculty apps: Krishna Murthy, Zhuang Liu & I share our tips: unnat.github.io/notes/Hidden_C… PhD apps: I'm looking for students in vision, robot learning, & AI4Science. Details👇

391 likes, 38 replies, 70 reposts

Keunhong Park

@keunhongp

a year ago

look what we've been cooking at world labs. i'm really proud of the team -- this is the product of everyone's hard work and enthusiasm. we have a long way to go from here, but i'm excited about the future. if you'd like to build that future with us, we're hiring!

122 likes, 7 replies, 8 reposts

Philipp Henzler

@philipphenzler

a year ago

I will be at #NeurIPS2024 this week in Vancouver. Very excited to discuss the future of generative video models and 3D. We will also be presenting illuminerf.github.io and cat3d.github.io. Looking forward to meeting everyone. Please reach out if you’d like to meet

42 likes, 0 replies, 5 reposts

Xiaoming Zhao

@xmzhao_

a year ago

IllumiNeRF is #NeurIPS2024 bound! 🌟 Updated ArXiv with more results & discussions. Catch Philipp Henzler at #NeurIPS to learn more! Paper: arxiv.org/abs/2406.06527 Webpage: illuminerf.github.io Location: East Exhibit Hall A-C #1404 Time: Fri 13 Dec 11 a.m. PST — 2 p.m. PST

11 likes, 5 replies, 0 reposts

Alex Schwing

@alexschwing

a year ago

Congrats to Rex Cheng for impressive work on generating audio from video and/or text.

3 likes, 0 replies, 1 repost

Hadi Pouransari

@hpouransari

a year ago

What matters for runtime optimization in Vision Language Models (VLMs)? Vision encoder latency 🤔? Image resolution 🤔? Number of visual tokens 🤔? LLM size 🤔? In this thread, we break it all down and introduce FastVLM — a family of fast and accurate VLMs. (1/n 🧵)

77 likes, 2 replies, 21 reposts

Shuangfei Zhai

@zhaisf

10 months ago

We are looking for a summer research intern to work on improving TarFlow at Apple. You will be working with myself and a great group of researchers, Jiatao Gu, Preetum Nakkiran, David Berthelot, etc. If interested, send your CV to szhai at apple.com by this week.

33 likes, 0 replies, 10 reposts

Miguel Angel Bautista

@itsbautistam

9 months ago

We are looking for an intern to join MLR at Apple ASAP. You will be working with me, Yuyang Wang and other scientists in the team on domain-agnostic generative modeling at scale. If you are interested please reach out to mbautistamartin at apple by the end of this week.

65 likes, 1 reply, 9 reposts

Pavankumar Vasu

@pavankumarvasu

7 months ago

Excited to share code & models for FastVLM — our blazing-fast Vision-Language Model appearing at #CVPR2025 Run it on-device with inference code optimized for Apple Silicon using #mlx. Code: github.com/apple/ml-fastv… Updated paper & results coming soon. Stay tuned! 👀

196 likes, 11 replies, 48 reposts

Miguel Angel Bautista

@itsbautistam

5 months ago

We have an open position at Apple MLR to work on scalable and efficient generative models that perform across diverse data domains, including images, 3D, video, graphs, etc. We care deeply about simplifying modeling pipelines and developing powerful and scalable training recipes.

65 likes, 2 replies, 14 reposts

Oncel Tuzel

@onceltuzel

5 months ago

Come work with us! The Machine Learning Research (MLR) team at Apple is seeking a passionate AI researcher to work on Efficient ML algorithms: jobs.apple.com/en-us/details/…

208 likes, 5 replies, 28 reposts

Fartash Faghri

@fartashfg

5 months ago

Is your AI keeping up with the world? Announcing #NeurIPS2025 CCFM Workshop: Continual and Compatible Foundation Model Updates When/Where: Dec. 6-7 San Diego Submission deadline: Aug. 22, 2025 (opening soon!) sites.google.com/view/ccfm-neur… #FoundationModels #ContinualLearning

31 likes, 1 reply, 10 reposts

Hadi Pouransari

@hpouransari

5 months ago

🌟Explore key insights from the FastVLM project (real-time vision-language model) in this blog post: machinelearning.apple.com/research/fast-…

219 likes, 5 replies, 38 reposts

Awni Hannun

@awnihannun

4 months ago

The latest MLX has a CUDA back-end! To get started: pip install "mlx[cuda]" With the same codebase you can develop locally, run your model on Apple silicon, or in the cloud on Nvidia GPUs. MLX is designed around Apple silicon - which has a unified memory architecture. It uses

407 likes, 25 replies, 65 reposts

Fartash Faghri

@fartashfg

3 months ago

🚀🤗FastVLM models are now on HuggingFace! Enabling real-time VLM applications, up to 85x faster than prior work and 3.4x smaller. huggingface.co/collections/ap… Check out a cool demo on HuggingFace: huggingface.co/spaces/apple/f… Huge thanks to the amazing folks at HuggingFace! #Apple MLR

91 likes, 5 replies, 22 reposts

Xenova

@xenovacom

3 months ago

NEW: Apple releases FastVLM and MobileCLIP2 on Hugging Face! 🤗 The models are up to 85x faster and 3.4x smaller than previous work, enabling real-time VLM applications! 🤯 It can even do live video captioning 100% locally in your browser (zero install). Huge for accessibility!

1.1K likes, 35 replies, 224 reposts