Jaihoon Kim (@kimjaihoon) 's Twitter Profile
Jaihoon Kim

@kimjaihoon

Phd Student @ KAIST

ID: 1667049509128671232

calendar_today09-06-2023 06:02:39

67 Tweet

61 Followers

98 Following

TuringPost (@theturingpost) 's Twitter Profile Photo

Inference-time scaling can work for flow models KAIST AI proposed 3 key ideas to make it possible: • SDE-based generation – Adding controlled randomness allows flow models to explore more outputs, like diffusion models do. • VP interpolant conversion – Guides the model from

Inference-time scaling can work for flow models

<a href="/kaist_ai/">KAIST AI</a> proposed 3 key ideas to make it possible:

• SDE-based generation – Adding controlled randomness allows flow models to explore more outputs, like diffusion models do.

• VP interpolant conversion – Guides the model from
Minhyuk Sung (@minhyuksung) 's Twitter Profile Photo

Unconditional Priors Matter! The key to improving CFG-based "conditional" generation in diffusion models actually lies in the quality of their "unconditional" prior. Replace it with a better one to improve conditional generation! 🌐 unconditional-priors-matter.github.io

Yunhong Min (@myh4832) 's Twitter Profile Photo

🔥 Grounding 3D Orientation in Text-to-Image 🔥 🎯 We present ORIGEN — the first zero-shot method for accurate 3D orientation grounding in text-to-image generation! 📄 Paper: arxiv.org/abs/2503.22194 🌐 Project: origen2025.github.io

🔥 Grounding 3D Orientation in Text-to-Image 🔥
🎯 We present ORIGEN — the first zero-shot method for accurate 3D orientation grounding in text-to-image generation!

📄 Paper: arxiv.org/abs/2503.22194
🌐 Project: origen2025.github.io
Jaihoon Kim (@kimjaihoon) 's Twitter Profile Photo

🚀 Check out our inference-time scaling with FLUX. GPT-4o struggles to follow user prompts involving compositional logical relations. Our inference-time scaling enables efficient search to generate samples with precise alignment to the input text. 🔗 flow-inference-time-scaling.github.io

Minhyuk Sung (@minhyuksung) 's Twitter Profile Photo

Introducing ORIGEN: the first orientation-grounding method for image generation with multiple open-vocabulary objects. It’s a novel zero-shot, reward-guided approach using Langevin dynamics, built on a one-step generative model like Flux-schnell. Project: origen2025.github.io

Jaihoon Kim (@kimjaihoon) 's Twitter Profile Photo

🔥 KAIST Visual AI Group is hiring interns for 2025 Summer. ❓Can non-KAIST students apply? Yes! ❓Can international students who are not enrolled in any Korean institutions apply? Yes! More info at 🔗 visualai.kaist.ac.kr

Jaihoon Kim (@kimjaihoon) 's Twitter Profile Photo

How can VLM reason in arbitrary perspectives? 🔥 Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation proposes a framework that enables spatial reasoning of VLM from arbitrary perspectives

Jaihoon Kim (@kimjaihoon) 's Twitter Profile Photo

🇸🇬 Attending #ICLR2025 ? Check out how we extend pretrained diffusion models to generate images in arbitrary spaces. 📌: Hall 3 + Hall 2B #103 📅: 10AM-12:30PM

🇸🇬 Attending #ICLR2025 ?

Check out how we extend pretrained diffusion models to generate images in arbitrary spaces.

📌: Hall 3 + Hall 2B #103
📅: 10AM-12:30PM
Minhyuk Sung (@minhyuksung) 's Twitter Profile Photo

#ICLR2025 Come join our StochSync poster (#103) this morning! We introduce a method that combines the best parts of Score Distillation Sampling and Diffusion Synchronization to generate high-quality and consistent panoramas and mesh textures. stochsync.github.io

Phillip (Yuseung) Lee (@yuseungleee) 's Twitter Profile Photo

❗️Vision-Language Models (VLMs) struggle with even basic perspective changes! ✏️ In our new preprint, we aim to extend the spatial reasoning capabilities of VLMs to ⭐️arbitrary⭐️ perspectives. 📄Paper: arxiv.org/abs/2504.17207 🔗Project: apc-vlm.github.io 🧵[1/N]

❗️Vision-Language Models (VLMs) struggle with even basic perspective changes!

✏️ In our new preprint, we aim to extend the spatial reasoning capabilities of VLMs to ⭐️arbitrary⭐️ perspectives.

📄Paper: arxiv.org/abs/2504.17207
🔗Project: apc-vlm.github.io

🧵[1/N]
Minhyuk Sung (@minhyuksung) 's Twitter Profile Photo

I recently presented our work, “Inference-Time Guided Generation with Diffusion and Flow Models,” at HKUST (CVM 2025 keynote) and NTU (MMLab), covering three classes of guidance methods for diffusion models and their extensions to flow models. Slides: onedrive.live.com/?redeem=aHR0cH…

I recently presented our work, “Inference-Time Guided Generation with Diffusion and Flow Models,” at HKUST (CVM 2025 keynote) and NTU (MMLab), covering three classes of guidance methods for diffusion models and their extensions to flow models.

Slides: onedrive.live.com/?redeem=aHR0cH…
Jaihoon Kim (@kimjaihoon) 's Twitter Profile Photo

📈 Can pretrained flow models generate images from complex compositional prompts—including logical relations and quantities—without further fine-tuning? 🚀 We have released our code for inference-time scaling for flow models: github.com/KAIST-Visual-A…

Jaihoon Kim (@kimjaihoon) 's Twitter Profile Photo

🧐 Can we define a better initial prior for Sequential Monte Carlo in reward alignment? That's exactly what Ψ-Sampler 🔱 does. Check out the paper for details: 📌 arxiv.org/abs/2506.01320

Jaihoon Kim (@kimjaihoon) 's Twitter Profile Photo

📢 Excited to share that our paper "Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing" has been accepted to #NeurIPS 2025 🔗 arxiv.org/pdf/2510.06046 📌 flow-inference-time-scaling.github.io

Phillip (Yuseung) Lee (@yuseungleee) 's Twitter Profile Photo

🌴Happy to attend #ICCV2025 in Hawaii! I’ll be presenting our paper on enabling VLMs to perform spatial reasoning from arbitrary perspectives. 📔 Paper: arxiv.org/abs/2504.17207 🖥️ Project Page: apc-vlm.github.io ✔️ Poster: Oct 21 (Tue) Session 2 & Exhibit Hall, #858

🌴Happy to attend #ICCV2025 in Hawaii!

I’ll be presenting our paper on enabling VLMs to perform spatial reasoning from arbitrary perspectives.

📔 Paper: arxiv.org/abs/2504.17207
🖥️ Project Page: apc-vlm.github.io
✔️ Poster: Oct 21 (Tue) Session 2 &amp; Exhibit Hall, #858
Jaihoon Kim (@kimjaihoon) 's Twitter Profile Photo

Headed to #NeurIPS2025 in San Diego (Dec 1-8)! 🧠 I'll be presenting a couple of posters on generative models. Currently looking for Research Internship opportunities in Generative AI. Let's connect for a chat or coffee ☕️ Please DM me.

Kunho Kim (@kunho_kim_) 's Twitter Profile Photo

We present GOATex: Geometry & Occlusion-Aware Texturing in NeurIPS 2025. - Project Page: goatex3d.github.io - Paper: arxiv.org/abs/2511.23051