Vishal Patel (@vishalm_patel) Twitter Tweets • TwiCopy

Peyman Milanfar

9 months ago

We see the world in vivid detail in part because our visual perception is immersed in the broader context of a 3D world, language, and other cues. In our new paper we show that such broader context is also helpful with tasks in low-level vision such as image restoration 1/n

Likes: 154 · Replies: 1 · Reposts: 21

Kangfu Mei ✈️ ICLR'25 🇸🇬

@kangfum

9 months ago

Check out our latest work on multimodal super-resolution diffusion accepted by #CVPR2025 🔥🔥🔥! We show that using richer context in guiding image diffusion models can always improve the performance. Guidance like depth and edge is especially useful to enrich the language

Likes: 31 · Replies: 1 · Reposts: 10

Scanline VFX - Powered by Netflix

@scanline_vfx

8 months ago

Congratulations to the research team at our sister company Eyeline Studios on their latest research paper - “Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset” - which will be presented at #CVPR2025 in Nashville.

Likes: 18 · Replies: 0 · Reposts: 7

Vishal Patel

@vishalm_patel

8 months ago

Had the incredible opportunity to meet Prof. Takeo Kanade at JHU today! Such an honor to chat with a true legend in computer vision. Of course, I couldn’t miss the chance to get a signature on his seminal Lucas-Kanade paper! 📄✍️#ComputerVision JHU ECE JHU Computer Science

Likes: 75 · Replies: 1 · Reposts: 7

Bo Wang

@bowang87

8 months ago

Excited to announce the 2nd Workshop on Foundation Models for Medical Vision (FMV) at #CVPR2025! #CVPR2025 🌐 fmv-cvpr25workshop.github.io FMV brings together researchers pushing the boundaries of medical AGI. We are also proud to host an esteemed lineup of speakers: Dr. Jakob Nikolas

Likes: 97 · Replies: 0 · Reposts: 23

Vishal Patel

@vishalm_patel

8 months ago

Excited to present two papers at #ICLR2025 next week! Looking forward to sharing our work in Singapore! 🇸🇬. Kangfu Mei JHU ECE Johns Hopkins Data Science and AI Institute kfmei.com/Field-DiT/

Likes: 49 · Replies: 0 · Reposts: 8

Kartik Narayan

@kartiknarayan10

8 months ago

🥳🥳 Two papers accepted at FG 2025! "Improved Representation Learning for Unconstrained Face Recognition" w/ Nithin GK, Vishal Patel; "Investigating Social Biases in Multimodal LLMs" w/ Malsha Perera, Vishal Patel

Likes: 11 · Replies: 1 · Reposts: 2

Vishal Patel

@vishalm_patel

7 months ago

Honored to be speaking alongside other respected experts at the Biometrics Institute US Biometrics Seminar. We’ll be diving into US biometrics developments and the crucial topic of AI’s impact on vulnerabilities. Johns Hopkins Data Science and AI Institute JHU ECE JHU Computer Science Johns Hopkins Engineering Biometrics Institute

Likes: 25 · Replies: 1 · Reposts: 6

WACV

@wacv_official

6 months ago

The #WACV2026 Call for Papers is live at wacv.thecvf.com/Conferences/20……! First round paper registration is coming up on July 11th, with the submission deadline on July 18th (all deadlines are 23:59 AoE).

Likes: 42 · Replies: 0 · Reposts: 8

Vishal Patel

@vishalm_patel

6 months ago

💥 New paper: Think Before You Diffuse. Meet DiffPhy: LLM-guided, physics-aware video diffusion 🎥🧠🌍 SOTA on real-world motion & dynamics! 🔗 bwgzk-keke.github.io/DiffPhy/ JHU Computer Science Johns Hopkins Data Science and AI Institute Johns Hopkins Engineering Yiqun Mei #DiffusionModels #VideoGeneration

Likes: 121 · Replies: 2 · Reposts: 23

Vishal Patel

@vishalm_patel

6 months ago

🚨Excited to share that the JHU VIU Lab will be presenting the following papers at CVPR next week, including 3 Highlights! 🎉 Come stop by our posters and say hi — we’d love to connect! 👋 #CVPR2025 JHU Computer Science JHU ECE Johns Hopkins Data Science and AI Institute Johns Hopkins Engineering

Likes: 37 · Replies: 1 · Reposts: 9

Bo Wang

@bowang87

6 months ago

#CVPR2025 IS AROUND THE CORNER! #CVPR2025 Welcome to join our Medical Vision Foundation Model Workshop on June 11th, from 8:30 to 12:00 in Room 212! We are also proud to host an esteemed lineup of speakers: Dr. Jakob Nikolas Kather, Dr. Faisal Mahmood, Dr.

Likes: 19 · Replies: 2 · Reposts: 7

Johns Hopkins Data Science and AI Institute

@hopkinsdsai

6 months ago

Hopkins researchers including JHU ECE Tinoosh Mohsenin and Bloomberg Distinguished Professors Rama Chellappa are speaking at booth 1317 of the IEEE / CVF Computer Vision and Pattern Recognition Conference today! Come meet #HopkinsDSAI #CVPR2025

Likes: 30 · Replies: 1 · Reposts: 14

Jack (in SF) Langerman

@jacklangerman

6 months ago

STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models tldr: iteratively discover and mitigate adversarial prompts, then go back to original model and mitigate in parallel (with anchor concepts to reduce unwanted side effects)

Likes: 6 · Replies: 0 · Reposts: 2

Vishal Patel

@vishalm_patel

6 months ago

🎨 New work: Training-Free Stylized Abstraction. Generate stylized avatars (LEGO, South Park, dolls) from a single image! 💡 VLM-guided identity distillation 📊 StyleBench eval Johns Hopkins Data Science and AI Institute JHU ECE Bryan Juco Kartik Narayan Johns Hopkins Engineering 🔗 kartik-3004.github.io/TF-SA/

Likes: 21 · Replies: 0 · Reposts: 6

Kartik Narayan

@kartiknarayan10

5 months ago

#ICCV2025 🌺 FaceXFormer has been accepted to ICCV!

Likes: 12 · Replies: 0 · Reposts: 2

Vishal Patel

@vishalm_patel

5 months ago

🚀 Open Vision Reasoner (OVR): transferring linguistic cognitive behaviors to visual reasoning via large-scale multimodal RL. SOTA on MATH500 (95.3%), MathVision, and MathVerse. 💻 Code: github.com/Open-Reasoner-… 🌐 Project: weiyana.github.io/Open-Vision-Re… #LLM yana wei Johns Hopkins Engineering

Likes: 21 · Replies: 0 · Reposts: 5

Vishal Patel

@vishalm_patel

5 months ago

🪞 We'll present Perception in Reflection at ICML this week! We introduce RePer, a dual-model framework that improves visual understanding through reflection. Better captions, fewer hallucinations, stronger alignment. 📄 arxiv.org/pdf/2504.07165 #ICML2025 Yana Wei JHU Computer Science

Likes: 14 · Replies: 0 · Reposts: 2

Yana Wei

@yanawei_

5 months ago

🚀 Check out our ✈️ ICML 2025 work: Perception in Reflection! A reasonable perception paradigm for LVLMs should be iterative rather than single-pass. 💡 Key Ideas 👉 Builds a perception-feedback loop through a curated visual reflection dataset. 👉 Utilizes Reflective

Likes: 12 · Replies: 2 · Reposts: 3

Johns Hopkins Engineering

@hopkinsengineer

4 months ago

Think before you diffuse: DiffPhy from Vishal Patel and team delivers realistic physics in AI video generation by enlisting LLMs to reason about the physical context. Multimodal LLMs evaluate and fine-tune the model. GitHub page: bwgzk-keke.github.io/DiffPhy/

Likes: 9 · Replies: 0 · Reposts: 5