Vishal Patel (@vishalm_patel) 's Twitter Profile
Vishal Patel

@vishalm_patel

Associate Professor @JohnsHopkins working on computer vision, biometrics, and medical imaging.

ID: 1582763678839066624

Link: https://engineering.jhu.edu/vpatel36/

Joined: 19-10-2022 16:01:24

163 Tweets

563 Followers

258 Following

Peyman Milanfar (@docmilanfar) 's Twitter Profile Photo

We see the world in vivid detail in part because our visual perception is immersed in the broader context of a 3D world, language, and other cues. In our new paper we show that such broader context is also helpful with tasks in low-level vision such as image restoration 1/n

Kangfu Mei ✈️ ICLR'25 🇸🇬 (@kangfum) 's Twitter Profile Photo

Check out our latest work on multimodal super-resolution diffusion accepted by #CVPR2025 🔥🔥🔥! We show that using richer context to guide image diffusion models can always improve performance. Guidance like depth and edge is especially useful to enrich the language

Scanline VFX - Powered by Netflix (@scanline_vfx) 's Twitter Profile Photo

Congratulations to the research team at our sister company Eyeline Studios on their latest research paper - “Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset” - which will be presented at #CVPR2025 in Nashville.

Vishal Patel (@vishalm_patel) 's Twitter Profile Photo

Had the incredible opportunity to meet Prof. Takeo Kanade at JHU today! Such an honor to chat with a true legend in computer vision. Of course, I couldn’t miss the chance to get a signature on his seminal Lucas-Kanade paper! 📄✍️#ComputerVision JHU ECE JHU Computer Science

Bo Wang (@bowang87) 's Twitter Profile Photo

Excited to announce the 2nd Workshop on Foundation Models for Medical Vision (FMV) at #CVPR2025! #CVPR2025 🌐 fmv-cvpr25workshop.github.io FMV brings together researchers pushing the boundaries of medical AGI. We are also proud to host an esteemed lineup of speakers: Dr. Jakob Nikolas

Kartik Narayan (@kartiknarayan10) 's Twitter Profile Photo

🥳🥳 Two papers accepted at FG 2025!
Improved Representation Learning for Unconstrained Face Recognition w/ Nithin GK, Vishal Patel
Investigating Social Biases in Multimodal LLMs w/ Malsha Perera, Vishal Patel

Vishal Patel (@vishalm_patel) 's Twitter Profile Photo

Honored to be speaking alongside other respected experts at the Biometrics Institute US Biometrics Seminar. We’ll be diving into US biometrics developments and the crucial topic of AI’s impact on vulnerabilities. Johns Hopkins Data Science and AI Institute JHU ECE JHU Computer Science Johns Hopkins Engineering Biometrics Institute

WACV (@wacv_official) 's Twitter Profile Photo

The #WACV2026 Call for Papers is live at wacv.thecvf.com/Conferences/20……! First round paper registration is coming up on July 11th, with the submission deadline on July 18th (all deadlines are 23:59 AoE).

Vishal Patel (@vishalm_patel) 's Twitter Profile Photo

💥 New paper: Think Before You Diffuse
Meet DiffPhy: LLM-guided, physics-aware video diffusion 🎥🧠🌍
SOTA on real-world motion & dynamics!
🔗 bwgzk-keke.github.io/DiffPhy/
JHU Computer Science Johns Hopkins Data Science and AI Institute Johns Hopkins Engineering Yiqun Mei #DiffusionModels #VideoGeneration

Vishal Patel (@vishalm_patel) 's Twitter Profile Photo

🚨Excited to share that the JHU VIU Lab will be presenting the following papers at CVPR next week, including 3 Highlights! 🎉 Come stop by our posters and say hi — we’d love to connect! 👋 #CVPR2025 JHU Computer Science JHU ECE Johns Hopkins Data Science and AI Institute Johns Hopkins Engineering

Bo Wang (@bowang87) 's Twitter Profile Photo

#CVPR2025 IS AROUND THE CORNER! #CVPR2025 Join our Medical Vision Foundation Model Workshop on June 11th, from 8:30 to 12:00 in Room 212! We are also proud to host an esteemed lineup of speakers: Dr. Jakob Nikolas Kather, Dr. Faisal Mahmood, Dr.

Johns Hopkins Data Science and AI Institute (@hopkinsdsai) 's Twitter Profile Photo

Hopkins researchers including JHU ECE Tinoosh Mohsenin and Bloomberg Distinguished Professors Rama Chellappa are speaking at booth 1317 of the IEEE / CVF Computer Vision and Pattern Recognition Conference today! Come meet #HopkinsDSAI #CVPR2025

Jack (in SF) Langerman (@jacklangerman) 's Twitter Profile Photo

STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models tldr: iteratively discover and mitigate adversarial prompts, then go back to original model and mitigate in parallel (with anchor concepts to reduce unwanted side effects)

Vishal Patel (@vishalm_patel) 's Twitter Profile Photo

🎨 New work: Training-Free Stylized Abstraction. Generate stylized avatars (LEGO, South Park, dolls) from a single image! 💡 VLM-guided identity distillation 📊 StyleBench eval Johns Hopkins Data Science and AI Institute JHU ECE Bryan Juco Kartik Narayan Johns Hopkins Engineering 🔗 kartik-3004.github.io/TF-SA/

Vishal Patel (@vishalm_patel) 's Twitter Profile Photo

🚀 Open Vision Reasoner (OVR) Transferring linguistic cognitive behaviors to visual reasoning via large-scale multimodal RL. SOTA on MATH500 (95.3%), MathVision, and MathVerse. 💻 Code: github.com/Open-Reasoner-… 🌐 Project: weiyana.github.io/Open-Vision-Re… #LLM yana wei Johns Hopkins Engineering

Vishal Patel (@vishalm_patel) 's Twitter Profile Photo

🪞 We'll present Perception in Reflection at ICML this week! We introduce RePer, a dual-model framework that improves visual understanding through reflection. Better captions, fewer hallucinations, stronger alignment. 📄 arxiv.org/pdf/2504.07165 #ICML2025 Yana Wei JHU Computer Science

Yana Wei (@yanawei_) 's Twitter Profile Photo

🚀 Check out our ✈️ ICML 2025 work: Perception in Reflection! A reasonable perception paradigm for LVLMs should be iterative rather than a single-pass. 💡 Key Ideas 👉 Builds a perception-feedback loop through a curated visual reflection dataset. 👉 Utilizes Reflective

Johns Hopkins Engineering (@hopkinsengineer) 's Twitter Profile Photo

Think before you diffuse: DiffPhy from Vishal Patel and team delivers realistic physics in AI video generation by enlisting LLMs to reason about the physical context. Multimodal LLMs evaluate and fine-tune the model. GitHub page: bwgzk-keke.github.io/DiffPhy/