Xijun Wang (@xijunwang_cs) Twitter Tweets • TwiCopy

There must be a reason for getting fat. It’s not surprising that two of us who sneaked food met in the midnight 🤔 Seeing your chubby back, I silently poured a glass of water to dispel the idea of eating night food😎

thumb_up_off_alt7

chat_bubble_outline1

repeat0

shareShare

Xijun Wang

@xijunwang_cs

2 years ago

Big thanks to the leading author Tianrui Guan (On Job Market) and all the coauthors!

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

Xijun Wang

@xijunwang_cs

2 years ago

How to strengthen both CNNs and Transformers? Check our work “SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers” on Oct. 2nd (1:30pm-6pm) at #ICCV2023 Workshop on New Ideas in Vision Transformers A P01 ArXiv: arxiv.org/abs/2308.07110

thumb_up_off_alt12

chat_bubble_outline0

repeat1

shareShare

Fuxiao Liu

@fuxiaol

2 years ago

🔥CoBig shoutout to Google for launching their groundbreaking large multimodal model #Gemini. 🚩However, there are still obvious hallucinations with Gemini. Here are a few examples with Gemini Pro. Want to learn more? Look at our HallusionBench Paper: arxiv.org/pdf/2310.14566…

🔥CoBig shoutout to <a href="/Google/">Google</a> for launching their groundbreaking large multimodal model #Gemini.
🚩However, there are still obvious hallucinations with Gemini. Here are a few examples with Gemini Pro.

Want to learn more? Look at our HallusionBench Paper: arxiv.org/pdf/2310.14566…

thumb_up_off_alt33

chat_bubble_outline2

repeat8

shareShare

Tianrui Guan (On Job Market)

@terryguan97

2 years ago

📢 Sharing a couple of interesting observations and results of #GeminiAI Pro Vision on our #HallusionBench (Big thanks to Google DeepMind for making those API available! 🙏): 1. Language hallucination is a significant issue, often resulting in outputs that include irrelevant

📢 Sharing a couple of interesting observations and results of #GeminiAI Pro Vision on our #HallusionBench (Big thanks to <a href="/GoogleDeepMind/">Google DeepMind</a> for making those API available! 🙏):

1. Language hallucination is a significant issue, often resulting in outputs that include irrelevant

thumb_up_off_alt122

chat_bubble_outline1

repeat19

shareShare

Xijun Wang

@xijunwang_cs

a year ago

See you Saturday 17:15-19:17 at Naupaka #51 #WACV2024 For aerial videos, we proposed MITFAS to focus on the regions corresponding to salient motions and find the more informative frames by using mutual information. Ruiqi Xian Dinesh Manocha GAMMA UMD

thumb_up_off_alt10

chat_bubble_outline0

repeat2

shareShare

Xijun Wang

@xijunwang_cs

a year ago

Join us at #93 of the Poster session on Thursday, February 22! #AAAI24 GAMMA UMD Say hi to Shan Yang there! Our ICAR can recommend items with good visual compatibility including similarity (color, geometry, texture, etc.) and complementarity (like table vs chair).

Join us at #93 of the Poster session on Thursday, February 22! #AAAI24 <a href="/gammaumd/">GAMMA UMD</a>

Say hi to <a href="/shanyangmie/">Shan Yang</a> there!

Our ICAR can recommend items with good visual compatibility including similarity (color, geometry, texture, etc.) and complementarity (like table vs chair).

thumb_up_off_alt10

chat_bubble_outline0

repeat1

shareShare

Xiyang Wu

@wu_xiyang

a year ago

📢Thrilled to share our latest work #AutoHallusion, an automatic hallucination pipeline that scales up the benchmark generation process. 🌐 Webpage & Examples: wuxiyang1996.github.io/autohallusion_…… 📷 Arxiv: arxiv.org/abs/2406.10900 👀Please stay tuned for our future updates! Thanks a lot

thumb_up_off_alt9

chat_bubble_outline0

repeat4

shareShare

GAMMA UMD

@gammaumd

9 months ago

Xijun Wang Xijun Wang will be presenting "ViLA: Efficient Video-Language Alignment for Video Question Answering", which addresses both efficient frame sampling and effective cross-modal alignment! 🗣️ Time: October 3, 4:30pm - 6:30pm Location: Poster session 6, ID 274

Xijun Wang <a href="/xijunwang_cs/">Xijun Wang</a> will be presenting "ViLA: Efficient Video-Language Alignment for Video Question Answering", which addresses both efficient frame sampling and effective cross-modal alignment! 🗣️

Time: October 3, 4:30pm - 6:30pm
Location: Poster session 6, ID 274

thumb_up_off_alt5

chat_bubble_outline1

repeat1

shareShare

Xijun Wang

@xijunwang_cs

9 months ago

#ECCV2024 Visit our ViLA poster and Say Hi to Prof. Ming C. Lin ! Time: October 3, 4:30pm - 6:30pm Location: Poster session 6, ID 274 Arxiv: arxiv.org/abs/2312.08367 Code: github.com/xijun-cs/ViLA Big shout-out to Shan Yang! GAMMA UMD

#ECCV2024 Visit our ViLA poster and Say Hi to Prof.
<a href="/MingCLinCS/">Ming C. Lin</a> !

Time: October 3, 4:30pm - 6:30pm
Location: Poster session 6, ID 274

Arxiv: arxiv.org/abs/2312.08367
Code: github.com/xijun-cs/ViLA

Big shout-out to <a href="/shanyangmie/">Shan Yang</a>!
<a href="/gammaumd/">GAMMA UMD</a>

thumb_up_off_alt7

chat_bubble_outline0

repeat0

shareShare

GAMMA UMD

@gammaumd

8 months ago

.Xijun Wang will be presenting “SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition”, which utilizes prompt learning for aerial video action recognition. ✈️ Thurs Oct 17 17:30 - 17:45 Room 6

thumb_up_off_alt1

chat_bubble_outline1

repeat1

shareShare

GAMMA UMD

@gammaumd

7 months ago

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for VLMs Xiyang Wu*, Tianrui Guan (On Job Market)*, D Li, S Huang, X Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Boyd-Graber, Tianyi Zhou, Dinesh Manocha 11/3; 16:00-17:30 - Riverfront Hall Website: wuxiyang1996.github.io/autohallusion_…