Xijun Wang (@xijunwang_cs) 's Twitter Profile
Xijun Wang

@xijunwang_cs

PhD student @umdcs @gammaumd Applied Scientist intern @amazon/a9

ID: 1426508593390567426

calendar_today14-08-2021 11:38:56

49 Tweet

146 Takipçi

159 Takip Edilen

Xijun Wang (@xijunwang_cs) 's Twitter Profile Photo

There must be a reason for getting fat. It’s not surprising that two of us who sneaked food met in the midnight 🤔 Seeing your chubby back, I silently poured a glass of water to dispel the idea of ​​eating night food😎

There must be a reason for getting fat. It’s not surprising that two of us who sneaked food met in the midnight 🤔 Seeing your chubby back, I silently poured a glass of water to dispel the idea of ​​eating night food😎
Xijun Wang (@xijunwang_cs) 's Twitter Profile Photo

How to strengthen both CNNs and Transformers? Check our work “SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers” on Oct. 2nd (1:30pm-6pm) at #ICCV2023 Workshop on New Ideas in Vision Transformers A P01 ArXiv: arxiv.org/abs/2308.07110

How to strengthen both CNNs and Transformers? 

Check our work “SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers” on Oct. 2nd (1:30pm-6pm) at #ICCV2023 Workshop on New Ideas in Vision Transformers 
<a href="/room/">A</a> P01 

ArXiv: arxiv.org/abs/2308.07110
Fuxiao Liu (@fuxiaol) 's Twitter Profile Photo

🔥CoBig shoutout to Google for launching their groundbreaking large multimodal model #Gemini. 🚩However, there are still obvious hallucinations with Gemini. Here are a few examples with Gemini Pro. Want to learn more? Look at our HallusionBench Paper: arxiv.org/pdf/2310.14566…

🔥CoBig shoutout to <a href="/Google/">Google</a> for launching their groundbreaking large multimodal model #Gemini.
🚩However, there are still obvious hallucinations with Gemini. Here are a few examples with Gemini Pro.

Want to learn more? Look at our HallusionBench Paper: arxiv.org/pdf/2310.14566…
Tianrui Guan (On Job Market) (@terryguan97) 's Twitter Profile Photo

📢 Sharing a couple of interesting observations and results of #GeminiAI Pro Vision on our #HallusionBench (Big thanks to Google DeepMind for making those API available! 🙏): 1. Language hallucination is a significant issue, often resulting in outputs that include irrelevant

📢 Sharing a couple of interesting observations and results of #GeminiAI Pro Vision on our #HallusionBench (Big thanks to <a href="/GoogleDeepMind/">Google DeepMind</a> for making those API available! 🙏):

1. Language hallucination is a significant issue, often resulting in outputs that include irrelevant
Xijun Wang (@xijunwang_cs) 's Twitter Profile Photo

See you Saturday 17:15-19:17 at Naupaka #51 #WACV2024 For aerial videos, we proposed MITFAS to focus on the regions corresponding to salient motions and find the more informative frames by using mutual information. Ruiqi Xian Dinesh Manocha GAMMA UMD

See you Saturday 17:15-19:17 at Naupaka #51 #WACV2024

For aerial videos, we proposed MITFAS to focus on the regions corresponding to salient motions and find the more informative frames by using mutual information.

<a href="/RuiqiXian/">Ruiqi Xian</a> <a href="/dmanocha/">Dinesh Manocha</a> <a href="/gammaumd/">GAMMA UMD</a>
Xijun Wang (@xijunwang_cs) 's Twitter Profile Photo

Join us at #93 of the Poster session on Thursday, February 22! #AAAI24 GAMMA UMD Say hi to Shan Yang there! Our ICAR can recommend items with good visual compatibility including similarity (color, geometry, texture, etc.) and complementarity (like table vs chair).

Join us at #93 of the Poster session on Thursday, February 22!  #AAAI24 <a href="/gammaumd/">GAMMA UMD</a>

Say hi to <a href="/shanyangmie/">Shan Yang</a> there!  

Our ICAR can recommend items with good visual compatibility including similarity (color, geometry, texture, etc.) and complementarity (like table vs chair).
Xiyang Wu (@wu_xiyang) 's Twitter Profile Photo

📢Thrilled to share our latest work #AutoHallusion, an automatic hallucination pipeline that scales up the benchmark generation process. 🌐 Webpage & Examples: wuxiyang1996.github.io/autohallusion_…… 📷 Arxiv: arxiv.org/abs/2406.10900 👀Please stay tuned for our future updates! Thanks a lot

GAMMA UMD (@gammaumd) 's Twitter Profile Photo

Xijun Wang Xijun Wang will be presenting "ViLA: Efficient Video-Language Alignment for Video Question Answering", which addresses both efficient frame sampling and effective cross-modal alignment! 🗣️ Time: October 3, 4:30pm - 6:30pm Location: Poster session 6, ID 274

Xijun Wang <a href="/xijunwang_cs/">Xijun Wang</a> will be presenting "ViLA: Efficient Video-Language Alignment for Video Question Answering", which addresses both efficient frame sampling and effective cross-modal alignment! 🗣️

Time: October 3, 4:30pm - 6:30pm
Location: Poster session 6, ID 274
Xijun Wang (@xijunwang_cs) 's Twitter Profile Photo

#ECCV2024 Visit our ViLA poster and Say Hi to Prof. Ming C. Lin ! Time: October 3, 4:30pm - 6:30pm Location: Poster session 6, ID 274 Arxiv: arxiv.org/abs/2312.08367 Code: github.com/xijun-cs/ViLA Big shout-out to Shan Yang! GAMMA UMD

#ECCV2024 Visit our ViLA poster and Say Hi to Prof.
<a href="/MingCLinCS/">Ming C. Lin</a> ! 

Time: October 3, 4:30pm - 6:30pm 
Location: Poster session 6, ID 274  

Arxiv: arxiv.org/abs/2312.08367
Code: github.com/xijun-cs/ViLA

Big shout-out to <a href="/shanyangmie/">Shan Yang</a>!
<a href="/gammaumd/">GAMMA UMD</a>
GAMMA UMD (@gammaumd) 's Twitter Profile Photo

.Xijun Wang will be presenting “SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition”, which utilizes prompt learning for aerial video action recognition. ✈️ Thurs Oct 17 17:30 - 17:45 Room 6

Xijun Wang (@xijunwang_cs) 's Twitter Profile Photo

LLMs know how to make use of short cuts! This is a good exploration on the potential LLM redundancy. Check our paper for more details!