Yusen Zhang (@yusenzhangnlp) Twitter Tweets • TwiCopy

Yusen Zhang

@yusenzhangnlp

PhD Candidate @PennStateEECS | NLP Lab @NLP_PennState #NLProc | Prev Research Intern @MSFTResearch, @AmazonScience @GoogleAI

ID: 1590626031332954113

Website: http://yuszh.com

Joined: 10-11-2022 08:43:02

86 Tweets

352 Followers

436 Following

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

7 months ago

HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding? "we introduce HRScene, a novel unified benchmark for HRI understanding with rich scenes. HRScene incorporates 25 real-world datasets and 2 synthetic diagnostic datasets with resolutions ranging from

Likes: 35, Replies: 3, Reposts: 11

Ryo Kamoi

@ryokamoi

7 months ago

📢 New paper! FoVer enhances PRMs for step-level verification of LLM reasoning w/o human annotation 🚀 We synthesize training data using formal verification tools and improve LLMs at step-level verification of LLM responses on MATH, AIME, MMLU, BBH, etc. arxiv.org/abs/2505.15960

Likes: 127, Replies: 4, Reposts: 25

GPT Maestro | LLMpedia Curator

@gptmaestro

6 months ago

Vision Language Models display a peculiar blind spot: their ability to process image content declines in a U-shaped pattern based on Manhattan distance from corners, suggesting fundamental limitations in handling high-resolution layouts.

Likes: 1, Replies: 1, Reposts: 1

Yusen Zhang

@yusenzhangnlp

5 months ago

HRScene got accepted at #ICCV2025! HRScene is a novel unified benchmark for high-resolution image understanding with 25 scenes and 2 NIAH tests. Home page: yszh8.github.io/hrscene/ (Sorry, EvalAI for submission does not work currently...) My PhD research began with long text

Likes: 7, Replies: 0, Reposts: 3

Ryo Kamoi

@ryokamoi

5 months ago

Our paper VisOnlyQA has been accepted to Conference on Language Modeling #COLM2025! See you in Montreal🍁 We find that even recent Vision Language Models struggle with simple questions about geometric properties in images, such as "What is the degree of angle AOD?"🧐 arxiv.org/abs/2412.00947

Likes: 58, Replies: 2, Reposts: 9

Ryo Kamoi

@ryokamoi

5 months ago

We updated our VisOnlyQA paper for #COLM2025! * LVLMs exhibit weak geometric perception even on geometric shapes with 2–3 lines 😭 * Gemini 2.5 Pro largely improves over prior models on charts and chemistry 😳 but still struggles with geometric shapes 😖 arxiv.org/abs/2412.00947

Likes: 11, Replies: 0, Reposts: 2

Penn State Center for Socially Responsible AI

@pennstatecsrai

4 months ago

How are researchers optimizing AI systems for science? CSRAI Affiliate Rui Zhang from Penn State EECS shares how to improve the efficiency and usefulness of AI and some strategies individuals can employ to get more value out of their personal AI use. psu.edu/news/engineeri…

Likes: 6, Replies: 0, Reposts: 5

Rui Zhang

@ruizhang_nlp

4 months ago

📢 Call for Papers: NewSumm 2025 - The 5th New Frontiers in Summarization Workshop at EMNLP 2025 The summarization research community is invited to submit to NewSumm 2025, co-located with EMNLP 2025! As LLMs continue to transform our field, we're expanding beyond traditional

Likes: 26, Replies: 0, Reposts: 13

Simeng (Sophia) Han

@hansineng

2 months ago

I’ve completed my Ph.D. at Yale Engineering and will be joining Stanford University as a postdoc! 1/n

Likes: 9.9K, Replies: 319, Reposts: 399

Yusen Zhang

@yusenzhangnlp

2 months ago

We are excited to announce WMAC 2026, co-located with AAAI 2026!

Likes: 7, Replies: 0, Reposts: 3

Ryo Kamoi

@ryokamoi

2 months ago

I'll be attending #COLM2025 Conference on Language Modeling in person 🇨🇦 I will present our work, VisOnlyQA, on the limitations of vision-language models at Poster Session 4 (Wed). Looking forward to chatting with everyone! Paper: openreview.net/forum?id=PYHwl… x.com/RyoKamoi/statu…

Likes: 6, Replies: 0, Reposts: 1