HKUST NLP (@hkustnlp)'s Twitter Profile
HKUST NLP

@hkustnlp

HKUST Natural Language Processing Research #NLProc

ID: 1843940663395852288

Link: https://cse.hkust.edu.hk/
Joined: 09-10-2024 09:05:04

12 Tweets

232 Followers

107 Following

Wenhao Yu (@wyu_nd)

🚀 We release MMLongBench: Benchmark for evaluating long-context VLMs.
📊 13,331 examples across 5 tasks:
– Visual RAG
– Many-shot ICL
– Needle-in-a-haystack
– VL Summarization
– Long-document VQA
📏 Lengths: 8 / 16 / 32 / 64 / 128K
🔍 Benchmarking both thoroughly & effectively!

May Fung (@may_f1_)

Great to see the wonderful series of work that @WangCarrey has been leading at UIUC. We also had a fun collaboration recently together with my incoming PhD student Shijue. Check out our latest release AdaCtrl: Adaptive and Controllable

Junxian He (@junxian_he)

We studied both rule-based and model-based verifiers and found that each has unique limitations. Rule-based verifiers are often unreliable, even in math, and are unavailable in many domains. Model-based verifiers can be easily hacked. In our paper, we construct simple

Yangqiu Song (@yqsong)

Thrilled to share a major milestone: the culmination of a 15-month project, ATLAS, a new benchmark in event graphs and conceptualization! This journey began with Probase in 2012, evolved through ASER (2019), AbstractATOMIC (2022), and AbsPyramid (2023), and now realizes a

Zhitao He (@zhouhe777)

🤯 Multimodal LLMs can be confidently wrong. A single early mistake in perception can lead to a completely incorrect answer.
🚀 Introducing our work, MMBoundary, a new framework to make MLLMs aware of their own knowledge boundaries! 🧵
Paper: arxiv.org/abs/2505.23224
#ACL2025

Zhaochen Su (@suzhaochen0110)

Excited to share our new survey on the reasoning paradigm shift from "Think with Text" to "Think with Image"! 🧠🖼️
Our work offers a roadmap for more powerful & aligned AI. 🚀
📜 Paper: arxiv.org/pdf/2506.23918
⭐ GitHub (400+🌟): github.com/zhaochen0110/A…

Zeyu Qin @ ICLR 2025 (@zeyuqin_alan)

#COLM2025 Our work has been accepted to COLM 2025 😊 Looking forward to discussing Scalable Oversight and Synthetic Data with old and new friends in Montréal at the Conference on Language Modeling!

May Fung (@may_f1_)

Heading out to #ACL2025 in Vienna with six main/Findings papers to present! 🇦🇹✈️🤩
Would love to chat about research on multimodal model reasoning and agents, as well as opportunities in my group, HKUST NLP.
Please DM if you'd like to meet!

May Fung (@may_f1_)

@hkustNLP @uiuc_nlp @aclmeeting [1/n] "Matching cues for identical objects, distinct attributes for unique ones." Such cross-context visual reasoning is extremely simple and straightforward for the human cognitive process,

Zhitao He (@zhouhe777)

Excited to present our work, MMBoundary, at #ACL2025! Come chat with us at our poster session!
📍 Hall 4/5, Session 12: Poster Session 4
🗓️ Wednesday, July 30
⏰ 11:00-12:30

Hongru Wang (@wangcarrey)

Actually, we implemented this kind of capability in our AdaCtrl paper three months ago by injecting difficulty-aware tags (i.e., easy, hard, adaptive) to trigger different reasoning behaviors of LLMs. Paper: arxiv.org/pdf/2505.18822

May Fung (@may_f1_)

🎉 Congrats to our students on the new #EMNLP2025 paper acceptances!
1. LLM Natural-Formal Hybrid Reasoning arxiv.org/abs/2505.23703
2. Text-Instructed Image Editing on Medical Domain arxiv.org/abs/2506.01921
3. Knowledge Boundary Aware Multi-Compositional Reasoning

Heng Ji (@hengjinlp)

Check out this awesome paper on a new paradigm about thinking with imagination/images, led by Prof. May Fung and her super productive team at HKUST!

IJCAIconf (@ijcaiconf)

How far are we from Artificial General Intelligence, and what might follow as Artificial Superintelligence? IJCAI Closing Panel opened by James Kwok, #IJCAI2025 Program Chair. share.google/DbB2Ky20bCc3B0… #Montreal #AI
