HKUST NLP (@hkustnlp) 's Twitter Profile
HKUST NLP

@hkustnlp

HKUST Natural Language Processing Research #NLProc

ID: 1843940663395852288

linkhttps://cse.hkust.edu.hk/ calendar_today09-10-2024 09:05:04

12 Tweet

232 Followers

107 Following

Wenhao Yu (@wyu_nd) 's Twitter Profile Photo

πŸš€ We release MMLongBench: Benchmark for evaluating long-context VLMs. πŸ“Š 13,331 examples across 5 tasks: – Visual RAG – Many-shot ICL – Needle-in-a-haystack – VL Summarization – Long-document VQA πŸ“ Lengths: 8 / 16 / 32 / 64 / 128K πŸ” Benchmarking both thoroughly & effectively!

πŸš€ We release MMLongBench: Benchmark for evaluating long-context VLMs.
πŸ“Š 13,331 examples across 5 tasks:
– Visual RAG
– Many-shot ICL
– Needle-in-a-haystack
– VL Summarization
– Long-document VQA
πŸ“ Lengths: 8 / 16 / 32 / 64 / 128K
πŸ” Benchmarking both thoroughly & effectively!
May Fung (@may_f1_) 's Twitter Profile Photo

Great to see the wonderful series of work that @WangCarrey has been leading at UIUC. We also had a fun collaboration recently together with my incoming PhD student Shijue. Check out our latest release 𝘈π˜₯𝘒𝘊𝘡𝘳𝘭: 𝘈π˜₯𝘒𝘱𝘡π˜ͺ𝘷𝘦 𝘒𝘯π˜₯ 𝘊𝘰𝘯𝘡𝘳𝘰𝘭𝘭𝘒𝘣𝘭𝘦

Great to see the wonderful series of work that @WangCarrey has been leading at UIUC. We also had a fun collaboration recently together with my incoming PhD student Shijue. Check out our latest release 
𝘈π˜₯𝘒𝘊𝘡𝘳𝘭: 𝘈π˜₯𝘒𝘱𝘡π˜ͺ𝘷𝘦 𝘒𝘯π˜₯ 𝘊𝘰𝘯𝘡𝘳𝘰𝘭𝘭𝘒𝘣𝘭𝘦
Junxian He (@junxian_he) 's Twitter Profile Photo

We studied both rule-based and model-based verifiers and found that each has unique limitations. Rule-based verifiers are often unreliable, even in math, and are unavailable in many domains. Model-based verifiers can be easily hacked. In our paper, we construct simple

Yangqiu Song (@yqsong) 's Twitter Profile Photo

Thrilled to share a major milestone: the culmination of a 15-month project, ATLASβ€”a new benchmark in event graphs and conceptualization! This journey began with Probase in 2012, evolved through ASER (2019), AbstractATOMIC (2022), and AbsPyramid (2023), and now realizes a

Thrilled to share a major milestone: the culmination of a 15-month project, ATLASβ€”a new benchmark in event graphs and conceptualization! This journey began with Probase in 2012, evolved through ASER (2019), AbstractATOMIC (2022), and AbsPyramid (2023), and now realizes a
Zhitao He (@zhouhe777) 's Twitter Profile Photo

🀯 Multimodal LLMs can be confidently wrong. A single early mistake in perception can lead to a completely incorrect answer. πŸš€Introducing our work, MMBoundary, a new framework to make MLLMs aware of their own knowledge boundaries! 🧡 Paper:arxiv.org/abs/2505.23224 #ACL2025

🀯 Multimodal LLMs can be confidently wrong. A single early mistake in perception can lead to a completely incorrect answer.

πŸš€Introducing our work, MMBoundary, a new framework to make MLLMs aware of their own knowledge boundaries! 🧡

Paper:arxiv.org/abs/2505.23224

#ACL2025
Zhaochen Su (@suzhaochen0110) 's Twitter Profile Photo

Excited to share our new survey on the reasoning paradigm shift from "Think with Text" to "Think with Image"! πŸ§ πŸ–ΌοΈ Our work offers a roadmap for more powerful & aligned AI. πŸš€ πŸ“œ Paper: arxiv.org/pdf/2506.23918 ⭐ GitHub (400+🌟): github.com/zhaochen0110/A…

Excited to share our new survey on the reasoning paradigm shift from "Think with Text" to "Think with Image"! πŸ§ πŸ–ΌοΈ
Our work offers a roadmap for more powerful & aligned AI. πŸš€
πŸ“œ Paper: arxiv.org/pdf/2506.23918
⭐ GitHub (400+🌟): github.com/zhaochen0110/A…
Zeyu Qin @ ICLR 2025 (@zeyuqin_alan) 's Twitter Profile Photo

#COLM2025 Our work has been accepted to COLM 2025😊 Looking forward to discussing Scalable Oversight and Synthetic Data with old and new friends in Montréal Conference on Language Modeling !

May Fung (@may_f1_) 's Twitter Profile Photo

Heading out to #ACL2025 in Vienna with six main/finding papers to present! πŸ‡¦πŸ‡ΉβœˆοΈπŸ€© Would love to chat about research on multimodal model reasoning and agent, as well as opportunities in my group HKUST NLP. Please DM if you'd like to meet!

Heading out to #ACL2025 in Vienna with six main/finding papers to present! πŸ‡¦πŸ‡ΉβœˆοΈπŸ€©

Would love to chat about research on multimodal model reasoning and agent, as well as opportunities in my group <a href="/hkustNLP/">HKUST NLP</a>.

Please DM if you'd like to meet!
May Fung (@may_f1_) 's Twitter Profile Photo

HKUST NLP UIUC NLP ACL 2025 [1/n] "π˜”π˜’π˜΅π˜€π˜©π˜ͺ𝘯𝘨 𝘀𝘢𝘦𝘴 𝘧𝘰𝘳 π˜ͺπ˜₯𝘦𝘯𝘡π˜ͺ𝘀𝘒𝘭 𝘰𝘣𝘫𝘦𝘀𝘡𝘴, π˜₯π˜ͺ𝘴𝘡π˜ͺ𝘯𝘀𝘡 𝘒𝘡𝘡𝘳π˜ͺ𝘣𝘢𝘡𝘦𝘴 𝘧𝘰𝘳 𝘢𝘯π˜ͺ𝘲𝘢𝘦 𝘰𝘯𝘦𝘴." Such π™˜π™§π™€π™¨π™¨-π™˜π™€π™£π™©π™šπ™­π™© π™«π™žπ™¨π™ͺ𝙖𝙑 π™§π™šπ™–π™¨π™€π™£π™žπ™£π™œ is extremely simple and straightforward for the human cognitive process,

<a href="/hkustNLP/">HKUST NLP</a> <a href="/uiuc_nlp/">UIUC NLP</a> <a href="/aclmeeting/">ACL 2025</a> [1/n] "π˜”π˜’π˜΅π˜€π˜©π˜ͺ𝘯𝘨 𝘀𝘢𝘦𝘴 𝘧𝘰𝘳 π˜ͺπ˜₯𝘦𝘯𝘡π˜ͺ𝘀𝘒𝘭 𝘰𝘣𝘫𝘦𝘀𝘡𝘴, π˜₯π˜ͺ𝘴𝘡π˜ͺ𝘯𝘀𝘡 𝘒𝘡𝘡𝘳π˜ͺ𝘣𝘢𝘡𝘦𝘴 𝘧𝘰𝘳 𝘢𝘯π˜ͺ𝘲𝘢𝘦 𝘰𝘯𝘦𝘴." Such π™˜π™§π™€π™¨π™¨-π™˜π™€π™£π™©π™šπ™­π™© π™«π™žπ™¨π™ͺ𝙖𝙑 π™§π™šπ™–π™¨π™€π™£π™žπ™£π™œ is extremely simple and straightforward for the human cognitive process,
Zhitao He (@zhouhe777) 's Twitter Profile Photo

Excited to present our work, MMBoundary, at #ACL2025! Come chat with us at our poster session! πŸ“ Hall 4/5, Session 12: Poster Session 4 πŸ—“οΈ Wednesday, July 30 ⏰ 11:00-12:30

Excited to present our work, MMBoundary, at #ACL2025!

Come chat with us at our poster session!
πŸ“ Hall 4/5, Session 12: Poster Session 4
πŸ—“οΈ Wednesday, July 30
⏰ 11:00-12:30
Hongru Wang (@wangcarrey) 's Twitter Profile Photo

Actually, we implemented this kind of capability in our AdaCtrl paper three months ago by injecting difficult-aware tags (i.e., easy, hard, adaptive) to trigger different reasoning behaviors of LLMs. Paper: arxiv.org/pdf/2505.18822

May Fung (@may_f1_) 's Twitter Profile Photo

πŸŽ‰ Congrats to our students on the new #EMNLP2025 paper acceptances! 1⃣ LLM Natural-Formal Hybrid Reasoning arxiv.org/abs/2505.23703 2⃣ Text-Instructed Image Editing on Medical Domain arxiv.org/abs/2506.01921 3⃣ Knowledge Boundary Aware Multi-Compositional Reasoning

πŸŽ‰ Congrats to our students on the new #EMNLP2025 paper acceptances! 1⃣ LLM Natural-Formal Hybrid Reasoning arxiv.org/abs/2505.23703 2⃣ Text-Instructed Image Editing on Medical Domain arxiv.org/abs/2506.01921 3⃣ Knowledge Boundary Aware Multi-Compositional Reasoning
Heng Ji (@hengjinlp) 's Twitter Profile Photo

Check out this awesome paper on a new paradigm about thinking with imagination/images led by Prof. May Fung May Fung and her super productive team at HKUST!

IJCAIconf (@ijcaiconf) 's Twitter Profile Photo

How far are we from Artificial General Intelligenceβ€”and what might follow as Artificial Superintelligence? IJCAI Closing Panel opened by James Kwok, #IJCAI2025 Programe Chair. share.google/DbB2Ky20bCc3B0… #Montreal #AI

How far are we from Artificial General Intelligenceβ€”and what might follow as Artificial Superintelligence? IJCAI Closing Panel opened by James Kwok, #IJCAI2025 Programe Chair.
 share.google/DbB2Ky20bCc3B0…

#Montreal #AI