Hyeonjeong Ha (@hyeonjeong_ai)'s Twitter Profile
Hyeonjeong Ha

@hyeonjeong_ai

Ph.D. student @IllinoisCS @UIUC_NLP | Previously @KAIST @kaist_ai

ID: 1637364968004923392

Link: https://hyeonjeongha.github.io/ | Joined: 19-03-2023 08:06:53

97 Tweets

300 Followers

336 Following

Chi Han (@glaciohound)'s Twitter Profile Photo

Welcome to my #AAAI2025 Tutorial, "The Quest for A Science of LMs," today!
Time: Feb 26, 2pm-3:45pm
Location: Room 113A, Pennsylvania Convention Center
Website: glaciohound.github.io/Science-of-LLM…
Underline: underline.io/events/487/sch…

Jeonghwan Kim (@masterjeongk)'s Twitter Profile Photo

SearchDet was accepted to #CVPR2025 🎉 We retrieve images from the Web and generate heatmaps through simple feature subtraction to improve long-tail object detection 👍
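
As a rough illustration of the idea (a minimal sketch, not SearchDet's actual code: the ViT features, exemplar embeddings, and the exact subtraction rule below are assumed stand-ins), a heatmap from feature subtraction could look like:

```python
import torch
import torch.nn.functional as F

# Toy stand-ins: ViT patch features for the query image, plus pooled
# embeddings for a web-retrieved object exemplar and a background exemplar.
D, H, W = 256, 16, 16
patch_feats = F.normalize(torch.randn(H * W, D), dim=-1)  # query image patches
obj_feat = F.normalize(torch.randn(D), dim=0)             # retrieved object exemplar
bg_feat = F.normalize(torch.randn(D), dim=0)              # background exemplar

# "Feature subtraction": remove the background direction from the object
# embedding, then score each patch against the residual to get a heatmap.
query = F.normalize(obj_feat - bg_feat, dim=0)
heatmap = (patch_feats @ query).reshape(H, W)
heatmap = (heatmap - heatmap.min()) / (heatmap.max() - heatmap.min() + 1e-8)
print(heatmap.shape)  # (16, 16) relevance map over patches
```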

Wenhu Chen (@wenhuchen)'s Twitter Profile Photo

We have made huge progress in language model reasoning. But our progress in multimodal reasoning (like MMMU) is very limited.
Why? It's due to the lack of diverse, difficult, and high-quality multimodal reasoning datasets!

🚀 New Paper Alert! 📢
We introduce VisualWebInstruct,
Zhenhailong Wang (@zhenhailongw)'s Twitter Profile Photo

Why allocate the same number of visual tokens to a blank image and a complex landscape? Introducing DyMU: a training-free algorithm that makes any ViT visual encoder dynamic-length and plug-and-play with downstream VLMs. 🚀
🔗 Project Page: mikewangwzhl.github.io/dymu/
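
For intuition on how a training-free, dynamic-length encoder can work, here is a minimal sketch assuming a ToMe-style similarity merge (illustrative only; DyMU's actual merging/unmerging procedure differs):

```python
import torch
import torch.nn.functional as F

def merge_similar_tokens(tokens: torch.Tensor, threshold: float = 0.9) -> torch.Tensor:
    """Greedily average runs of adjacent tokens whose cosine similarity
    exceeds `threshold`, so low-detail images yield fewer tokens."""
    out = [tokens[0]]
    for tok in tokens[1:]:
        if F.cosine_similarity(out[-1], tok, dim=0) > threshold:
            out[-1] = (out[-1] + tok) / 2  # fold into the previous token
        else:
            out.append(tok)
    return torch.stack(out)

vit_tokens = torch.randn(196, 768)           # fixed-length ViT output
vit_tokens[50:150] = vit_tokens[50]          # simulate a flat, blank region
merged = merge_similar_tokens(vit_tokens)    # variable length, content-dependent
print(vit_tokens.shape, "->", merged.shape)  # the flat region collapses to one token
```

Under a rule like this, a blank image collapses to a handful of tokens while a complex landscape keeps most of its original 196.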
Vercept (@vercept_ai)'s Twitter Profile Photo

Today we're excited to introduce Vy, our AI that sees and acts on your computer. At Vercept, our mission is to reinvent how humans use computers, enabling you to accomplish orders of magnitude more than what you can do today. Vy is a first glimpse at AI that sees and uses your

Hyeonjeong Ha (@hyeonjeong_ai)'s Twitter Profile Photo

🚀 Computational persuasion of LLMs can be a game-changer: dive into our new survey to explore the taxonomy, spot the risks, and investigate further challenges in persuasive LLMs!

Salesforce AI Research (@sfresearch)'s Twitter Profile Photo

We're thrilled to announce BLIP3-o, a breakthrough in unified multimodal models that excels at both image understanding and generation in a single autoregressive architecture! 💫

📊 Paper: bit.ly/3Saybpo
🤗 Models: bit.ly/4jhFaYM
🧠 Code:
Yangyi Chen (on job market) (@yangyichen6666)'s Twitter Profile Photo

๐Ÿ‚๐ŸบIntroducing our recent preprint: Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training! We present PRIOR, a simple vision-language pre-training algorithm that addresses the challenge of irrelevant textual content in image-caption pairs. PRIOR enhances

๐Ÿ‚๐ŸบIntroducing our recent preprint: Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training! 

We present PRIOR, a simple vision-language pre-training algorithm that addresses the challenge of irrelevant textual content in image-caption pairs. PRIOR enhances
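
The tweet cuts off, but the stated idea, focusing the loss on image-related caption tokens, can be sketched as a toy reweighted objective. Everything below (the weighting rule, the use of a text-only reference LM, the numbers) is an assumed illustration, not PRIOR's actual formulation:

```python
import torch

# Per-token NLL from the VLM being trained, and next-token confidence from a
# text-only reference LM on the same caption (toy numbers).
vlm_nll = torch.tensor([2.1, 0.4, 3.0, 0.2, 1.5])
p_text_only = torch.tensor([0.05, 0.90, 0.02, 0.95, 0.30])

# Tokens a text-only LM already predicts well carry little visual signal,
# so downweight them and spend capacity on image-related tokens.
weights = 1.0 - p_text_only
weights = weights / weights.sum() * weights.numel()  # keep mean weight at 1
loss = (weights * vlm_nll).mean()
print(loss.item())
```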
Heng Ji (@hengjinlp)'s Twitter Profile Photo

We are extremely excited to announce mCLM, a Modular Chemical Language Model that is friendly to automatable block-based chemistry and mimics bilingual speakers by "code-switching" between functional molecular modules and natural language descriptions of the functions. 1/2

Yi Xu (@_yixu)'s Twitter Profile Photo

🚀 Let's Think Only with Images.

No language and no verbal thought. 🤔

Let's think through a sequence of images 💭, like how humans picture steps in their minds 🎨.

We propose Visual Planning, a novel reasoning paradigm that enables models to reason purely through images.
Hyeonjeong Ha (@hyeonjeong_ai)'s Twitter Profile Photo

Thrilled to share that our paper has been accepted to #ACL2025 Main 🇦🇹

Huge thanks to my amazing collaborators and my advisor Heng Ji 🙃
📄 arxiv.org/abs/2502.17793

Happy to chat about our work as well as MLLM research projects 🙌
Stella Li (@stellalisy)'s Twitter Profile Photo

🤯 We cracked RLVR with... Random Rewards?!
Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:
- Random rewards: +21%
- Incorrect rewards: +25%
- (FYI) Ground-truth rewards: +28.8%
How could this even work ⁉️ Here's why: 🧵
Blogpost: tinyurl.com/spurious-rewar…
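
Concretely, the three settings above differ only in the reward function handed to the RLVR trainer; a toy sketch (function names and signatures are illustrative, not the authors' code):

```python
import random

def reward_random(prompt: str, completion: str, answer: str) -> float:
    return float(random.random() < 0.5)      # coin flip: carries no correctness signal

def reward_incorrect(prompt: str, completion: str, answer: str) -> float:
    return float(answer not in completion)   # deliberately rewards wrong answers

def reward_ground_truth(prompt: str, completion: str, answer: str) -> float:
    return float(answer in completion)       # standard verifiable reward

print(reward_random("2+2=?", "The answer is 4.", "4"))
```

That even the first two settings improve MATH-500 is the thread's surprise; the blogpost digs into why.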
Cheng Qian (@qiancheng1231)'s Twitter Profile Photo

📢 New Paper Drop: From Solving to Modeling!
LLMs can solve math problems, but can they model the real world? 🌍

📄 arXiv: arxiv.org/pdf/2505.15068
💻 Code: github.com/qiancheng0/Mod…

Introducing ModelingAgent, a breakthrough system for real-world mathematical modeling with LLMs.
May Fung (@may_f1_)'s Twitter Profile Photo

🧠 How can AI evolve from statically "thinking about images" → dynamically "thinking with images" as cognitive workspaces, similar to the human mental sketchpad?
🔍 What's the research roadmap from tool-use → programmatic
Hyeonjeong Ha (@hyeonjeong_ai)'s Twitter Profile Photo

Excited to share our work on Energy-Based Transformers, led by my amazing labmate Alexi Gladstone: a new frontier in unlocking generalized reasoning across modalities without rewards. Grateful to be part of this journey! ⚡️
🧠 Think longer. Verify better. Generalize further.
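
A minimal sketch of the energy-based idea as described ("think longer" = more refinement steps at inference, with the energy itself acting as a verifier score). The architecture and update rule here are assumed toys, not the paper's model:

```python
import torch

# Learned energy function: low energy = more plausible prediction.
energy_net = torch.nn.Sequential(
    torch.nn.Linear(16, 64), torch.nn.SiLU(), torch.nn.Linear(64, 1)
)
y = torch.randn(16, requires_grad=True)  # candidate answer (a latent to refine)
opt = torch.optim.SGD([y], lr=0.1)

for _ in range(20):                      # more steps = longer "thinking"
    energy = energy_net(y).sum()
    opt.zero_grad()
    energy.backward()
    opt.step()

print("final energy (verifier score):", energy_net(y).item())
```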

Hyeonjeong Ha (@hyeonjeong_ai)'s Twitter Profile Photo

🚀 Excited to share our work led by my amazing labmate Zhenhailong Wang: PAPO (Perception-Aware Policy Optimization), an extension of GRPO for multimodal reasoning!
No extra labels. No reward models. Just internal supervision.
🔥 Learning to perceive while learning to reason.
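
The tweet doesn't spell out the objective, but "internal supervision" for perception can be sketched as an auxiliary term comparing the policy's predictions with the image intact vs. corrupted; the following is an assumed toy version, not PAPO's actual loss:

```python
import torch
import torch.nn.functional as F

# Toy logits: p(next token | prompt, image) vs. p(next token | prompt, masked image).
logits_image = torch.randn(4, 32000)
logits_masked = torch.randn(4, 32000)

# If masking the image barely changes the distribution, the model isn't
# really looking at it; encourage divergence so perception matters.
kl = F.kl_div(F.log_softmax(logits_masked, dim=-1),
              F.softmax(logits_image, dim=-1), reduction="batchmean")

grpo_loss = torch.tensor(0.0)      # placeholder for the usual GRPO objective
total_loss = grpo_loss - 0.1 * kl  # no extra labels or reward model needed
print(total_loss.item())
```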

Hyeonjeong Ha (@hyeonjeong_ai)'s Twitter Profile Photo

Excited to be presenting on Monday, 7/28, from 11:00am-12:30pm in Hall 4/5 at ACL! If you're interested in MLLM research, I'd love to chat. Come say hi! 🇦🇹👋

Hyeonjeong Ha (@hyeonjeong_ai)'s Twitter Profile Photo

Thrilled to share our NeurIPS 2025 Spotlight 🎉 Check out our PARTONOMY paper! Led by my amazing labmates Jeonghwan Kim and Ansel, we introduce the PARTONOMY benchmark and the PLUM model for part-level visual understanding and grounding 🔥

Cheng Qian (@qiancheng1231)'s Twitter Profile Photo

🚀 Introducing UserRL: a new framework to train agents that truly assist users through proactive interaction, not just chase static benchmarking scores.

📄 Paper: arxiv.org/pdf/2509.19736
💻 Code: github.com/SalesforceAIRe…