Xuejun Zhang (@eva_xuejunzhang) 's Twitter Profile
Xuejun Zhang

@eva_xuejunzhang

Undergraduate @UMich @sjtu1896 | Research Assistant @SLED_AI | Multimodal Learning, Natural Language Processing | Incoming CS PhD @uiuc_nlp

ID: 1729940047888089089

Link: http://xuejunzhang2002.github.io | Joined: 29-11-2023 19:07:18

33 Tweets

210 Followers

354 Following

Xuejun Zhang (@eva_xuejunzhang)

Excited to share that our Multi-Object Hallucination paper will be presented at #NeurIPS2024! Looking forward to seeing you in Vancouver!

Xuweiyi Chen (@chenxuweiyi)

🚀 Excited to share our latest paper: “Learning 3D Representations from Procedural 3D Programs” We explore self-supervised learning of 3D representations using procedurally generated shapes, with no reliance on human-designed 3D datasets. We found that Self-supervised 3D

Songlin Yang (@songlinyang4)

(1/10) Excited to share one of the most elegant works I’ve been working on: Parallelizing Linear Transformers with the Delta Rule over Sequence Length! 🎉 📄 Published at NeurIPS ‘24 📍 Catch my poster in person: NeurIPS East Exhibit Hall A-C #2009 🗓️ Fri, Dec 13 | 4:30–7:30 p.m

Xuejun Zhang (@eva_xuejunzhang)

I am at Vancouver this week🇨🇦! I’ll present our work at #NeurIPS2024 this Friday, Dec 13 4:30-7:30 pm at East Exhibition Hall A-C #4303. Feel free to drop by and chat if you are interested. I’m also actively looking for PhD opportunities for Fall 2025. Would love to connect with

Sasha Rush (@srush_nlp)

10 short videos about LLM infrastructure to help you appreciate Pages 12-18 of the DeepSeek-v3 paper (arxiv.org/abs/2412.19437) 🧵 youtube.com/watch?v=76gulN…

Songlin Yang (@songlinyang4)

🚀 Announcing ASAP: asap-seminar.github.io! A fully virtual seminar bridging theory, algorithms, and systems to tackle fundamental challenges in Transformers. Co-organized by Simran Arora, Xinyu Yang, and Han Guo. Our first speaker: Alex Wang on Test-time Regression

Yongyuan Liang (@cheryyun_l)

We release Magma, the first VLM that explores the vital interplay between multimodal understanding and actions across both virtual and physical worlds—not only controlling your laptop but also directing your robot! 🥳 Magma bridges verbal and spatial-temporal intelligence.

Sasha Rush (@srush_nlp)

I convinced Songlin (Songlin Yang) of Flash Linear Attention fame to give me a personal tutorial on the current state of attention. Video to come, (it's like 2hr!)

Manling Li (@manlingli_)

Excited about the tutorial on "The Lifecycle of Knowledge in LLMs: Memorization, Editing, and Beyond" with Zoey Sha Li, Yuji Zhang, Chi Han, and Heng Ji. Slides/Video (upcoming): llmknowledgelifecycle.github.io/AAAI2025_Tutor… Time: Feb 26 8:30-12:30 Location: Room 116 Zoom: underline.io/events/487/ses…

Jianing “Jed” Yang (@jed_yang)

⚡️ Excited to announce Fast3R: 3D reconstruction of 1000+ images in a single forward pass! Fast3R achieves 251 FPS at its peak. 🔥 Try the demo with your images or video! 🔗 Website: fast3r-3d.github.io 🎮 Demo: fast3r.ngrok.app #CVPR2025 #3D AI at Meta

Freda Shi (@fredahshi)

📜New Paper: on the surprising effectiveness of multilingual chain of thought (originally discovered w/ Mirac Suzgun and Google DeepMind folks in ICLR23). Linguistic diversity in prompts helps solve problems in low-resource languages, even if there's no alphabetical overlap!

Heng Ji (@hengjinlp)

Chi Han's recent deep analysis on the hidden computational mechanisms behind LLM position generalization shows: Attention logits ≈ positional pattern + semantic importance (0.959 correlation!) arxiv.org/abs/2503.13305

Martin Ziqiao Ma (@ziqiao_ma)

Vision-Language Models (VLMs) can describe the environment, but can they refer within it? Our findings reveal a critical gap: VLMs fall short of pragmatic optimality. We identify 3 key failures of pragmatic competence in referring expression generation with VLMs: (1) cannot

Heng Ji (@hengjinlp)

We are extremely excited to announce mCLM, a Modular Chemical Language Model that is friendly to automatable block-based chemistry and mimics bilingual speakers by “code-switching” between functional molecular modules and natural language descriptions of the functions. 1/2

Shizhe Diao (@shizhediao)

Does RL truly expand a model’s reasoning🧠capabilities? Contrary to recent claims, the answer is yes—if you push RL training long enough! Introducing ProRL 😎, a novel training recipe that scales RL to >2k steps, empowering the world’s leading 1.5B reasoning model💥and offering

Zhenjun Zhao (@zhenjun_zhao)

SAB3R: Semantic-Augmented Backbone in 3D Reconstruction Xuweiyi Chen, Tian XIA, Si.X, Jianing “Jed” Yang @ CVPR, Joyce Chai, Zezhou Cheng tl;dr: MASt3R+distillation->open-vocabulary segmentation+3D reconstruction arxiv.org/abs/2506.02112

Peixuan Han (韩沛煊) (@peixuanhakhan)

(1/5) Super excited to release our new paper on Reinforcement Learning: "Self-Aligned Reward: Towards Effective and Efficient Reasoners"! Preprint: arxiv.org/pdf/2509.05489

Cheng Qian (@qiancheng1231)

🚀 Introducing UserRL: a new framework to train agents that truly assist users through proactive interaction, not just chase static benchmarking scores. 📄 Paper: arxiv.org/pdf/2509.19736 💻 Code: github.com/SalesforceAIRe…
