Peixuan Han (韩沛煊) (@peixuanhakhan) Twitter Tweets • TwiCopy

Peixuan Han (韩沛煊)

@peixuanhakhan

+ Follow

1st year Ph.D. student at UIUC @IllinoisCS
LLM researcher

ID: 1839016452311130112

linkhttps://hanpx20.github.io/ calendar_today25-09-2024 18:57:51

26 Tweet

49 Followers

59 Following

Zijia Liu

@xwzliuzijia

6 months ago

💥Time-R1 is here! Can a 3B LLM truly grasp time? 🤔 YES! Excited to share our new work, Time-R1: Towards Comprehensive Temporal Reasoning in LLMs 🚀 Check it out: 📖 Paper: arxiv.org/abs/2505.13508 💻 Code: github.com/ulab-uiuc/Time… #TemporalReasoning #RL #LLMs

thumb_up_off_alt13

chat_bubble_outline5

repeat3

shareShare

Cheng Qian

@qiancheng1231

6 months ago

📢 New Paper Drop: From Solving to Modeling! LLMs can solve math problems — but can they model the real world? 🌍 📄 arXiv: arxiv.org/pdf/2505.15068 💻 Code: github.com/qiancheng0/Mod… Introducing ModelingAgent, a breakthrough system for real-world mathematical modeling with LLMs.

thumb_up_off_alt103

chat_bubble_outline3

repeat30

shareShare

Jiaxun Zhang

@jiaxunzhang6

6 months ago

⚠️ Rogue AI scientists? 🛡️ SafeScientist rejects unsafe prompts for ethical discoveries. Check out paper ➡️ (arxiv.org/pdf/2505.23559) #AISafety #LLM #SafeAI #AI

thumb_up_off_alt6

chat_bubble_outline1

repeat7

shareShare

Xiusi Chen

@xiusi_chen

6 months ago

Can LLMs make rational decisions like human experts? 📖Introducing DecisionFlow: Advancing Large Language Model as Principled Decision Maker We introduce a novel framework that constructs a semantically grounded decision space to evaluate trade-offs in hard decision-making

thumb_up_off_alt55

chat_bubble_outline2

repeat15

shareShare

Peixuan Han (韩沛煊)

@peixuanhakhan

6 months ago

Super excited to begin my Applied Scientist Internship at Amazon, which is my first internship in the industry. I'm looking forward to conducting interesting and insightful research on the efficient reasoning of LLMs!

Super excited to begin my Applied Scientist Internship at <a href="/amazon/">Amazon</a>, which is my first internship in the industry.

I'm looking forward to conducting interesting and insightful research on the efficient reasoning of LLMs!

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Alexi Gladstone

@alexiglad

5 months ago

How can we unlock generalized reasoning? ⚡️Introducing Energy-Based Transformers (EBTs), an approach that out-scales (feed-forward) transformers and unlocks generalized reasoning/thinking on any modality/problem without rewards. TLDR: - EBTs are the first model to outscale the

thumb_up_off_alt1,1K

chat_bubble_outline32

repeat208

shareShare

Peixuan Han (韩沛煊)

@peixuanhakhan

3 months ago

We're pleased to announce that SafeSwitch has been accepted to EMNLP 2025! Many thanks to the collaborators for their help with this amazing project! Cheng Qian Xiusi Chen Yuji Zhang Denghui Zhang Heng Ji Paper: arxiv.org/pdf/2502.01042

thumb_up_off_alt18

chat_bubble_outline0

repeat4

shareShare

Denghui Zhang

@denghui_zhang

3 months ago

Interpretability: Understanding how AI models think youtu.be/fGKNUvivvnc?si… via YouTube Anthropic Anthropic’s new video dives into AI interpretability—how models think & why it matters 🧠✨ Our EMNLP paper SafeSwitch takes a similar path: leveraging internal activations

thumb_up_off_alt4

chat_bubble_outline0

repeat2

shareShare

Kunlun Zhu

@kunlun_zhu

2 months ago

🚨 New from UIUC x Stanford x AMD: AgentDebug: Where LLM Agents Fail and How They Can Learn From Failures 🔍🤖 LLM agents fail due to early errors that snowball—yet lack tools to trace & fix them. ✅ AgentDebug Debugger 📄 arxiv.org/abs/2509.25370 🛠️ github.com/ulab-uiuc/Agen…

thumb_up_off_alt7

chat_bubble_outline1

repeat4

shareShare

Jiaxuan You

@youjiaxuan

a month ago

Introducing Multi-Agent Evolve 🧠 A new paradigm beyond RLHF and RLVR: More compute → closer to AGI No need for expensive data or handcrafted rewards We show that an LLM can self-evolve — improving itself through co-evolution among roles (Proposer, Solver, Judge) via RL — all

thumb_up_off_alt463

chat_bubble_outline22

repeat101

shareShare