Mingxuan (Aldous) Li (@itea1001) Twitter Tweets • TwiCopy

Mingxuan (Aldous) Li

@itea1001

+ Follow

Student at the University of Chicago

ID: 1742435203543343104

linkhttps://itea1001.github.io/ calendar_today03-01-2024 06:38:31

4 Tweet

2 Takipçi

106 Takip Edilen

Haokun Liu

@haokunliu5280

a year ago

1/ 🚀 New Paper Alert! Excited to share: Literature Meets Data: A Synergistic Approach to Hypothesis Generation 📚📊! We propose a novel framework combining literature insights & observational data with LLMs for hypothesis generation. Here’s how and why it matters.

thumb_up_off_alt11

chat_bubble_outline1

repeat6

shareShare

Dang Nguyen

@divingwithorcas

8 months ago

1/n You may know that large language models (LLMs) can be biased in their decision-making, but ever wondered how those biases are encoded internally and whether we can surgically remove them?

thumb_up_off_alt12

chat_bubble_outline1

repeat8

shareShare

Haokun Liu

@haokunliu5280

7 months ago

🚀🚀🚀Excited to share our latest work: HypoBench, a systematic benchmark for evaluating LLM-based hypothesis generation methods!

thumb_up_off_alt13

chat_bubble_outline1

repeat9

shareShare

Mourad Heddaya

@mouradheddaya

7 months ago

🧑‍⚖️How well can LLMs summarize complex legal documents? And can we use LLMs to evaluate? Excited to be in Albuquerque presenting our paper this afternoon at NAACL HLT 2025 2025! We develop CaseSumm, a comprehensive dataset comprising 25K U.S. Supreme Court opinions and their

thumb_up_off_alt17

chat_bubble_outline1

repeat9

shareShare

Mingxuan (Aldous) Li

@itea1001

6 months ago

HypoEval evaluators (github.com/ChicagoHAI/Hyp…) are now incorporated into judges from Quotient AI — check it out at github.com/quotient-ai/ju…!

thumb_up_off_alt4

chat_bubble_outline0

repeat4

shareShare

Xiaoyan Bai

@elenal3ai

6 months ago

🚨 New paper alert 🚨 Ever asked an LLM-as-Marilyn Monroe who the US president was in 2000? 🤔 Should the LLM answer at all? We call these clashes Concept Incongruence. Read on! ⬇️ 1/n 🧵

thumb_up_off_alt18

chat_bubble_outline1

repeat9

shareShare

Shishir Patil

@shishirpatil_

5 months ago

🔥 At ICML 2025, we’re delighted to introduce BFCL V4 Agentic. As function-calling (also called tool-calling) forms the bed-rock of Agentic systems, BFCL V4 Agentic benchmark focuses on tool-calling in real-world agentic settings — including: 🔍 Web search with multi-hop

thumb_up_off_alt18

chat_bubble_outline1

repeat9

shareShare

Mingxuan (Aldous) Li

@itea1001

4 months ago

Excited to present our work at #ACL2025! Come by Poster Session 1 tomorrow, 11:00–12:30 in Hall X4/X5 — would love to chat!

thumb_up_off_alt1

chat_bubble_outline0

repeat1

shareShare