Sida (Star) Li (@starli27496427) Twitter Tweets • TwiCopy

Sida (Star) Li

@starli27496427

+ Follow

PhD @DSI_UChicago building @ProphetArena | LLM evaluations, prediction-powered inference & intersection between statistics x AI | Prev: @Berkeley_EECS.

ID: 1536650645117075456

linkhttp://listar2000.github.io calendar_today14-06-2022 10:04:09

12 Tweet

16 Takipçi

49 Takip Edilen

Rohan Paul

@rohanpaul_ai

a year ago

Reasoning models often use excessively long thought processes, causing inefficient inference. This paper introduces ShorterBetter, a reinforcement learning method guiding models to find their optimal reasoning length autonomously. It samples multiple outputs, identifies the

thumb_up_off_alt32

chat_bubble_outline0

repeat2

shareShare

Prophet Arena

@prophetarena

8 months ago

🔮 Introducing Prophet Arena — the AI benchmark for general predictive intelligence. That is, can AI truly predict the future by connecting today’s dots? 👉 What makes it special? - It can’t be hacked. Most benchmarks saturate over time, but here models face live, unseen

thumb_up_off_alt1,1K

chat_bubble_outline85

repeat148

shareShare

Sida (Star) Li

@starli27496427

8 months ago

Starting to miss the UChicago campus now...

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Chenghao Yang

@chrome1996

7 months ago

Where is exploration most impactful in LLM reasoning? The initial tokens! They shape a sequence's entire semantic direction, making early exploration crucial. Our new work, Exploratory Annealed Decoding (EAD), is built on this insight. By starting with high temperature and

thumb_up_off_alt93

chat_bubble_outline3

repeat19

shareShare

rLLM

@rllm_project

6 months ago

🚀 Introducing rLLM v0.2 - train arbitrary agentic programs with RL, with minimal code changes. Most RL training systems adopt the agent-environment abstraction. But what about complex workflows? Think solver-critique pairs collaborating, or planner agents orchestrating multiple

thumb_up_off_alt136

chat_bubble_outline2

repeat28

shareShare

Sida (Star) Li

@starli27496427

6 months ago

I’m not an AI infra person, but somehow just got my async LoRA fix merged into Verl 😅 Spent a few days untangling async RL logic (from knowing nothing) -- no perfect understanding, but found (and fixed!) a sneaky bug. Proof that you can just do things...

thumb_up_off_alt5

chat_bubble_outline1

repeat0

shareShare

Sida (Star) Li

@starli27496427

5 months ago

Unfortunately missing #NeurIPS2025 and the SD sunshine 😭 But our first author Justin will be presenting "ShorterBetter" -- chat with him about efficient LLM reasoning! (And yes, this brilliant friend is applying for PhD this cycle!) arxiv.org/pdf/2504.21370

Unfortunately missing #NeurIPS2025 and the SD sunshine 😭
But our first author <a href="/Justin_6657/">Justin</a> will be presenting "ShorterBetter" -- chat with him about efficient LLM reasoning!
(And yes, this brilliant friend is applying for PhD this cycle!)
arxiv.org/pdf/2504.21370

thumb_up_off_alt3

chat_bubble_outline0

repeat1

shareShare

Cooperative AI Foundation

@coop_ai

5 months ago

Don't miss our last seminar of the year: 'The Interplay of Economic Thinking and Language Models: Vignettes and Lessons', live 18th of December (5pm GMT, 9am PT, 12pm ET) led by Haifeng Xu (The University of Chicago). Link below.

thumb_up_off_alt14

chat_bubble_outline1

repeat2

shareShare

Sida (Star) Li

@starli27496427

5 months ago

Been working on rLLM for the past few months 😀! This new version (and more to come) is definitely one step closer to 𝙙𝙚𝙢𝙤𝙘𝙧𝙖𝙩𝙞𝙯𝙞𝙣𝙜 𝙖𝙜𝙚𝙣𝙩𝙞𝙘 𝙍𝙇 𝙩𝙧𝙖𝙞𝙣𝙞𝙣𝙜 -- any agent you can write down, rLLM will help you train it.

thumb_up_off_alt7

chat_bubble_outline0

repeat2

shareShare

Sida (Star) Li

@starli27496427

5 months ago

Huge congrats on making Tinker fully public! 🚀 With rLLM (rLLM), integrating your (multi-)agent workflows with Tinker’s infrastructure is now super easy~ Docs + example here 👇 rllm-project.readthedocs.io/en/latest/exam…

thumb_up_off_alt10

chat_bubble_outline0

repeat1

shareShare

Sida (Star) Li

@starli27496427

4 months ago

During the past 4 months since the debut of Prophet Arena, our amazing team has: 1. Added 1000+ forecasting events to the platform and supported more SOTA models. 2. Curated the "agent benchmark" where the competing agent performs end-to-end forecasts. More to come soon!

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare

Sida (Star) Li

@starli27496427

4 months ago

How to enjoy the best of two worlds: alignment from the aligned model and the diversity in the base model? Check out this simple but elegant "base-align"-collaboration work by Yichen (Zach) Wang and Chenghao Yang et al. 👇

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Prophet Arena

@prophetarena

4 months ago

Happy New Year! Here are some AI Forecasts for 2026🔮 Most likely World Cup winner: 🇪🇸 Spain Spotify #1 artist: Taylor Swift (Qwen 3 235B says 100%) 75% - GTA 6 releases before end of 2026 (Grok-4) 65% - One Battle After Another wins Best Picture (Claude Sonnet 4) 55% - U.S.

thumb_up_off_alt12

chat_bubble_outline0

repeat1

shareShare

Sida (Star) Li

@starli27496427

2 months ago

🚀 Huge congrats to Manan Roongta, Sijun Tan, and the Snorkel AI team on building this impressive Financial Analysis agent! Another strong example of how rLLM powers RL training across diverse reasoning tasks - from finance to beyond. Stay tuned for new rLLM features!

thumb_up_off_alt5

chat_bubble_outline1

repeat2

shareShare

Yinjie Wang

@yinjiew2024

2 months ago

Train your 🦞OpenClaw🦞 simply by talking to it. Meet OpenClaw-RL. Host your model on our RL server, and your LLM gets optimized automatically. Use it anywhere. Keep it private. Make it more personal every day. We have fully open sourced everything. Come in and have fun!

thumb_up_off_alt734

chat_bubble_outline41

repeat90

shareShare