Sida (Star) Li
@starli27496427
PhD @DSI_UChicago building @ProphetArena | LLM evaluations, prediction-powered inference & intersection between statistics x AI | Prev: @Berkeley_EECS.
ID: 1536650645117075456
http://listar2000.github.io 14-06-2022 10:04:09
12 Tweet
16 Takipรงi
49 Takip Edilen
Don't miss our last seminar of the year: 'The Interplay of Economic Thinking and Language Models: Vignettes and Lessons', live 18th of December (5pm GMT, 9am PT, 12pm ET) led by Haifeng Xu (The University of Chicago). Link below.
Been working on rLLM for the past few months ๐! This new version (and more to come) is definitely one step closer to ๐๐๐ข๐ค๐๐ง๐๐ฉ๐๐ฏ๐๐ฃ๐ ๐๐๐๐ฃ๐ฉ๐๐ ๐๐ ๐ฉ๐ง๐๐๐ฃ๐๐ฃ๐ -- any agent you can write down, rLLM will help you train it.
During the past 4 months since the debut of Prophet Arena, our amazing team has: 1. Added 1000+ forecasting events to the platform and supported more SOTA models. 2. Curated the "agent benchmark" where the competing agent performs end-to-end forecasts. More to come soon!
How to enjoy the best of two worlds: alignment from the aligned model and the diversity in the base model? Check out this simple but elegant "base-align"-collaboration work by Yichen (Zach) Wang and Chenghao Yang et al. ๐
๐ Huge congrats to Manan Roongta, Sijun Tan, and the Snorkel AI team on building this impressive Financial Analysis agent! Another strong example of how rLLM powers RL training across diverse reasoning tasks - from finance to beyond. Stay tuned for new rLLM features!
Train your ๐ฆOpenClaw๐ฆ simply by talking to it. Meet OpenClaw-RL. Host your model on our RL server, and your LLM gets optimized automatically. Use it anywhere. Keep it private. Make it more personal every day. We have fully open sourced everything. Come in and have fun!