Mian Zhang (@_guuuuuuuu_) 's Twitter Profile
Mian Zhang

@_guuuuuuuu_

CS PhD in UTD

ID: 1127440092102946816

calendar_today12-05-2019 05:07:19

26 Tweet

117 Followers

374 Following

Mian Zhang (@_guuuuuuuu_) 's Twitter Profile Photo

We find suboptimal agentic searches are often caused by LLMs’ limited awareness of their own knowledge boundaries and propose an uncertainty-aware variant of GRPO to help mitigate suboptimal searches. Check out the paper for more analysis and results!