Yongliang Shen (@itricktreat) 's Twitter Profile
Yongliang Shen

@itricktreat

Reasoning & Multimodal learning & Agent | Assistant Professor at @ZJU_China | Previously @MSFTResearch

ID: 982838352385720320

linkhttps://github.com/tricktreat calendar_today08-04-2018 04:31:38

56 Tweet

88 Followers

637 Following

Henry Peng Zou (@zou_henry43378) 's Twitter Profile Photo

ACL ARR and EMNLP 2025 does not notify co-authors when one of them fails to complete their reviewing duty, unlike NeurIPS or ACM MM, where all co-authors are informed and can follow up.

Henry Peng Zou (@zou_henry43378) 's Twitter Profile Photo

Without such notice, first-author students, who bear the heaviest consequences, are punished for circumstances completely outside their control.

Henry Peng Zou (@zou_henry43378) 's Twitter Profile Photo

Neither the conference nor ARR informs the first author which co-author has been assigned reviews or who has failed to complete them. This is especially unfair to first authors who have diligently fulfilled their own reviewing obligations.

James Zou (@james_y_zou) 's Twitter Profile Photo

We recently organized #Agents4Science, the 1st conference where LLMs are both authors and reviewers🤖 It was an open experiment to assess how well AI can lead research and review papers. Today we report what we learned in Nature Biotechnology Highlights in 🧵

We recently organized #Agents4Science, the 1st conference where LLMs are both authors and reviewers🤖 

It was an open experiment to assess how well AI can lead research and review papers.

Today we report what we learned in <a href="/NatureBiotech/">Nature Biotechnology</a> Highlights in 🧵
OpenBMB (@openbmb) 's Twitter Profile Photo

Can robots truly "think before acting"? 🤖 The conflict between sequential reasoning and high-speed motor control has finally been resolved. Today, we highlight a new multimodal breakthrough from OpenBMB and THUNLP: DeepThinkVLA —— a unified architecture that aligns reasoning

Can robots truly "think before acting"? 🤖 The conflict between sequential reasoning and high-speed motor control has finally been resolved.
Today, we highlight a new multimodal breakthrough from OpenBMB and THUNLP: DeepThinkVLA —— a unified architecture that aligns reasoning
Shizhe Diao (@shizhediao) 's Twitter Profile Photo

✨Introducing ProfBench. LLM eval shouldn’t be limited to math/code/short QA. Real work is: read professional docs → synthesize → produce long-form reports. ProfBench is a rubric-based benchmark written by domain experts (PhD/MBA) across 4 professional domains: Physics /

✨Introducing ProfBench.

LLM eval shouldn’t be limited to math/code/short QA. Real work is: read professional docs → synthesize → produce long-form reports. 
ProfBench is a rubric-based benchmark written by domain experts (PhD/MBA) across 4 professional domains: Physics /
Sergio Paniego (@sergiopaniego) 's Twitter Profile Photo

This super detailed tutorial by Pau Labarta Bajo is pure gold 🪙 "Fine-tuning a Small Language Model for browser control with GRPO and OpenEnv" LFM2-350M (Liquid AI) + BrowserGym (OpenEnv) + GRPO (TRL) for learning browser control 🤝 paulabartabajo.substack.com/p/fine-tuning-…

This super detailed tutorial by <a href="/paulabartabajo_/">Pau Labarta Bajo</a> is pure gold 🪙 "Fine-tuning a Small Language Model for browser control with GRPO and OpenEnv"

LFM2-350M (<a href="/liquidai/">Liquid AI</a>) + BrowserGym (OpenEnv) + GRPO (TRL) for learning browser control 🤝

paulabartabajo.substack.com/p/fine-tuning-…
Shizhe Diao (@shizhediao) 's Twitter Profile Photo

🥇 Nemotron-Orchestrator-8B takes #1 on GAIA benchmark! An 8B orchestrator coordinating intelligent tools and agents, trained via end-to-end RL, demonstraing small language models are shaping the future of agentic AI 🚀 huggingface.co/spaces/gaia-be…

🥇 Nemotron-Orchestrator-8B takes #1 on GAIA benchmark!

An 8B orchestrator coordinating intelligent tools and agents, trained via end-to-end RL, demonstraing small language models are shaping the future of agentic AI 🚀 
huggingface.co/spaces/gaia-be…