Yongliang Shen (@itricktreat) Twitter Tweets • TwiCopy

Yongliang Shen

@itricktreat

Reasoning & Multimodal learning & Agent | Assistant Professor at @ZJU_China | Previously @MSFTResearch

ID: 982838352385720320

Link: https://github.com/tricktreat

Joined: 08-04-2018 04:31:38

56 Tweets

88 Followers

637 Following

Henry Peng Zou

@zou_henry43378

8 months ago

ACL ARR and EMNLP 2025 do not notify co-authors when one of them fails to complete their reviewing duty, unlike NeurIPS or ACM MM, where all co-authors are informed and can follow up.

Likes: 11 | Replies: 1 | Retweets: 1

Henry Peng Zou

@zou_henry43378

8 months ago

Without such notice, first-author students, who bear the heaviest consequences, are punished for circumstances completely outside their control.

Likes: 15 | Replies: 2 | Retweets: 1

Neither the conference nor ARR informs the first author which co-author has been assigned reviews or who has failed to complete them. This is especially unfair to first authors who have diligently fulfilled their own reviewing obligations.

Likes: 14 | Replies: 1 | Retweets: 1

OpenAI

@openai

7 months ago

Sound on.

Likes: 25,25K | Replies: 1,1K | Retweets: 2,2K

James Zou

@james_y_zou

4 months ago

We recently organized #Agents4Science, the 1st conference where LLMs are both authors and reviewers🤖 It was an open experiment to assess how well AI can lead research and review papers. Today we report what we learned in Nature Biotechnology. Highlights in 🧵

Likes: 341 | Replies: 6 | Retweets: 77

OpenBMB

@openbmb

4 months ago

Can robots truly "think before acting"? 🤖 The conflict between sequential reasoning and high-speed motor control has finally been resolved. Today, we highlight a new multimodal breakthrough from OpenBMB and THUNLP: DeepThinkVLA —— a unified architecture that aligns reasoning

Likes: 86 | Replies: 4 | Retweets: 18

Shizhe Diao

@shizhediao

4 months ago

✨Introducing ProfBench. LLM eval shouldn’t be limited to math/code/short QA. Real work is: read professional docs → synthesize → produce long-form reports. ProfBench is a rubric-based benchmark written by domain experts (PhD/MBA) across 4 professional domains: Physics /

Likes: 134 | Replies: 8 | Retweets: 15

Sergio Paniego

@sergiopaniego

4 months ago

This super detailed tutorial by Pau Labarta Bajo is pure gold 🪙 "Fine-tuning a Small Language Model for browser control with GRPO and OpenEnv" LFM2-350M (Liquid AI) + BrowserGym (OpenEnv) + GRPO (TRL) for learning browser control 🤝 paulabartabajo.substack.com/p/fine-tuning-…

Likes: 215 | Replies: 4 | Retweets: 31

Shizhe Diao

@shizhediao

4 months ago

🥇 Nemotron-Orchestrator-8B takes #1 on GAIA benchmark! An 8B orchestrator coordinating intelligent tools and agents, trained via end-to-end RL, demonstrating that small language models are shaping the future of agentic AI 🚀 huggingface.co/spaces/gaia-be…

Likes: 179 | Replies: 5 | Retweets: 27

Elvis

@glitchphoton

2 months ago

x.com/i/article/2025…

Likes: 2,2K | Replies: 90 | Retweets: 224