Pengfei Liu (@stefan_fee) Twitter Tweets • TwiCopy

Pengfei Liu

@stefan_fee

+ Follow

Associate Prof. at SJTU, leading GAIR Lab (plms.ai) Co-founder of Inspired Cognition, Postdoc at @LTIatCMU, Previously FNLP, @MILAMontreal,

ID: 2818867628

linkhttp://pfliu.com/ calendar_today19-09-2014 02:34:24

450 Tweet

3,3K Takipçi

751 Takip Edilen

good girl

@goodgirlxsz

5 hours ago

🔥Telegram İfşa

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Sinclair Wang

@sinclairwang1

7 months ago

We are sharing this progress report at booth 260 poster in Hall3 of the IClR venue now.

thumb_up_off_alt12

chat_bubble_outline0

repeat1

shareShare

Pengfei Liu

@stefan_fee

7 months ago

This is for you AI at Meta

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

📣 New Discovery on Computer Use Agent With just 312 high-quality trajectories + open-source model, we've surpassed Claude 3.7 Sonnet (thinking) in computer use capabilities 🚀 ⚡️ In the new era of AI Agent training, many key questions remain: • Can open-source models + small

thumb_up_off_alt24

chat_bubble_outline0

repeat6

shareShare

Pengfei Liu

@stefan_fee

6 months ago

312 quality trajectories + open-source model beats Claude 3.7 Sonnet (thinking) in computer use 🚀 We answer the following important questions in our recent tech report: github.com/GAIR-NLP/PC-Ag… 1. Can open-source models + small high-quality datasets outperform top closed-source

thumb_up_off_alt36

chat_bubble_outline0

repeat6

shareShare

Pengfei Liu

@stefan_fee

6 months ago

The real breakthrough isn't better AI—it's breaking free from nature's constraints We're witnessing a paradigm shift from "passive adaptation" to "active construction" in AI training. 🌊 The old way: AI learns from whatever data naturally exists • Constrained by existing

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Pengfei Liu

@stefan_fee

6 months ago

nice discussion

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Pengfei Liu

@stefan_fee

5 months ago

What foundation models do we REALLY need for the RL era? And what pre-training data? Excited to share our work: OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling arxiv.org/pdf/2506.20512 ✨ Key breakthroughs: - First RL-focused mid-training approach - Llama

thumb_up_off_alt76

chat_bubble_outline0

repeat10

shareShare

Pengfei Liu

@stefan_fee

5 months ago

Tech history: Every time humanity hits a tech wall, we just wait for someone named Ilya to show up and save the world :) - Neural nets stuck? - Language models plateau? - ... (skip tons of stuff) - ... - Superintelligence coming?

thumb_up_off_alt6

chat_bubble_outline1

repeat0

shareShare

Ethan Chern

@ethanchern

5 months ago

FacTool has been accepted to COLM 2025 - two years after its arXiv debut! While the landscape of LLMs has changed a lot since then, tool-augmented LLMs and RAG are still among the most effective and practical approaches for detecting / mitigating hallucinations (ref:

thumb_up_off_alt12

chat_bubble_outline2

repeat5

shareShare

Yiqing Xie

@yiqingxienlp

5 months ago

RepoST was accepted to Conference on Language Modeling !!! See you in Montreal 🚀 #COLM2025

thumb_up_off_alt17

chat_bubble_outline0

repeat3

shareShare

Yujia Qin@ICLR2025

@tsingyoga

3 months ago

We can finally share UI-TARS-2🥳🥳 — a native GUI agent trained with multi-turn agent RL ⚡️⚡️Key highlights (all-in-one model!): 💻Computer Use: 47.5 OSWorld · 50.6 WindowsAgentArena 📱Phone Use: 73.3 AndroidWorld 🛜Browser Use: 88.2% Online-Mind2Web 🎮Gameplay: ~60% human

thumb_up_off_alt287

chat_bubble_outline10

repeat49

shareShare