Hongru Wang (@wangcarrey) Twitter Tweets • TwiCopy

Ilya Sutskever

@ilyasut

2 years ago

if you value intelligence above all other human qualities, you’re gonna have a bad time

thumb_up_off_alt11,11K

chat_bubble_outline710

repeat1,1K

shareShare

🚀 How far can RL scaling take LLMs? Drop ProRLv2! 🔥With ProRLv2, we keep expanding LLM’s reasoning boundaries through 3,000+ RL steps over 5 domains and set a new state-of-the-art 🌟 among 1.5B reasoning models. 🔗 Full blog: research.nvidia.com/labs/lpr/prorl… 🤗Open model:

thumb_up_off_alt209

chat_bubble_outline4

repeat36

shareShare

Hongru Wang

@wangcarrey

4 months ago

Actually, we implemented this kind of capability in our AdaCtrl paper three months ago by injecting difficult-aware tags (i.e., easy, hard, adaptive) to trigger different reasoning behaviors of LLMs. Paper: arxiv.org/pdf/2505.18822

thumb_up_off_alt9

chat_bubble_outline0

repeat5

shareShare

NIK

@ns123abc

4 months ago

Google DeepMind just dropped a Nature paper: “A personal health large language model for sleep and fitness coaching.” > Gemini is better that doctors and trainers at sleep and fitness > paper shows huge benefit of AI personalization for health and long-form coaching Honestly

thumb_up_off_alt3,3K

chat_bubble_outline82

repeat405

shareShare

Heng Ji

@hengjinlp

3 months ago

Thanks so much again to the IJCAI25 organizers for the opportunity to share our work on AI+Science! I’m grateful to work with our amazing collaborators and students at Molecule Maker Lab Institute

thumb_up_off_alt28

chat_bubble_outline1

repeat5

shareShare

Cheng Qian

@qiancheng1231

3 months ago

📣 Our paper is accepted to Findings of EMNLP 2025! Many thanks to all the co-authors! 🌍 Math modeling is the perfect lens for agents to approach the real world challenges. Come and check how we do it: arxiv.org/pdf/2505.15068

thumb_up_off_alt15

chat_bubble_outline1

repeat4

shareShare

Hongru Wang

@wangcarrey

3 months ago

Congratulations to everyone! This was the *only paper* I worked on overnight during my PhD, but fortunately, I had a group of friends by my side. It is truly a remember memory.

thumb_up_off_alt9

chat_bubble_outline0

repeat0

shareShare

Heng Ji

@hengjinlp

3 months ago

Thanks again to the organizers! What inspired me most was Yoshua Bengio’s incredible kindness and humility. He welcomed different opinions with genuine openness. I could also see the same spirit of kindness and deep care for humanity reflected in his mentees, like Kyunghyun Cho

thumb_up_off_alt60

chat_bubble_outline0

repeat10

shareShare

Jyo Pari

@jyo_pari

3 months ago

For agents to improve over time, they can’t afford to forget what they’ve already mastered. We found that supervised fine-tuning forgets more than RL when training on a new task! Want to find out why? 👇

thumb_up_off_alt487

chat_bubble_outline5

repeat78

shareShare

Heng Ji

@hengjinlp

3 months ago

I'm hiring 1-2 new postdocs to work on AI for Scientific Discovery (especially on drug discovery and material discovery) and Science-Inspired AI (especially on scientific foundation models). Please drop me an email if you are interested or know someone who might be a great fit!

thumb_up_off_alt351

chat_bubble_outline0

repeat66

shareShare

Hongru Wang

@wangcarrey

3 months ago

When the agent can win Nobel Prize in Physics? 🧐

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Emre Can Acikgoz

@emrecanacikgoz

2 months ago

Excited to shared that ToolRL is accepted to NeurIPS Conference🎉 I was watching Denny Zhou’s Simon Institute talk live, where in one slide, he defined RL Fine-Tuning as “directly optimize what you want”. This very simple reframe completely shifted my perspective on training and the

Excited to shared that ToolRL is accepted to <a href="/NeurIPSConf/">NeurIPS Conference</a>🎉

I was watching <a href="/denny_zhou/">Denny Zhou</a>’s Simon Institute talk live, where in one slide, he defined RL Fine-Tuning as “directly optimize what you want”.

This very simple reframe completely shifted my perspective on training and the

thumb_up_off_alt221

chat_bubble_outline3

repeat22

shareShare

ACLRollingReview

@reviewacl

2 months ago

📢 Early submission for ACL 2026 via ARR Oct cycle is available to support authors facing potential visa delays. 📝 Early invitation letters possible for submit-ready work (not acceptance guarantees). ⚠️ Preliminary work risks rejection & Jan-cycle ineligibility. #NLProc #ARR

thumb_up_off_alt52

chat_bubble_outline4

repeat13

shareShare

Yu Su @#ICLR2025

@ysu_nlp

2 months ago

> working on semantic parsing in PhD > didn't even have its own track at ACL > it's a dead area, people say > had ~100 citations when graduating > but natural language programming is always the dream > 'Let machines understand human thinking. Don’t let humans think like machines'

thumb_up_off_alt609

chat_bubble_outline17

repeat24

shareShare

J.K. Rowling

@jk_rowling

2 months ago

I'm seeing quite a bit of comment about this, so I want to make a couple of points. I'm not owed eternal agreement from any actor who once played a character I created. The idea is as ludicrous as me checking with the boss I had when I was twenty-one for what opinions I should

thumb_up_off_alt125,125K

chat_bubble_outline6,6K

repeat19,19K

shareShare

Wenhu Chen

@wenhuchen

2 months ago

Such a good place to spend the weekend with the family.

thumb_up_off_alt29

chat_bubble_outline2

repeat1

shareShare

Hongru Wang

@wangcarrey

2 months ago

Safety is the most important part if we want to use agent in daily life.

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Zhenhailong Wang

@zhenhailongw

2 months ago

Multimodal conversational agents struggle to follow complex policies, which also impose a fixed computational cost. We ask: 👉 How can we achieve stronger policy-following behavior without having to include policies in-context? 🌐: mikewangwzhl.github.io/TriMPI/ 🧵1/3

thumb_up_off_alt37

chat_bubble_outline1

repeat12

shareShare

Heng Ji

@hengjinlp

2 months ago

Super proud of this work Zhenhailong Wang did at Amazon

thumb_up_off_alt10

chat_bubble_outline0

repeat2

shareShare

Mengdi Wang

@mengdiwang10

2 months ago

🚀 Introducing LabOS: The AI-XR Co-Scientist A system that sees, understands, and works with humans in real-world labs. 👁️ Egocentric vision & extended reality 🧠 LLM reasoning & hypothesis generation 🤖 Real-time guidance & multi-modal human-AI collaboration From observation →

thumb_up_off_alt136

chat_bubble_outline9

repeat24

shareShare

Hongru Wang

Ilya Sutskever

Shizhe Diao

Hongru Wang

NIK

Heng Ji

Cheng Qian

Hongru Wang

Heng Ji

Jyo Pari

Heng Ji

Hongru Wang

Emre Can Acikgoz

ACLRollingReview

Yu Su @#ICLR2025

J.K. Rowling

Wenhu Chen

Hongru Wang

Zhenhailong Wang

Heng Ji

Mengdi Wang