Lichang Chen (@lichangchen2) Twitter Tweets • TwiCopy

Lichang Chen

a year ago

Interesting claim🤣 which aligns with some claims I heard recently: 100% credit of the reasoning LLM should be assigned to pretraining!

thumb_up_off_alt125

chat_bubble_outline0

repeat13

shareShare

Wei Xiong

@weixiong_1

a year ago

Surprised by the small performance gap between RAFT and Reinforce/GRPO. We may need more fine-grained negative signals to better guide learning.🧐

thumb_up_off_alt93

chat_bubble_outline2

repeat8

shareShare

Such a crazy world.. As an LLM researcher, we are fighting for substituting ourselves! I kinda think AGI should be achieved in two stages: 1. since the world simulator is so hard to build up, we should first build up SWE/researcher simulator, then RL and scaling up. 2. With that,

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Kelvin Guu

@kelvin_guu

a year ago

Big congrats to my teammates at Google DeepMind on the launch of Deep Think! Shout-out to Archit Sharma and Thang Luong who battled hard for this :) blog.google/technology/goo…

thumb_up_off_alt77

chat_bubble_outline1

repeat6

shareShare

Boqing Gong

@boqinggo

10 months ago

Join us if you are at CVPR and can get up early. :-) I'm giving a talk, "BabyVLM: Democratizing Pretraining of Vision Large Language Models" tomorrow (Wednesday). * 9:30AM. * Room 101B. * Computer Vision in the Wild Workshop

thumb_up_off_alt17

chat_bubble_outline0

repeat1

shareShare

Tianhe Yu

@tianheyu

10 months ago

Our Gemini 2.5 Pro 06-05 🦁becomes GA today (the stable version of Gemini 2.5 Pro). Looking forward to what the community is building with it!

thumb_up_off_alt66

chat_bubble_outline0

repeat4

shareShare

JB Alayrac

@jalayrac

10 months ago

Zephyr Video input works pretty well in gemini 2.5 :).

thumb_up_off_alt11

chat_bubble_outline0

repeat2

shareShare

Andrew M. Dai @ ICLR

@iamandrewdai

10 months ago

It turns out LLM data is more like oil than coal, if you refine it properly. Congratulations to the contributors of the many researcher-years of work!

thumb_up_off_alt48

chat_bubble_outline1

repeat5

shareShare

Yu Xiang

@yuxiang_irvl

9 months ago

“As a PHD student, your job is not publishing a paper every quarter. Focus on a problem in deep understanding and solve it in years under the protect of your adviser” from Russ Tedrake #RSS2025

thumb_up_off_alt830

chat_bubble_outline17

repeat70

shareShare

Simeng (Sophia) Han

@hansineng

9 months ago

Excited to see more investigation into LLM creativity. We have some pioneering work on this topic as well: Creativity or Brute Force? Using Brainteasers as a Window into the Problem-Solving Abilities of Large Language Models. arxiv.org/pdf/2505.10844.

thumb_up_off_alt17

chat_bubble_outline0

repeat6

shareShare

Noam Brown

@polynoamial

9 months ago

You don’t need a PhD to be a great AI researcher. Even OpenAI’s Chief Research Officer doesn’t have a PhD.

thumb_up_off_alt3,3K

chat_bubble_outline202

repeat216

shareShare

Lichang Chen

@lichangchen2

9 months ago

Had an interesting discussion with a former Terp (UMD Department of Computer Science alumni) at NYC about Quant vs. Tech in the AGI era. We first discussed how programmers kill themselves via open-source everything, which become the fuel of the LLMs😂😂 Compared to Tech, Quant Trading is more isolated

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Lichang Chen

@lichangchen2

9 months ago

I am really excited about how we can achieve the second phase of AGI, i.e., AI can learn new tasks as quickly as generalist human can. I believe the new paradigm should be learning from the infinite context bc. the context is LLM’s memory and it can include more nuanced natural

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Lichang Chen

@lichangchen2

8 months ago

We need real-world metrics to evaluate them, especially how they can contribute to the society: contributions to GDPs; help us get promotions, etc. Also, how they can help push the scientific boundaries is an important metric, e.g., help solve Millennium Prize Problems could be

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Lichang Chen

@lichangchen2

7 months ago

It’s really a big courage to admit the deficiency! The opponent who realizes his weakness quickly is intimidating so that I do believe OAI can catch up soon.

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Lichang Chen

@lichangchen2

6 months ago

I think here is the thing: I am assuming the pressure of comparison is from investors. The temptation of the promotion and money is the root of forgetting the mission but not the release of products. I don’t think a real AI researcher like IIya will care about these things.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Lichang Chen

@lichangchen2

5 months ago

I had some interesting discussions about using prompt optimization for agent memory recently with some interns AI at Meta and co-founder Databricks, which reminds me of my prompt opt for LLMs paper published in 2023 (arxiv.org/abs/2306.03082), which is one of the earliest papers in

thumb_up_off_alt56

chat_bubble_outline3

repeat3

shareShare

Lichang Chen

@lichangchen2

5 months ago

I bet automated researchers can get 8,8,8,8 on next ICLR with drastic scientific breakthrough and expert-level presentations! Maybe it’s time to have a special workshop/conference track for pure-AI submissions, i.e., code, analysis, paper are all generated by AI! ICLR 2026

thumb_up_off_alt7

chat_bubble_outline1

repeat0

shareShare

Lichang Chen

@lichangchen2

4 months ago

Heading to NeurIPS to present my creative reasoning work!! I am open to discuss ideas on how we can equip the test-time algorithms with creativities and advance the scientific breakthroughs! Feel free to DM if you’d like to have a coffee!

thumb_up_off_alt25

chat_bubble_outline2

repeat1

shareShare

Lichang Chen

@lichangchen2

2 months ago

It’s the first time that I think OpenAI is actually quite Open.

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare