
Yunzhen Feng
@feeelix_feng
PhD at CDS, NYU. Ex-Intern at FAIR @AIatMeta. Previously undergrad at @PKU1898
ID: 1523345547565879298
08-05-2022 16:54:28
87 Tweets
326 Followers
588 Following

Check out our poster tomorrow at 10am at the ICLR Bidirectional Human-AI Alignment workshop! We cover how on-policy preference sampling can be biased and our optimal response sampling for human labeling. NYU Center for Data Science AI at Meta Julia Kempe Yaqi Duan x.com/feeelix_feng/s…


🚨 Your RL only improves pass@1, not pass@k? 🚨 That's not a bug, it's a feature of the objective you're optimizing. You get what you optimize for. If you want better pass@k, you need to optimize for pass@k at training time. 🧵 How?
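For context on the metric the thread contrasts with pass@1: pass@k is the probability that at least one of k sampled generations is correct, and it is commonly estimated with the unbiased combinatorial estimator from the Codex evaluation literature. A minimal sketch (function name is illustrative, not from the thread):

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate: given n total samples of which
    c are correct, the probability that a random subset of k
    samples contains at least one correct one."""
    if n - c < k:
        # Fewer than k incorrect samples exist, so any subset
        # of size k must contain a correct sample.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# A model with a low per-sample success rate can still have high pass@k:
p1 = pass_at_k(10, 2, 1)  # pass@1 = 0.2
p8 = pass_at_k(10, 2, 8)  # pass@8 is much higher
```

This illustrates the thread's point: RL that maximizes expected single-sample reward pushes up pass@1, while pass@k also rewards diversity across the k samples, so it is a different objective.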
