Danqi Chen (@danqi_chen)'s Twitter Profile
Danqi Chen

@danqi_chen

Associate professor @princeton_nlp @princetonPLI @PrincetonCS. Previously: @facebookai, @stanfordnlp, @Tsinghua_Uni danqi-chen.bsky.social

ID: 96570221

Website: https://www.cs.princeton.edu/~danqic/ · Joined: 13-12-2009 15:38:10

404 Tweets

15.15K Followers

753 Following

Danqi Chen (@danqi_chen):

I’ve just arrived in Vancouver and am excited to join the final stretch of #NeurIPS2024!

This morning, we are presenting 3 papers 11am-2pm:
- Edge pruning for finding Transformer circuits (#3111, spotlight), Adithya Bhaskar
- SimPO (#3410), Yu Meng, Mengzhou Xia
- CharXiv (#5303)
Jiao Sun (@sunjiao123sun_):

Mitigating racial bias from LLMs is a lot easier than removing it from humans! 

Can’t believe this happened at the best AI conference, NeurIPS Conference

We have ethical reviews for authors, but missed it for invited speakers? 😡
Tianyu Gao (@gaotianyu1350):

Introducing MeCo (metadata conditioning then cooldown), a remarkably simple method that accelerates LM pre-training by simply prepending source URLs to training documents.

arxiv.org/abs/2501.01956
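The recipe described above is concrete enough to sketch. Below is a minimal illustration of the metadata-conditioning idea as stated in the tweet, not the official MeCo code: prepend each document's source URL during the main pre-training phase, then drop the prefix for a final cooldown phase. The field names and the phase flag are assumptions for illustration.

```python
# Minimal sketch of metadata conditioning then cooldown (not the official MeCo code).
# Assumption: each training document is a dict with "url" and "text" fields.
def format_for_pretraining(doc: dict, in_cooldown: bool) -> str:
    if in_cooldown:
        # Cooldown phase: train on plain text so the model does not rely on
        # metadata being present at inference time.
        return doc["text"]
    # Conditioning phase: prepend the source URL as a metadata prefix.
    return doc["url"] + "\n\n" + doc["text"]

doc = {"url": "en.wikipedia.org/wiki/Language_model",
       "text": "A language model assigns probabilities to sequences of tokens."}
print(format_for_pretraining(doc, in_cooldown=False))
```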
Xi Ye (@xiye_nlp):

🤔 Now most LLMs have >= 128K context sizes, but are they good at generating long outputs, such as writing 8K token chain-of-thought for a planning problem?
🔔Introducing LongProc (Long Procedural Generation), a new benchmark with 6 diverse tasks that challenge LLMs to synthesize
Manos Koukoumidis (@koukoumidis):

If AI isn’t truly open, it will fail us. We can’t lock our greatest invention yet inside a black box just so that a few can freely monetize it. AI needs its Linux moment, and so we started working towards it. This can only succeed if we all work together! #oumi #opensource

Yong Lin (@yong18850571):

🚀 Introducing Goedel-Prover: A 7B LLM achieving SOTA open-source performance in automated theorem proving! 🔥

✅ Improving +7% over previous open source SOTA on miniF2F
🏆 Ranking 1st on the PutnamBench Leaderboard
🤖 Solving 1.9X total problems compared to prior works on Lean
Yong Lin (@yong18850571):

🚀 Exciting news! Our Goedel-Prover paper is now live on arXiv: arxiv.org/pdf/2502.07640 🎉 

We're currently developing the RL version and have a stronger checkpoint than before (currently not included in the report)!🚀🚀🚀

Plus, we’ll be open-sourcing 1.64M formalized
Stanford NLP Group (@stanfordnlp):

Congratulations to Stanford NLP Group founder Christopher Manning for being elected to The National Academy of Engineering (NAE, National Academies) Class of 2025 for the development and dissemination of natural language processing methods.
Alex Wettig (@_awettig):

🤔 Ever wondered how prevalent a given type of web content is during LM pre-training?

In our new paper, we propose WebOrganizer which *constructs domains* based on the topic and format of CommonCrawl web pages 🌐

Key takeaway: domains help us curate better pre-training data! 🧵/N
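As a rough illustration of the "domains from topic and format" idea, here is a toy sketch; this is not WebOrganizer's actual taxonomy or code, and the keyword-based stubs below merely stand in for the learned domain classifiers the thread describes.

```python
# Toy sketch of "domain = (topic, format)" for web documents
# (not WebOrganizer's real classifiers; category names are made up).
def classify_topic(text: str) -> str:
    return "science" if "experiment" in text.lower() else "other"

def classify_format(text: str) -> str:
    return "tutorial" if "step 1" in text.lower() else "prose"

def assign_domain(page_text: str) -> tuple[str, str]:
    # Each page gets a (topic, format) domain; curation can then re-weight
    # documents per domain when mixing the pre-training data.
    return (classify_topic(page_text), classify_format(page_text))

print(assign_domain("Step 1: set up the experiment apparatus."))  # ('science', 'tutorial')
```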
Danqi Chen (@danqi_chen):

V. happy with this work! We’ve explored domain mixtures and quality filtering (including Alex’s previous work!), but what is even a “domain” in Common Crawl? Can we use these domains to better understand quality filters, and combine them for data curation? Cool visuals too!

Noam Razin (@noamrazin):

The success of RLHF depends heavily on the quality of the reward model (RM), but how should we measure this quality?

📰 We study what makes a good RM from an optimization perspective. Among other results, we formalize why more accurate RMs are not necessarily better teachers!
🧵
Princeton Laboratory for Artificial Intelligence (@princetonainews):

Welcome to the official X account for the Princeton Laboratory for Artificial Intelligence (“AI Lab” for short). Our mission is to support and expand the scope of AI research at Princeton University.

Follow our page for the latest updates on events, news, research, and more at the AI Lab
Howard Yen (@howardyen1):

Llama 4 Scout claims to support a context window of 10M tokens; the needle-in-a-haystack results are perfect, but can it handle real long-context tasks? We evaluate it on HELMET, our diverse and application-centric long-context benchmark, to be presented at #ICLR2025!

Princeton PLI (@princetonpli):

We are proud to highlight the work of the PLI students, post-docs, and faculty which is being showcased at this year's ICLR 2025: pli.princeton.edu/blog/2025/prin…