Zeming Chen (@eric_zemingchen) 's Twitter Profile
Zeming Chen

@eric_zemingchen

PhD Candidate, NLP Lab @EPFL; Previously: Research Intern @ Meta AI (FAIR) @allen_ai #AI #ML #NLP

ID: 1411700324310687746

Website: https://eric11eca.github.io · Joined: 04-07-2021 14:56:09

49 Tweets

520 Followers

279 Following

Antoine Bosselut (@abosselut) 's Twitter Profile Photo

Hey #NLProc folks, we had a lot of fun last year, so we're inviting guest lecturers again for our Topics in NLP course during the Fall 2024 semester at EPFL! More information here: t.ly/QMTCA Please share and RT!

Yu Fei (@walter_fei) 's Twitter Profile Photo

Alignment is necessary for LLMs, but do we need to train aligned versions for all model sizes in every model family? 🧐 We introduce 🚀 Nudging, a training-free approach that aligns any base model by injecting a few nudging tokens at inference time. 🌐 fywalter.github.io/nudging/
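The nudging idea above can be illustrated with a toy sketch. This is a simplified assumption about the mechanism, not the paper's code: a base model generates freely, and whenever its next-token confidence drops below a threshold, a few tokens from a small aligned model are injected instead. The function names, threshold, and toy "models" here are all illustrative.

```python
def nudging_generate(base_next, aligned_next, prompt, max_tokens=8, threshold=0.5):
    """Toy inference-time nudging loop.

    base_next / aligned_next: callables taking the token list so far and
    returning (next_token, probability). When the base model is uncertain
    (probability below threshold), the aligned model's token is used instead.
    """
    tokens = list(prompt)
    for _ in range(max_tokens):
        tok, prob = base_next(tokens)
        if prob < threshold:          # base model is uncertain: nudge
            tok, _ = aligned_next(tokens)
        tokens.append(tok)
        if tok == "<eos>":            # toy end-of-sequence marker
            break
    return tokens
```

In this sketch the aligned model only steers generation at low-confidence steps, which is what makes the approach training-free for the base model.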

Badr AlKhamissi (@bkhmsi) 's Twitter Profile Photo

🚨 New Paper!! How can we train LLMs using 100M words? In our babyLM paper, we introduce a new self-synthesis training recipe to tackle this question! 🍼💻 This was a fun project co-led by me, Yingtian Tang, Abdulkadir Gokce, w/ Hannes Mehrer & Martin Schrimpf 🧵⬇️

Angelika Romanou (@agromanou) 's Twitter Profile Photo

🚀 Introducing INCLUDE 🌍: A multilingual LLM evaluation benchmark spanning 44 languages! Contains *newly-collected* data, prioritizing *regional knowledge*. Setting the stage for truly global AI evaluation. Ready to see how your model measures up? #AI #Multilingual #LLM #NLProc

Beatriz Borges (@obiwit) 's Twitter Profile Photo

📘 Could ChatGPT get an engineering degree? Spoiler, yes! In our new PNASNews article, we explore how AI assistants like GPT-4 perform in STEM university courses, and on average they pass a staggering 91.7% of core courses. 🧵 #AI #HigherEd #STEM #LLMs #NLProc

Badr AlKhamissi (@bkhmsi) 's Twitter Profile Photo

🚨 New Paper! Can neuroscience localizers uncover brain-like functional specializations in LLMs? 🧠🤖 Yes! We analyzed 18 LLMs and found units mirroring the brain's language, theory of mind, and multiple demand networks! w/ Greta Tuckute, Antoine Bosselut, & Martin Schrimpf 🧵👇

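A neuroscience localizer, as hinted at above, selects units by contrasting responses to two stimulus conditions (e.g., sentences vs. control strings). Here is a minimal toy sketch of that selection step, assuming precomputed per-unit activations; the data layout and function name are invented for illustration, not taken from the paper.

```python
def localize_units(acts_lang, acts_control, top_k=2):
    """Toy localizer contrast over model units.

    acts_lang / acts_control: lists of trials, each trial a list of per-unit
    activations. Returns the indices of the top_k units whose mean activation
    is most selective for the language condition (largest mean difference).
    """
    n_units = len(acts_lang[0])

    def mean_activation(unit, trials):
        return sum(t[unit] for t in trials) / len(trials)

    # Contrast: mean(language) - mean(control), per unit.
    diffs = [(mean_activation(u, acts_lang) - mean_activation(u, acts_control), u)
             for u in range(n_units)]
    diffs.sort(reverse=True)
    return sorted(u for _, u in diffs[:top_k])
```

The same contrast-and-select recipe is how localizers are run on fMRI voxels; applying it to LLM units is what lets the two systems be compared on equal footing.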
Badr AlKhamissi (@bkhmsi) 's Twitter Profile Photo

🚨 New Preprint!! LLMs trained on next-word prediction (NWP) show high alignment with brain recordings. But what drives this alignment: linguistic structure or world knowledge? And how does this alignment evolve during training? Our new paper explores these questions. 👇🧵

Silin Gao (@silin_gao) 's Twitter Profile Photo

NEW PAPER ALERT: Generating visual narratives to illustrate textual stories remains an open challenge, due to the lack of knowledge to constrain faithful and self-consistent generations. Our #CVPR2025 paper proposes a new benchmark, VinaBench, to address this challenge.

Angelika Romanou (@agromanou) 's Twitter Profile Photo

If you're at ICLR 2026 this week, come check out our spotlight poster INCLUDE during the Thursday 3:00–5:30pm session! I will be there to chat about all things multilingual & multicultural evaluation. Feel free to reach out anytime during the conference. I'd love to connect!

Badr AlKhamissi (@bkhmsi) 's Twitter Profile Photo

🚨 New Preprint!! Thrilled to share with you our latest work: "Mixture of Cognitive Reasoners", a modular transformer architecture inspired by the brain's functional networks: language, logic, social reasoning, and world knowledge. 1/ 🧵👇

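The modular idea behind Mixture of Cognitive Reasoners can be sketched in toy form: tokens are routed to specialized expert modules (language, logic, social reasoning, world knowledge). Everything below, the keyword router, the expert functions, and their names, is invented for the demo; the actual paper uses learned transformer experts, not keyword rules.

```python
# Toy expert modules: each just tags its input so routing is visible.
EXPERTS = {
    "logic":    lambda tok: f"[logic]{tok}",
    "social":   lambda tok: f"[social]{tok}",
    "language": lambda tok: f"[language]{tok}",
}

# Hypothetical keyword-based router standing in for a learned router.
KEYWORDS = {"therefore": "logic", "believes": "social"}

def route(token):
    """Pick an expert for a token; default to the language module."""
    return KEYWORDS.get(token, "language")

def forward(tokens):
    """Run each token through its routed expert module."""
    return [EXPERTS[route(tok)](tok) for tok in tokens]
```

The appeal of such a modular split is interpretability: because each expert has a named specialty, one can inspect (or ablate) exactly which module handled which part of the input.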