Jack Jingyu Zhang @ NAACL🌵 (@jackjingyuzhang)'s Twitter Profile

PhD student @jhuclsp | Student researcher @Microsoft | Previously undergrad @JohnsHopkins @JHUCompSci.

ID: 1560437924

Website: https://jackz.io/ · Joined: 01-07-2013 12:29:46

155 Tweets

430 Followers

646 Following

Jack Jingyu Zhang @ NAACL🌵 (@jackjingyuzhang):

Just arrived in Albuquerque for #NAACL2025! Excited to connect and chat about LLM safety, alignment, reasoning, RLVR, and beyond. Feel free to reach out or DM if you’d like to meet up.

Jack Jingyu Zhang @ NAACL🌵 (@jackjingyuzhang):

Excited to present two papers today and tomorrow at #NAACL2025! Look out for our oral sessions:

TurkingBench: arxiv.org/abs/2403.11905 
📅 4-5:30pm, Thur, May 1
📍 Ballroom A (R&E.4)

Verifiable by Design: arxiv.org/abs/2404.03862 
📅 9-10:30am, Fri, May 2
📍 Ballroom A (HC.1)
Tianjian Li (@tli104):

Excited to be presenting our paper on training language models under heavily imbalanced data tomorrow at #NAACL2025! If you want to chat about data curation for both pre- and post-training, feel free to reach out!

📝 arxiv.org/abs/2410.04579
📅 11am-12:30pm, Fri, May 2
📍 Hall 3

Yining Lu (@yining__lu):

Quick reminder that our paper, Benchmarking Language Model Creativity: A Case Study on Code Generation, will be presented today!

📅 11AM-12:30PM, Fri, May 2
📍 Hall 3
📝 arxiv.org/abs/2407.09007
🎥 youtube.com/watch?v=v1cHyC…
Dongwei Jiang (@dongwei__jiang):

Now accepted by #ACL2025! Thrilled to see our paper also referenced in Lilian Weng's latest blog post on reasoning in LLMs! Check it out: lilianweng.github.io/posts/2025-05-…

Daniel Khashabi 🕊️ (@danielkhashabi):

There have been various efforts to disentangle "task learning" vs. "task recall" in LLMs. We've recently explored a fresh angle borrowed from cryptography: with substitution ciphers, we transform a given task into an equivalent but cryptic (no pun intended!!) form.

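A minimal sketch of the idea (illustrative only, not the paper's actual pipeline), assuming a simple letter-level substitution cipher applied to a task prompt; the seed and example prompt are made up for demonstration:

```python
import random
import string

def make_substitution_cipher(seed: int = 0):
    """Build a random one-to-one mapping over lowercase letters."""
    rng = random.Random(seed)
    letters = list(string.ascii_lowercase)
    shuffled = letters[:]
    rng.shuffle(shuffled)
    return str.maketrans(dict(zip(letters, shuffled)))

def encipher(text: str, table) -> str:
    """Apply the substitution; non-letters pass through unchanged."""
    return text.lower().translate(table)

table = make_substitution_cipher(seed=42)
task = "what is the capital of france?"
# The task is semantically unchanged, but its surface form is now cryptic,
# separating memorized surface patterns from genuine task learning.
print(encipher(task, table))
```
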
Anthony Peng (@realanthonypeng):

🚨 New work: We rethink how we finetune safer LLMs — not by filtering outputs after generation, but by tracking safety risk token by token during training.

We repurpose guardrail models like 🛡️ Llama Guard and Granite Guardian to score evolving risk across each response 📉 — giving
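A rough sketch of what token-by-token risk tracking could look like (an illustration, not the paper's method or any real guardrail API; `guardrail_score` is a hypothetical stand-in for a model such as Llama Guard):

```python
# Hypothetical stand-in for a guardrail model; a real system would call
# Llama Guard / Granite Guardian here. This toy stub flags one keyword.
def guardrail_score(prompt: str, partial_response: str) -> float:
    """Return a risk score in [0, 1] for the response generated so far."""
    return 1.0 if "bomb" in partial_response.lower() else 0.0

def token_level_risk(prompt: str, response_tokens: list[str]) -> list[float]:
    """Score every prefix of the response, yielding a per-token risk curve."""
    scores = []
    prefix = ""
    for tok in response_tokens:
        prefix += tok
        scores.append(guardrail_score(prompt, prefix))
    return scores

tokens = ["Sure", ",", " here", " is", " how", "..."]
print(token_level_risk("How do I stay safe online?", tokens))
```
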
Daniel Khashabi 🕊️ (@danielkhashabi):

Long-form inputs (e.g., needle-in-haystack setups) are a crucial aspect of high-impact LLM applications. While previous studies have flagged issues like positional bias and distracting documents, they've missed a key element: the size of the gold/relevant context.

In our
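For concreteness, a small sketch of a needle-in-haystack construction where the gold-context size is a controllable knob (an illustration of the setup being described, not the paper's benchmark code; all strings are made up):

```python
import random

def build_haystack(gold_sentences: list[str], filler: str,
                   total_sentences: int, seed: int = 0) -> str:
    """Embed a gold context of variable size inside distractor filler."""
    rng = random.Random(seed)
    n_filler = total_sentences - len(gold_sentences)
    sentences = [filler] * n_filler
    pos = rng.randrange(n_filler + 1)
    # Insert the whole gold span contiguously at a random position,
    # so gold size and gold position can be varied independently.
    return " ".join(sentences[:pos] + gold_sentences + sentences[pos:])

gold = ["The secret code is 4217."]  # a 1-sentence gold context
doc = build_haystack(gold, "Nothing to see here.", total_sentences=50)
print("4217" in doc)  # the needle is present somewhere in the haystack
```
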
Anthony Peng (@realanthonypeng):

🚨 Sharing our new #ACL2025NLP main paper!
🎥 Deploying video VLMs at scale? Inference compute is your bottleneck.

We study how to optimally allocate inference FLOPs across LLM size, frame count, and visual tokens.
💡 Large-scale training sweeps (~100k A100 hrs)
📊 Parametric
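As a back-of-envelope illustration of the trade-off (not the paper's parametric fit), one can compare rough decoder FLOPs across allocations using the common ~2N-FLOPs-per-token approximation; the configurations below are made-up examples:

```python
def vlm_inference_flops(params: float, frames: int,
                        tokens_per_frame: int, text_tokens: int = 64) -> float:
    """~2 * N FLOPs per token for an N-parameter decoder (rough rule)."""
    total_tokens = frames * tokens_per_frame + text_tokens
    return 2 * params * total_tokens

configs = [
    ("7B LLM, 32 frames, 144 tok/frame", 7e9, 32, 144),
    ("13B LLM, 16 frames, 144 tok/frame", 13e9, 16, 144),
    ("7B LLM, 16 frames, 288 tok/frame", 7e9, 16, 288),
]
# Same order of magnitude of compute, spent very differently across
# model size, frame count, and visual tokens per frame.
for name, n, f, t in configs:
    print(f"{name}: {vlm_inference_flops(n, f, t):.2e} FLOPs")
```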