USC NLP (@nlp_usc)'s Twitter Profile
USC NLP

@nlp_usc

The NLP group at @USCViterbi. @DaniYogatama+@_jessethomason_+@jieyuzhao11+@robinomial+@swabhz+@xiangrenNLP at @CSatUSC + researchers @USC_ICT, @USC_ISI.

ID: 1002211204897517568

Link: https://nlp.usc.edu/ | Joined: 31-05-2018 15:32:26

351 Tweets

3.3K Followers

363 Following

Huihan Li 🛩️ ICLR 2025 (@huihan_li)'s Twitter Profile Photo

Finding it hard to generate challenging evaluation data for LLMs? Check out our work 👇!

Introducing LINK 🔗, the first framework for systematically generating data in the long-tail distribution, guided by symbolic rules

arxiv.org/abs/2311.07237
w/ USC NLP (@nlp_usc) MOSAIC (@ai2_mosaic) 🧵⬇️
#NLProc

[1/n]
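
To make the pipeline concrete, here is a minimal, hypothetical sketch of the general recipe the thread describes (symbolic rule template → candidate instantiations → keep only the long-tail ones). The rule, fillers, and scores below are invented stand-ins; in LINK itself an LLM proposes fillers and a critic model verifies them, so treat this as an illustration of the control flow, not the paper's implementation.

```python
# Minimal sketch of rule-guided long-tail data generation (NOT the paper's
# actual pipeline). A toy frequency table stands in for the LLM proposer and
# critic: we instantiate a symbolic rule and keep only rare ("long-tail") fills.

RULE = "If a person is a {profession}, they may own a {item}."

# Hypothetical candidate fillers (in the paper these come from an LLM).
CANDIDATES = [
    ("chef", "knife set"), ("chef", "blowtorch"),
    ("beekeeper", "smoker"), ("astronomer", "telescope"),
    ("falconer", "leather gauntlet"), ("lawyer", "briefcase"),
]

# Stand-in plausibility scores (higher = more common / "head" distribution).
# A real system would score with an LM and verify with a critic model.
HEAD_SCORE = {
    ("chef", "knife set"): 0.95, ("lawyer", "briefcase"): 0.90,
    ("astronomer", "telescope"): 0.85, ("chef", "blowtorch"): 0.30,
    ("beekeeper", "smoker"): 0.20, ("falconer", "leather gauntlet"): 0.10,
}

def generate_long_tail(threshold: float = 0.4) -> list[str]:
    """Instantiate the rule, keeping only low-frequency (long-tail) fillers."""
    examples = []
    for profession, item in CANDIDATES:
        if HEAD_SCORE[(profession, item)] < threshold:
            examples.append(RULE.format(profession=profession, item=item))
    return examples

if __name__ == "__main__":
    for ex in generate_long_tail():
        print(ex)
```
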
USC NLP (@nlp_usc)'s Twitter Profile Photo

We're excited to attend #SocalNLP today! ICYMI, sunny southern California is a fantastic place to do #NLProc, so come check out what USC NLP [nlp.usc.edu] has been working on lately! And did we say we're hiring PhD students this fall? 🌴🏖️☀️

Brihi Joshi (@brihij)'s Twitter Profile Photo

Throwback to when Sean Ren 🔆 and our lab made our wishlist and dream research directions to discuss in our lab meeting. Very helpful in contextualising our work in the age of LLMs!! 🙌🏼 USC NLP is such a great place to do research 🫶

Linlu Qiu (@linluqiu)'s Twitter Profile Photo

How good are LMs at inductive reasoning? How are their behaviors similar to/contrasted with those of humans?

We study these via iterative hypothesis refinement. We observe that LMs are phenomenal hypothesis proposers, but they also behave as puzzling inductive reasoners:

(1/n)
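
For intuition, here is a toy, self-contained sketch of the propose-test-refine loop the thread describes. The rule space and refinement step are deliberately simplistic stand-ins (the study uses an LM as the hypothesis proposer); it only illustrates the iteration structure.

```python
# A minimal sketch of iterative hypothesis refinement. The "proposer" here
# enumerates simple arithmetic rules instead of querying an LM; the loop
# mirrors the setup described in the thread: propose a hypothesis, test it
# on observed examples, and refine using the failures.

from typing import Callable

Examples = list[tuple[int, int]]  # (input, output) pairs

def candidate_rules() -> list[tuple[str, Callable[[int], int]]]:
    """Hypothesis space: a few linear rules (an LM would propose these in
    natural language or code in the actual study)."""
    rules = []
    for a in range(1, 4):
        for b in range(0, 4):
            rules.append((f"y = {a}*x + {b}", lambda x, a=a, b=b: a * x + b))
    return rules

def failures(rule: Callable[[int], int], examples: Examples) -> Examples:
    """Return the examples the current hypothesis gets wrong."""
    return [(x, y) for x, y in examples if rule(x) != y]

def refine(examples: Examples, max_iters: int = 20):
    """Propose-test-refine loop: keep proposing until a rule fits everything."""
    for name, rule in candidate_rules()[:max_iters]:
        bad = failures(rule, examples)
        if not bad:
            return name  # hypothesis consistent with every observation
        # A real refinement step would feed `bad` back to the proposer.
    return None

if __name__ == "__main__":
    obs = [(1, 5), (2, 7), (3, 9)]  # hidden rule: y = 2x + 3
    print(refine(obs))  # -> "y = 2*x + 3"
```
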
Sean Ren (@xiangrennlp)'s Twitter Profile Photo

Arrived at NOLA for #NeurIPS2023 🔥 Exciting time to chat about limits/science of LLMs, “slow” reasoning & explainability. Join our posters for a fun discussion 🍻

Ads: USC CS is hiring tenure-track AI faculty + USC NLP is looking for strong PhD students. Talk to us!
Johnny Tian-Zheng Wei (@johntzwei)'s Twitter Profile Photo

To detect if your data was used for LLM pretraining, consider using data watermarks: arxiv.org/pdf/2402.10892… Detection can be framed as hypothesis testing (with statistical guarantees!) if you contributed multiple training documents and watermarked them before public release. 🧵
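
A rough sketch of the detection-as-hypothesis-testing recipe, under stated assumptions: `model_loss` is a hypothetical stand-in for querying the suspect model, and the rank-based p-value below is the generic exchangeability argument, not necessarily the paper's exact test statistic.

```python
# Sketch: before release you insert a random watermark string into your
# documents; later you compare the suspect model's loss on YOUR watermark
# against its loss on many random alternatives. If the model trained on your
# data, your watermark's loss should rank unusually low.

import random
import string

def random_watermark(length: int = 16) -> str:
    return "".join(random.choices(string.ascii_lowercase, k=length))

def model_loss(text: str) -> float:
    """Hypothetical stand-in for the suspect model's loss on `text`.
    A real audit would query the model; here memorized text scores lower."""
    MEMORIZED = "mysecretwatermark"  # pretend the model saw this in training
    return 0.5 if text == MEMORIZED else random.uniform(2.0, 4.0)

def p_value(published_watermark: str, num_null: int = 999) -> float:
    """Rank-based test: under the null (no training on your data), the true
    watermark's loss is exchangeable with random ones, so the p-value is its
    rank among num_null + 1 losses."""
    observed = model_loss(published_watermark)
    null_losses = [model_loss(random_watermark()) for _ in range(num_null)]
    rank = 1 + sum(loss <= observed for loss in null_losses)
    return rank / (num_null + 1)

if __name__ == "__main__":
    # Small p-value => evidence the model trained on the watermarked documents.
    print(f"p = {p_value('mysecretwatermark'):.4f}")
```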

Sean Ren (@xiangrennlp)'s Twitter Profile Photo

Absolutely thrilled to receive this honor. Rarely does a researcher have their first PhD publication win a Test of Time Award (for 10 years of its cumulative impact). I’m super grateful for the chance to collaborate with Xiao on this fun project, which turns out to be a…

Matthew Finlayson ✈️ NeurIPS (@mattf1n)'s Twitter Profile Photo

Wanna know gpt-3.5-turbo's embed size? We find a way to extract info from LLM APIs and estimate gpt-3.5-turbo’s embed size to be 4096. With the same trick we also develop 25x faster logprob extraction, audits for LLM APIs, and more!
📄 arxiv.org/abs/2403.09539
Here’s how 1/🧵
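
The core observation can be simulated in a few lines: final-layer logits are W·h with W of shape (vocab, d), so logit vectors collected across prompts span at most a d-dimensional subspace, and counting significant singular values recovers d. The snippet below fakes the API with random matrices (toy sizes, not gpt-3.5-turbo's), so it demonstrates the linear-algebra idea rather than the full attack.

```python
# Simulated version of the embed-size extraction idea: stack many logit
# vectors and estimate the hidden dimension as their numerical rank.

import numpy as np

rng = np.random.default_rng(0)

VOCAB, HIDDEN, N_QUERIES = 1000, 64, 200   # toy stand-ins (the paper estimates d≈4096)

W = rng.normal(size=(VOCAB, HIDDEN))       # unembedding matrix (unknown to the attacker)
H = rng.normal(size=(N_QUERIES, HIDDEN))   # hidden states from N different prompts
logits = H @ W.T                           # what full-logit API outputs would reveal

# Estimate the embed size as the numerical rank of the stacked logit matrix.
singular_values = np.linalg.svd(logits, compute_uv=False)
tol = singular_values.max() * max(logits.shape) * np.finfo(float).eps
estimated_d = int((singular_values > tol).sum())

print(estimated_d)  # -> 64, recovering HIDDEN without ever seeing W
```
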
Soumya Sanyal (@ssanyal8)'s Twitter Profile Photo

New paper 🚨 Looking for a strong, open-source entailment-verification model to verify your model generations for consistency? ✅ You can now use the 🤗 model huggingface.co/soumyasanyal/n… for this! Our finetuned FlanT5-xxl model can predict entailment errors better than GPT-3.5 and…
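
A hedged usage sketch with Hugging Face transformers: the model ID above is truncated, so this uses the public google/flan-t5-small as a runnable stand-in, and the prompt template is an illustrative guess rather than the released checkpoint's documented format.

```python
# Querying a seq2seq entailment verifier. Swap in the paper's released
# FlanT5-xxl checkpoint (truncated HF path above) and its documented prompt
# format for real use; this stand-in only shows the mechanics.

from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model_id = "google/flan-t5-small"  # stand-in for the finetuned verifier
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

premise = "All birds can fly. Penguins are birds."
hypothesis = "Penguins can fly."

# Illustrative verification prompt: ask whether the premise entails the hypothesis.
prompt = (f"Premise: {premise}\nHypothesis: {hypothesis}\n"
          "Does the premise entail the hypothesis? Answer yes or no.")

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=5)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```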

Xisen Jin (@xisenj)'s Twitter Profile Photo

๐ŸงLMs forget upstream knowledge when continuously fine-tuned. When fine-tuned on new data, can we forecast what upstream examples will be forgotten? ๐ŸฅณExcited to share our #ICML Spotlight paper on forecasting example forgetting! ๐Ÿ”—Project page: inklab.usc.edu/lm-forgetting-โ€ฆ

๐ŸงLMs forget upstream knowledge when continuously fine-tuned. When fine-tuned on new data, can we forecast what upstream examples will be forgotten?

๐ŸฅณExcited to share our #ICML Spotlight paper on forecasting example forgetting!

๐Ÿ”—Project page: inklab.usc.edu/lm-forgetting-โ€ฆ
Sean Ren (@xiangrennlp)'s Twitter Profile Photo

Congratulations to the Google DeepMind team on their best paper award at #ICML2024, & appreciate @afedercooper's shout-out to our concurrent paper 🙌 If you are into the topic of recovering model info through just its output logits, check out our paper led by Matthew Finlayson too!

Qinyuan Ye (👀Jobs) (@qinyuan_ye)'s Twitter Profile Photo

Introducing Lifelong ICL and Task Haystack, a new approach for evaluating long-context LMs, featuring ever-changing task streams that controllably fill the context window, and NIAH-style visualization for easy diagnosis.

📜 arxiv.org/abs/2407.16695

🧵
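
A minimal sketch of how such a "task haystack" prompt might be assembled: the toy tasks below are invented, and the resulting prompt would be fed to a real long-context model. This shows only the construction, not the benchmark's actual task suite or evaluation.

```python
# Build a stream of few-shot task demonstrations that fills the context, then
# probe an earlier task NIAH-style to test whether it is still usable.

TASKS = {
    "reverse": [("abc", "cba"), ("hello", "olleh")],
    "upper":   [("abc", "ABC"), ("hi", "HI")],
    "repeat":  [("ab", "abab"), ("x", "xx")],
}

def build_haystack(task_order: list[str]) -> str:
    """Concatenate few-shot demos for a stream of tasks, filling the context."""
    blocks = []
    for name in task_order:
        demos = "\n".join(f"Input: {x}\nOutput: {y}" for x, y in TASKS[name])
        blocks.append(f"Task: {name}\n{demos}")
    return "\n\n".join(blocks)

def probe(haystack: str, task: str, query: str) -> str:
    """NIAH-style probe: after the whole stream, re-test an earlier task."""
    return f"{haystack}\n\nTask: {task}\nInput: {query}\nOutput:"

if __name__ == "__main__":
    prompt = probe(build_haystack(["reverse", "upper", "repeat"]),
                   task="reverse", query="world")
    print(prompt)  # feed to a long-context LM; the correct completion is "dlrow"
```
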
Kaitlyn Zhou ✈️ CSCW, EMNLP! (@kaitlynzhou)'s Twitter Profile Photo

Excited to see everyone soon at #acl2024 in Bangkok!

I'll be presenting our work, Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty arxiv.org/abs/2401.06730

Poster session 3 on Aug 12 at 16:00! W/ Maarten Sap (he/him) (@MaartenSap), Jena Hwang (@JenaHwang2), Sean Ren (@xiangrenNLP)
Sean Ren (@xiangrennlp)'s Twitter Profile Photo

Arriving in Bangkok for ACL 2024 (@aclmeeting)! 😃

Will be sharing our recent work on logical scaffolding, model uncertainty expression & multi-hop entailment inference w/ folks USC NLP (@nlp_usc) + Kaitlyn Zhou (@KaitlynZhou) + friends Ai2 (@allen_ai)

I'm also helping on the <AI / ALL> summit
w/ Sahara AI 🔆 (@SaharaLabsAI)
👇👇
Sean Ren (@xiangrennlp)'s Twitter Profile Photo

Find us at the posters!

Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs, w/ Siyuan Wang, Yejin Choi, et al.

Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty, w/ Kaitlyn Zhou, Maarten Sap (he/him), et al.

Sean Ren (@xiangrennlp)'s Twitter Profile Photo

Join us at the co-located <AI / ALL> summit on Aug 15, with a social party in the evening! lu.ma/mxcx5bia

Co-hosted with SCB 10X and SambaNova Systems, sponsored by Amazon Web Services, with participation from folks at AI at Meta, @google, Cohere For AI, and Together AI.

Sahara AI (@saharalabsai)'s Twitter Profile Photo

Proud moment seeing our CEO & Co-Founder Sean Ren 🔆 (@xiangrenNLP) alongside his USC NLP (@nlp_usc) students at ACL 2024 (@aclmeeting).

Supporting the next generation of thought leaders in AI is exactly what drives us forward.
Huihan Li 🛩️ ICLR 2025 (@huihan_li)'s Twitter Profile Photo

Heading to #EMNLP2024, down to chat! Excited to present our work (Wed 10:30am) on systematic data generation in long-tail (low confidence) distribution for more challenging evaluation. 🧵👇

📰: arxiv.org/abs/2311.07237
💻: github.com/INK-USC/LINK
🔖: zenodo.org/records/101179…
Sean Ren (@xiangrennlp)'s Twitter Profile Photo

Proud of my student Huihan Li (@huihan_li) and intern Arnav presenting their #ICLR2025 work on attributing culture-conditioned generation to LLM’s training corpora.

Fun time meeting many friends. Ping me if you want to chat about model security, interpretability and human-LM interaction!
Sean Ren (@xiangrennlp)'s Twitter Profile Photo

Thrilled for the Best Paper Award runner-up at #NAACL2025! 🥳 Even when answers are incorrect, people may rely more on LLMs if they use warm and empathetic expressions! We analyze the risks of human over-reliance on LLM expressions of uncertainty: arxiv.org/pdf/2407.07950 w/…