UMassNLP (@umass_nlp) 's Twitter Profile
UMassNLP

@umass_nlp

Natural language processing group at UMass Amherst @umasscs. Led by @thompson_laure @MohitIyyer @brendan642 @andrewmccallum #nlproc

ID: 1427673854336438281

Link: https://nlp.cs.umass.edu/ · Joined: 17-08-2021 16:50:10

95 Tweets

1.1K Followers

375 Following

Tu Vu (@tuvllms) 's Twitter Profile Photo

I would also like to thank all of my labmates UMassNLP and friends at UMass Amherst, my mentors and collaborators at Google AI and Microsoft Research, and my family and friends all over the world who gave me support and encouragement throughout my Ph.D. journey.

Tu Vu (@tuvllms) 's Twitter Profile Photo

Moving forward, I will be splitting my time as a research scientist at Google AI and an assistant professor at Virginia Tech Computer Science. I will also be recruiting Ph.D. students starting in Fall 2024 to work on effective and efficient transfer learning in the era of LLMs. Please come join me!

Mohit Iyyer (@mohitiyyer) 's Twitter Profile Photo

Huge congrats @tuvuumass, who just became my first graduated PhD student!! He'll be starting his own group soon at Virginia Tech Computer Science, so prospective PhD applicants interested in topics like multitask/multimodal transfer learning or parameter-efficient LLM adaptation: definitely apply to work with him!

UMassNLP (@umass_nlp) 's Twitter Profile Photo

In Prize-winning Paper, UMass Amherst Computer Scientists Release Guidelines for Evaluating AI-Generated Text: UMass Amherst umass.edu/news/article/p…

brendan o'connor (@brendan642) 's Twitter Profile Photo

Reminder: abstract submissions for the terrific interdisciplinary Text as Data conference are due Aug 4! tada2023.org It's a great, small, non-archival conference for discussing emerging work with folks across the social sciences, humanities, and computer science.

Sheshera Mysore (@msheshera) 's Twitter Profile Photo

I’m at #sigir2023 and presenting our work on interactively controllable personalisation! Come listen in room 101 between 11:00 and 12:30!

Yapei Chang (@yapeichang) 's Twitter Profile Photo

Can LLMs summarize books exceeding their context windows? We design an evaluation protocol for collecting fine-grained human judgments on LLM-generated summaries & propose BooookScore, a reference-free automatic metric for narrative coherence. arxiv.org/abs/2310.00785 🧵below:

Tu Vu (@tuvllms) 's Twitter Profile Photo

🚨 New Google AI paper: 🤖 LLMs are game-changers, but can they help us navigate a constantly changing world? 🤔 As of now, our work shows that LLMs, no matter their size, struggle when it comes to fast-changing knowledge & false premises. 📰: arxiv.org/abs/2310.03214 👇

Mohit Iyyer (@mohitiyyer) 's Twitter Profile Photo

Evaluating the factuality of LLMs is tricky: what if they answer a question correctly but also generate a bunch of unrelated made-up stuff? We eval LLM answers to our new FreshQA dataset in both a "strict" (no made up stuff) and "relaxed" setting, see the paper for more!

Quoc Le (@quocleix) 's Twitter Profile Photo

A weakness of LLMs is that they don’t know recent events well. This is nice work from Tu developing a benchmark (FreshQA) to measure factuality of recent events, and a simple method to improve search integration for better performance on the benchmark.

Jason Wei (@_jasonwei) 's Twitter Profile Photo

Nice paper by Tu Vu on factuality in LLMs: arxiv.org/abs/2310.03214, enjoyed contributing in a minor role to it while I was at Google. The main takeaway for me is that most factuality benchmarks for LLMs don't really take into account the fact that many types of knowledge

Tu Vu (@tuvllms) 's Twitter Profile Photo

📢 Want to adapt your outdated LLM to our ever-changing world? 🌏 Check out our code for FreshPrompt at github.com/freshllms/fres…. Colab: tinyurl.com/freshprompt-co…. 🙏 We are grateful to SerpApi for their generous sponsorship of 5000 searches for FreshPrompt's users.

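The FreshPrompt repo and Colab above contain the actual implementation. As a rough illustration of the underlying idea (retrieved search snippets, ordered so the freshest evidence sits closest to the question, prepended to the query before it is sent to an LLM), here is a hypothetical sketch; the helper name `build_fresh_prompt` and the snippet fields are invented for illustration and are not the repo's API.

```python
# Illustrative sketch of the FreshPrompt idea: prepend dated search
# evidence to a question so an LLM can answer with up-to-date facts.
# Helper name and snippet format are hypothetical, not the real repo API.

def build_fresh_prompt(question, snippets, max_evidence=5):
    """Format retrieved search snippets as dated evidence, newest last
    (closest to the question), then append the question itself."""
    # Keep only the most recent snippets, sorted oldest -> newest so the
    # freshest evidence appears nearest the question in the prompt.
    ordered = sorted(snippets, key=lambda s: s["date"])[-max_evidence:]
    lines = []
    for s in ordered:
        lines.append(f"source: {s['source']}")
        lines.append(f"date: {s['date']}")
        lines.append(f"snippet: {s['text']}")
        lines.append("")  # blank line between evidence blocks
    lines.append(f"question: {question}")
    lines.append("answer:")
    return "\n".join(lines)

# Example with made-up evidence:
evidence = [
    {"source": "example.com", "date": "2023-01-10", "text": "Old fact."},
    {"source": "example.org", "date": "2023-10-01", "text": "New fact."},
]
prompt = build_fresh_prompt("What changed recently?", evidence)
```

In the actual system the snippets would come from a live search API (the tweet mentions SerpApi sponsorship); here they are hard-coded purely to show the prompt layout.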
Andrew Drozdov (@mrdrozdov) 's Twitter Profile Photo

✨ New Paper ✨ Deep dive on demonstrations to enhance LLM-based passage ranking 🚀 insights for pointwise ranking using query likelihood 🚀 huggingface.co/papers/2310.14…

Tu Vu (@tuvllms) 's Twitter Profile Photo

📢 🌟PhD Openings🌟: I am recruiting PhD students this cycle at Virginia Tech. If you want to dive into:
- in-context learning & tool-use LLMs
- instruction tuning
- parameter-efficient transfer learning
- few-shot learning
please apply by Dec 15! 👉tuvllms.github.io

Ankita Gupta (@anki98765) 's Twitter Profile Photo

Check out ezCoref, our open-source tool for easy coreference annotation across languages/domains. Demo: azkaban.cs.umass.edu:8877/tutorial Re-annotation study via ezCoref reveals interesting deviations from prior work. 📜aclanthology.org/2023.findings-… #CRAC2023 EMNLP 2023, Dec 6, 2:50PM 🧵👇

Mohit Iyyer (@mohitiyyer) 's Twitter Profile Photo

So proud to have hooded my first five PhDs today: Tu Vu, Kalpesh Krishna, Simeng Sun, Andrew Drozdov, and Nader Akoury. Now, they're either training LLMs at Google, Nvidia, and Databricks, or staying in academia at Virginia Tech and Cornell. Excited to watch their careers blossom!

Tu Vu (@tuvllms) 's Twitter Profile Photo

🚨 New Google DeepMind paper 🚨 We trained Foundational Large Autorater Models (FLAMe) on extensive human evaluations, achieving the best RewardBench perf. among generative models trained solely on permissive data, surpassing both GPT-4 & 4o. 📰: arxiv.org/abs/2407.10817 🧵:👇

Kalpesh Krishna (@kalpeshk2011) 's Twitter Profile Photo

Check out our new Google AI paper: we curate a mixture of 5M human judgments to train general-purpose foundational autoraters. Strong LLM-as-judge scores on RewardBench (87.8%), and highest perf among baselines on LLMAggreFact + 6 other benchmarks! 📰 arxiv.org/abs/2407.10817 👇

Prateek Yadav (@prateeky2806) 's Twitter Profile Photo

Given that I have been closely working with Tu Vu and Kalpesh Krishna, I can say that they are extremely well read and hard working, and this paper is amazing. People should definitely check out FLAMe, as it is going to be impactful.