Vijay V. (@vijaytarian) Twitter Tweets • TwiCopy

Vijay V.

@vijaytarian


Grad student at CMU. I do research on applied NLP. he/him

ID: 31239481

Link: http://www.cs.cmu.edu/~vijayv/

Joined: 14-04-2009 21:56:20

1.1K Tweets

580 Followers

468 Following

Seungone Kim @ NAACL2025

@seungonekim

a year ago

#NLProc Just because GPT-4o is 17 times more expensive than GPT-4o-mini, does that mean it generates synthetic data 17 times better? Introducing the AgoraBench, a benchmark for evaluating data generation capabilities of LMs.

185 likes, 2 replies, 49 retweets

Seungone Kim @ NAACL2025

@seungonekim

a year ago

🌟Our results show that LMs have distinct strengths! For example, while GPT-4o excels at generating new instances, Claude-3.5-Sonnet is better at refining existing instances. 🤯We also observe unexpected results that in some cases, LMs with stronger problem-solving abilities do

11 likes, 1 reply, 3 retweets

Huan Sun (OSU)

@hhsun1

a year ago

I was extremely fortunate to recruit @Xiangyue96 as my Ph.D. student in 2018 and witness his remarkable growth into a rising star in NLP and AI. You might know him for his recent contributions like MMMU and MAmmoTH. But to me, long before these influential projects, Xiang

215 likes, 3 replies, 16 retweets

Danish Pruthi

@danish037

5 months ago

At #ICML2025, introducing STAMP. A simple approach to verify whether your content (e.g., a dataset) is a part of the data used for training language models. ⤵️

103 likes, 3 replies, 8 retweets

Alisa Liu

@alisawuffles

5 months ago

If you're at ACL, join us for our tutorial on Synthetic Data in the Era of LLMs with Vijay V., Xiang Yue, Yizhong Wang, and Graham Neubig!! 🕑 2pm - 5:30pm 📍 Hall B


121 likes, 4 replies, 13 retweets

Graham Neubig

@gneubig

4 months ago

Yuchen Jin They didn't evaluate on 23 of the 500 instances though, so the actual score is: 74.9 * (500 - 23) / 500 = 71.4%, which is a few points below Claude Sonnet 4.
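
The adjustment in the tweet above can be checked with a few lines of Python. This is only a sketch of the arithmetic as stated, assuming the 23 unevaluated instances are counted as unsolved; the function name `adjusted_score` is hypothetical and not from the thread. The unrounded product is about 71.45, which the tweet reports as 71.4%.

```python
def adjusted_score(reported_pct: float, total: int, skipped: int) -> float:
    """Rescale a score reported on the evaluated subset to the full set.

    Assumption (ours, for illustration): every skipped instance
    counts as a failure when scoring against the full set.
    """
    evaluated = total - skipped
    return reported_pct * evaluated / total


# Numbers taken from the tweet: 74.9% reported, 500 instances, 23 skipped.
score = adjusted_score(74.9, 500, 23)
print(f"{score:.2f}%")  # prints 71.45%
```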

392 likes, 13 replies, 28 retweets

jack morris

@jxmnop

4 months ago

OpenAI hasn’t open-sourced a base model since GPT-2 in 2019. they recently released GPT-OSS, which is reasoning-only... or is it? turns out that underneath the surface, there is still a strong base model. so we extracted it. introducing gpt-oss-20b-base 🧵

6.6K likes, 151 replies, 458 retweets