Eunsu Kim @ ICLR 2025 (@euns0o_kim) 's Twitter Profile
Eunsu Kim @ ICLR 2025

@euns0o_kim

MS student at @kaistcsdept

ID: 1738437155742187520

linkhttps://eunsu-k1m.github.io/ calendar_today23-12-2023 05:51:48

31 Tweet

164 Takipçi

144 Takip Edilen

Alice Oh (@aliceoh) 's Twitter Profile Photo

Excited to attend the first Conference on Language Modeling 🦙❤️🤩 Very open to talk to anyone about faculty/postdoc/phd opportunities at KAIST, as well as about multilingual multicultural LLM research. Come join the multilingual special session on Wednesday morning, and find my students

Wenda Xu (@wendaxu2) 's Twitter Profile Photo

I will give a talk at Naver lab (Europe) on Oct 17th, 5 PM (CEST) and 8 AM (PST). This talk is about "how to properly build a metric to evaluate AI-generated text?". I will dive into three main challenges in building a proper evaluation metric and present our proposed

I will give a talk at Naver lab (Europe) on Oct 17th, 5 PM (CEST) and 8 AM (PST).  This talk is about "how to properly build a metric to evaluate AI-generated text?".  I will dive into three main challenges in building a proper evaluation metric and present our proposed
Isabelle Augenstein (@iaugenstein) 's Twitter Profile Photo

📜Excited to share our comprehensive survey on cultural awareness in #LLMs! 🗺️ We reviewed 300+ papers across diverse modalities (language, vision-language, etc.) Siddhesh Pawar Junyeong Park @ NAACL 2025✈️ Jiho Jin Arnav Arora Junho Myung Inhwa Alice Oh #NLProc openreview.net/forum?id=3gg6G…

📜Excited to share our comprehensive survey on cultural awareness in #LLMs! 🗺️
We reviewed 300+ papers across diverse modalities (language, vision-language, etc.) 
<a href="/whoSiddheshp/">Siddhesh Pawar</a> <a href="/jjjunyeong/">Junyeong Park @ NAACL 2025✈️</a> <a href="/jin__jiho/">Jiho Jin</a> <a href="/rnav_arora/">Arnav Arora</a> <a href="/JunhoMyung_/">Junho Myung</a> <a href="/_inhwa_song/">Inhwa</a> <a href="/aliceoh/">Alice Oh</a> #NLProc
 openreview.net/forum?id=3gg6G…
Niklas Muennighoff (@muennighoff) 's Twitter Profile Photo

Interviews are the gold standard for evaluating humans; it's usually the final step in admissions & hiring. We introduce LLM-as-an-Interviewer to do the same to benchmark LLMs — a paradigm shift in evals I think. Led by Eunsu Kim @ACL2025 - I highly recommend admitting her for PhDs!

Seungone Kim @ NAACL2025 (@seungonekim) 's Twitter Profile Photo

When people hire someone new, they often conduct interviews to understand their traits and strengths in detail. In our new preprint, we extend single-turn interaction-based "LLM-as-a-Judge" to multi-turn interactions with "LLM-as-an-Interviewer"! Check out Eunsu Kim @ACL2025 's post!

Marktechpost AI Research News ⚡ (@marktechpost) 's Twitter Profile Photo

This AI Paper Introduces LLM-as-an-Interviewer: A Dynamic AI Framework for Comprehensive and Adaptive LLM Evaluation Researchers from KAIST, Stanford University, Carnegie Mellon University, and Contextual AI have introduced LLM-AS-AN-INTERVIEWER, a novel framework for evaluating

This AI Paper Introduces LLM-as-an-Interviewer: A Dynamic AI Framework for Comprehensive and Adaptive LLM Evaluation

Researchers from KAIST, Stanford University, Carnegie Mellon University, and Contextual AI have introduced LLM-AS-AN-INTERVIEWER, a novel framework for evaluating
Sheikh Shafayat ✈️ ICLR'25 🇸🇬 (@shafayat_sheikh) 's Twitter Profile Photo

Check out our latest work on self-improving LLMs, where we try to see if LLMs can utilize their internal self consistency as a reward signal to bootstrap itself using RL. TL;DR: it can, to some extent, but then ends up reward hacking the self-consistency objective. We try to see

Check out our latest work on self-improving LLMs, where we try to see if LLMs can utilize their internal self consistency as a reward signal to bootstrap itself using RL.

TL;DR: it can, to some extent, but then ends up reward hacking the self-consistency objective. We try to see
Eunsu Kim @ ICLR 2025 (@euns0o_kim) 's Twitter Profile Photo

I’ll be presenting our LLM-as-an-Interviewer work at #ACL2025! 📅 When: July 30 (wed) 11:00-12:30 📍 Where: Hall 4/5 arxiv.org/abs/2412.10424 Feel free to stop by ! Looking forward to discussing (m)LLM evaluation and more!

Haneul Yoo (@haneulyoo13) 's Twitter Profile Photo

🗣️ My undergrad mentee Sunwoo Kim is presenting his work at Generation Evaluation & Metrics Workshop (gem-benchmark.com/workshop) Please stop by and say hi 👋 📄 arxiv.org/abs/2506.21961 📅 15:15-15:30, Jul 31 📍 Hall C #ACL2025NLP #ACL2025 ACL 2025