Alexandra DeLucia (@alexir563) 's Twitter Profile
Alexandra DeLucia

@alexir563

@Google intern Summer 2024. @Sony intern Fall 2023. Computer Science PhD Student at @jhuclsp in the @mdredze group. @rollinscollege alum.

ID: 16019396

linkhttp://alexandradelucia.com calendar_today27-08-2008 23:30:31

414 Tweet

613 Followers

750 Following

JHU CLSP (@jhuclsp) 's Twitter Profile Photo

📢 MASC-SLL Call for Papers is out! 📢 🎓 Are you a student passionate about Speech, Language, and Learning? 🗣️✨ 🌟 Present your research at the Mid-Atlantic Student Colloquium on Speech, Language, and Learning (MASC-SLL) at Johns Hopkins Johns Hopkins Engineering on May 3rd! 🌟

Suzanna Sia (@suzyahyah) 's Twitter Profile Photo

1. Problem: Base LLMs can perform zero-shot Translation, but are poorly calibrated and sometimes “fail to translate”; i.e. they continue generating in the source language or produce empty generations. 2. Proposed solution 3. Limitations 4. Additional 5. Concurrent Work 1/5

Niyati Bafna (@bafnaniyati) 's Twitter Profile Photo

‼️Can we predict LLM performance on an unseen language if we know its linguistic relationships with a seen language? What makes an LRL difficult for an LLM?‼️ We present a study of LLM performance degradation as a function of linguistic distances... arxiv.org/pdf/2406.13718

‼️Can we predict LLM performance on an unseen language if we know its linguistic relationships with a seen language? What makes an LRL difficult for an LLM?‼️
We present a study of LLM performance degradation as a function of linguistic distances...
arxiv.org/pdf/2406.13718
Mark Dredze (@mdredze) 's Twitter Profile Photo

If you write a paper review and provide a list of weaknesses, and the author's response addresses those weaknesses, raise your score. Otherwise, you need to better explain your concerns.

If you write a paper review and provide a list of weaknesses, and the author's response addresses those weaknesses, raise your score. Otherwise, you need to better explain your concerns.
Alexandra DeLucia (@alexir563) 's Twitter Profile Photo

At what point in my CS PhD do I magically stop running into the `install package > run script > package not found > install package …` loop? 🙃

Isabel Cachola (@isabelcachola) 's Twitter Profile Photo

I’m presenting my Microsoft internship work today at #EMNLP2024! I’ll be in Riverfront from 2-3:30. Come say hi and chat about structured document generation and evaluation! TLDR: We unify the generation and evaluation of templatic views of documents. aclanthology.org/2024.findings-…

I’m presenting my Microsoft internship work today at #EMNLP2024! I’ll be in Riverfront from 2-3:30. Come say hi and chat about structured document generation and evaluation!

TLDR: We unify the generation and evaluation of templatic views of documents. 
aclanthology.org/2024.findings-…
Mark Dredze (@mdredze) 's Twitter Profile Photo

Congratulations to Carlos Aguirre Carlos Aguirre Pocasangre on successfully defending his PhD thesis: Improving the Fairness in Language Models for Decision-Making! Next stop? Amazon pocaguirre.com

Carlos Aguirre Pocasangre (@pocaguirre) 's Twitter Profile Photo

PhDone! 🎉 Thanks to everyone who was there along the way, family, friends, colleagues and advisors! Yes, I am a doctor now. Yes, I got a sword now (thanks friends!) Yes, we got Mark Dredze to knight me. What a nice finishing touch for this era!

PhDone! 🎉 Thanks to everyone who was there along the way, family, friends, colleagues and advisors! Yes, I am a doctor now. Yes, I got a sword now (thanks friends!) Yes, we got <a href="/mdredze/">Mark Dredze</a>  to knight me. What a nice finishing touch for this era!
MASC-SLL Conference (@masc_conference) 's Twitter Profile Photo

🎤Invited Talks at MASC! Roger Beaty Beyond Generation: Language Models as Creativity Evaluators Xiang Lorraine Li - Every Opinion Matters: Distributional and Long-tail Evaluation for LLMs Daphne Ippolito – Troubles with Training Data for Large Language Models #NLP #MASC2025

🎤Invited Talks at MASC! 
<a href="/Roger_Beaty/">Roger Beaty</a>  Beyond Generation: Language Models as Creativity Evaluators
<a href="/xiang_lorraine/">Xiang Lorraine Li</a> - Every Opinion Matters: Distributional and Long-tail Evaluation for LLMs
<a href="/daphneipp/">Daphne Ippolito</a>  – Troubles with Training Data for Large Language Models 
#NLP #MASC2025
Heyuan Huang (@yuancosette) 's Twitter Profile Photo

1/6🤖🩺 Medical answers are context-dependent, hypothetical, and subjective (If pain gets worse, see a doctor). Existing systems break here: they generate invalid claims or omit these sentences. MedScore solves this with context-aware claims and verifies against trusted sources.

1/6🤖🩺 Medical answers are context-dependent, hypothetical, and subjective (If pain gets worse, see a doctor). Existing systems break here: they generate invalid claims or omit these sentences. MedScore solves this with context-aware claims and verifies against trusted sources.
Heyuan Huang (@yuancosette) 's Twitter Profile Photo

I will present MedExpert, a clinician-annotated Factuality and Completeness dataset with Severity levels for Medical Chatbot Evaluation at ML4H in San Diego today. Happy to chat from Dec 1 to 3! See paper at: openreview.net/pdf?id=rkLAzDP… Mark Dredze Alexandra DeLucia JHU CLSP Sonal Joshi