Ting-Yun Chang (@charlottetyc)'s Twitter Profile
Ting-Yun Chang

@charlottetyc

PhD student @CSatUSC @nlp_usc

ID: 1432782439944908801

Website: https://terarachang.github.io/
Joined: 31-08-2021 19:09:08

90 Tweets

481 Followers

365 Following

Chao-Wei Huang (@cwhuang_wh)'s Twitter Profile Photo

Excited to share that our paper, "FactAlign: Long-form Factuality Alignment of Large Language Models," has been accepted to #EMNLP2024 Findings!

📝 Paper: arxiv.org/abs/2410.01691
💻 Code: github.com/MiuLab/FactAli…

FactAlign significantly improves the factuality of LLMs' (1/2)
Deqing Fu (@deqingfu)'s Twitter Profile Photo

It's cool to see Google DeepMind's new research showing findings similar to what we found back in April.

IsoBench (isobench.github.io, accepted to the Conference on Language Modeling 2024) was curated to show the performance gap across modalities and multimodal models' preference for the text modality.
Yuqing Yang (@yyqcode)'s Twitter Profile Photo

In Miami for #EMNLP2024! Come check out our findings poster, Weak-to-Strong Reasoning, on Wednesday at 10:30am.
Super excited for my first in-person conference. Looking forward to connecting and chatting about reasoning, hallucination, self-correction, and all things LLMs! 🌴🌴
Chao-Wei Huang (@cwhuang_wh)'s Twitter Profile Photo

I'll be presenting 2 papers at #EMNLP2024. Happy to chat!

🗓️ Nov 12 (Tue) 11:00-12:30 - Riverfront Hall
📄 PairDistill: Pairwise Relevance Distillation for Dense Retrieval
🗓️ Nov 13 (Wed) 16:00-17:30 - Jasmine
📄 FactAlign: Long-form Factuality Alignment of Large Language Models
Robin Jia (@robinomial)'s Twitter Profile Photo

I'll be at #NeurIPS2024! My group has papers analyzing how LLMs use Fourier Features for arithmetic (Tianyi Zhou) and how Transformers learn higher-order optimization for ICL (Deqing Fu), plus workshop papers on backdoor detection (Jun Yan) and LLMs + PDDL (Wang Bill Zhu).

Qinyuan Ye (👀Jobs) (@qinyuan_ye)'s Twitter Profile Photo

I'll present a poster for Lifelong ICL and Task Haystack at #NeurIPS2024!

⏰ Wednesday 11am-2pm
📍 East Exhibit Hall A-C #2802
📜 arxiv.org/abs/2407.16695

My co-first author Xiaoyue Xu is applying to PhD programs and I am looking for jobs in industry! Happy to connect at NeurIPS!

Tianyi Zhou (@tianyi_zhou12)'s Twitter Profile Photo

Billion-parameter LLMs still struggle with simple arithmetic? 📞 FoNE (Fourier Number Embedding) tackles this problem. By mapping numbers directly into Fourier space, it bypasses tokenization and significantly improves both numerical accuracy and efficiency.
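
A minimal sketch of the idea described above (an editor's illustration, not the paper's implementation; the decimal periods and feature layout here are assumptions, so see the paper for the actual FoNE recipe):

```python
import numpy as np

def fourier_number_embedding(x: float, num_periods: int = 5) -> np.ndarray:
    """Embed a number as (cos, sin) features at decimal periods 10^1..10^k.

    Each (cos, sin) pair encodes x modulo 10^i exactly, so the number is
    represented as one fixed-size vector rather than a sequence of digit
    tokens -- the "bypasses tokenization" part of the tweet.
    """
    feats = []
    for i in range(1, num_periods + 1):
        period = 10.0 ** i
        angle = 2 * np.pi * x / period
        feats.extend([np.cos(angle), np.sin(angle)])
    return np.array(feats)

# One vector per number, independent of how many digits it has:
print(fourier_number_embedding(12345).shape)  # (10,)
```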

Abrar Anwar (@_abraranwar)'s Twitter Profile Photo

All these VLAs allow robots to do more tasks, but when you're physically testing many policies, it's hard to eval on every task!

We take advantage of shared information between tasks and within policies to actively test multi-task robot policies!
1/7 🧵

arxiv.org/abs/2502.09829
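
The active-testing idea above can be sketched as an uncertainty-driven evaluation loop (an editor's illustration, not the paper's method; the paper shares information across tasks and policies, while this toy version treats each (policy, task) cell independently):

```python
import numpy as np

rng = np.random.default_rng(0)
n_policies, n_tasks = 3, 5
true_success = rng.uniform(0.2, 0.9, size=(n_policies, n_tasks))  # unknown in practice

# Beta(alpha, beta) posterior over each (policy, task) success rate.
alpha = np.ones((n_policies, n_tasks))
beta = np.ones((n_policies, n_tasks))

for _ in range(100):  # budget of 100 physical trials instead of exhaustive testing
    # Spend the next trial on the cell whose posterior variance is largest.
    var = (alpha * beta) / ((alpha + beta) ** 2 * (alpha + beta + 1))
    p, t = np.unravel_index(np.argmax(var), var.shape)
    success = rng.random() < true_success[p, t]  # run one physical rollout
    alpha[p, t] += success
    beta[p, t] += 1 - success

print("estimated success rates:\n", alpha / (alpha + beta))
```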
Stanford NLP Group (@stanfordnlp)'s Twitter Profile Photo

For this week’s NLP Seminar, we are thrilled to host Yun-Nung Vivian Chen to talk about Optimizing Interaction and Intelligence — Multi-Agent Simulation and Collaboration for Personalized Marketing and Advanced Reasoning!

When: 3/6 Thurs 11am PT
Non-Stanford affiliates registration form
Wang Bill Zhu (@billjohn1235813)'s Twitter Profile Photo

At NAACL HLT 2025 this week! I’ll be presenting our work on LLM domain induction with Jesse Thomason on Thu (5/1) at 4pm in Hall 3, Section I.

Would love to connect and chat about LLM planning, reasoning, AI4Science, multimodal stuff, or anything else. Feel free to DM!
Stanford NLP Group (@stanfordnlp)'s Twitter Profile Photo

For this week’s NLP Seminar, we are thrilled to host Deqing Fu to talk about Closing the Modality Gap: Benchmarking and Improving Visual Understanding in Multimodal LLMs!

When: 5/22 Thurs 11am PT
Non-Stanford affiliates registration form (closed at 9am PT on the talk day):
Yuqing Yang (@yyqcode)'s Twitter Profile Photo

🧐When do LLMs admit their mistakes when they should know better?

In our new paper, we define this behavior as retraction: the model indicates that its generated answer was wrong.
LLMs can retract—but they rarely do.🤯

arxiv.org/abs/2505.16170

👇🧵
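
A toy illustration of the retraction behavior defined above (the cue list is hypothetical and far cruder than the paper's actual annotation):

```python
# Naive keyword check: does a continuation explicitly flag the
# model's own earlier answer as wrong?
RETRACTION_CUES = (
    "i was wrong",
    "my answer was incorrect",
    "my previous answer was wrong",
    "i made a mistake",
)

def looks_like_retraction(continuation: str) -> bool:
    text = continuation.lower()
    return any(cue in text for cue in RETRACTION_CUES)

print(looks_like_retraction("Wait, my answer was incorrect."))  # True
print(looks_like_retraction("Let me double-check the steps."))  # False
```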