Tom Sheffer (@tomsheffer17807) 's Twitter Profile
Tom Sheffer

@tomsheffer17807

M.D candidate | Computer Science Master's candidate | @Google Research Software Engineer intern in Neuroscience.

ID: 1744757527978680320

calendar_today09-01-2024 16:26:44

23 Tweet

108 Followers

459 Following

Amir Taubenfeld (@taubenfeldamir) 's Twitter Profile Photo

New Preprint πŸŽ‰ LLM self-assessment unlocks efficient decoding βœ… Our Confidence-Informed Self-Consistency (CISC) method cuts compute without losing accuracy. We also rethink confidence evaluation & contribute to the debate on self-verification. arxiv.org/abs/2502.06233 1/8πŸ‘‡

New Preprint πŸŽ‰

LLM self-assessment unlocks efficient decoding βœ…

Our Confidence-Informed Self-Consistency (CISC) method cuts compute without losing accuracy.

We also rethink confidence evaluation & contribute to the debate on self-verification.

arxiv.org/abs/2502.06233
1/8πŸ‘‡
Zorik Gekhman (@zorikgekhman) 's Twitter Profile Photo

Now accepted to #COLM2025! We formally define hidden knowledge in LLMs and show its existence in a controlled study. We even show that a model can know the answer yet fail to generate it in 1,000 attempts 😡 Looking forward to presenting and discussing our work in person.

Eliya Habba (@eliyahabba) 's Twitter Profile Photo

Presenting my poster : πŸ•ŠοΈ DOVE - A large-scale multi-dimensional predictions dataset towards meaningful LLM evaluation, Monday 18:00 Vienna, #ACL2025 Come chat about LLM evaluation, prompt sensitivity, and our 250M COLLECTION OF MODEL OUTPUTS!

Presenting my poster :
πŸ•ŠοΈ DOVE - A large-scale multi-dimensional predictions dataset towards meaningful LLM evaluation, Monday 18:00 Vienna, 
#ACL2025

Come chat about LLM evaluation, prompt sensitivity, and our 250M COLLECTION OF MODEL OUTPUTS!
Tom Sheffer (@tomsheffer17807) 's Twitter Profile Photo

Presenting our CISC paper tomorrow at #ACL2025! ⚑️ We save >40% compute on self consistency by using the LLM's valuable internal confidence signal. πŸ—“οΈ Poster: Tues, 16:00-17:30 @ Hall X4 X5 Paper: arxiv.org/abs/2502.06233 Also chatting: LLMs in Neuro, MedNLP, & Human-AI collab!

Presenting our CISC paper tomorrow at #ACL2025!
⚑️ We save >40% compute on self consistency by using the LLM's valuable internal confidence signal.
πŸ—“οΈ Poster: Tues, 16:00-17:30 @ Hall X4 X5
Paper: arxiv.org/abs/2502.06233
Also chatting: LLMs in Neuro, MedNLP, & Human-AI collab!
Tom Sheffer (@tomsheffer17807) 's Twitter Profile Photo

Just wrapped up #ACL2025 and feeling inspired! Standout sessions on LLM self-consistency and the role of pretrained models in text embeddings show how far NLP has come. Thanks to the organizers for an amazing conference. #AI #NLP #Neuroscience