Jack Hessel
@jmhessel
ML, NLP, CV. PhD from @CornellCIS; Opinions my own.
ID:121516577
https://jmhessel.com/ 09-03-2010 19:02:10
2,1K Tweets
3,3K Followers
908 Following
🥰Excited to share that I will be joining AI2 Allen Institute for AI MOSAIC this September as a predoctoral young investigator!! So excited to continue working with amazing Yejin Choi Nouha Dziri Liwei Jiang Kavel Rao and can't wait to collaborate with others!
When augmented with retrieval, LMs sometimes overlook retrieved docs and hallucinate 🤖💭
To make LMs trust evidence more and hallucinate less, we introduce Context-Aware Decoding: a decoding algorithm improving LM's focus on input contexts
📖 arxiv.org/pdf/2305.14739…
#NAACL2024
Cool paper from Eric Zelikman et al ---
Quiet-STaR induces chain-of-thought tokens during pretraining, and uses RL to encourage the model to generate ''thoughts'' that improve language modeling performance. A clever step beyond next-word prediction :-)
arxiv.org/abs/2403.09629