Antonio Valerio Miceli Barone (@avmicelibarone)'s Twitter Profile
Antonio Valerio Miceli Barone

@avmicelibarone

ML / NLP
School of Informatics, The University of Edinburgh

ID: 325194378

Joined: 27-06-2011 22:05:23

6.6K Tweets

979 Followers

1.1K Following

Xuandong Zhao (@xuandongzhao)

🚀 Excited to share the most inspiring work I’ve been part of this year: "Learning to Reason without External Rewards" TL;DR: We show that LLMs can learn complex reasoning without access to ground-truth answers, simply by optimizing their own internal sense of confidence. 1/n

Charles Goddard (@chargoddard)

🤯 MIND-BLOWN! A new paper just SHATTERED everything we thought we knew about AI reasoning! This is paradigm-shifting. A MUST-READ. Full breakdown below 👇 🧵 1/23

alex lawsen (@lxrjl)

Claude and I wrote a response to a recent critique of 'LRMs', which we didn't find very compelling. (link in next tweet to the full paper)

Antonio Valerio Miceli Barone (@avmicelibarone)

EloEverything is a self-selected sample that doesn't represent unbiased "human values"; claiming that it does is bad methodology. The observed discrepancy is likely due to the culture of the contractors hired to do RLHF annotation, who are typically from non-Western countries.

Michael Saxon (@m2saxon)

The viral new "Definition of AGI" paper has fake citations which do not exist. And it specifically TELLS you to read them! Proof: different articles present at the specified journal/volume/page number, and their titles exist nowhere on any searchable repository.
