Swarnadeep Saha
@swarnanlp
Research Scientist @AIatMeta (FAIR) working on Reasoning. Past: @Google PhD fellow @uncnlp. Gooner.
ID: 2485053080
https://swarnahub.github.io/ 09-05-2014 08:54:56
603 Tweet
1,1K Takipçi
816 Takip Edilen
Got a new efficient/optimally-thinking LLM? Does you model answer simple queries quickly and spends compute on the harder ones? Test it on our new benchmark, OptimalThinkingBench! 👇 Work led by the amazing Pranjal Aggarwal ✈️ COLM 🍁 during this internship!
Great AI at Meta paper. Builds a single test that shows when LLMs think too much or too little, then scores both. It targets a gap, reasoning models ramble on easy questions while fast models miss steps on hard ones. They release a benchmark called OptimalThinkingBench with