
Bhuwan Dhingra
@bhuwandhingra
Natural Language Processing / Machine Learning research. Assistant Professor @dukecompsci, @duke_nlp; Research Scientist @Apple
ID: 2490439280
https://users.cs.duke.edu/~bdhingra/ 11-05-2014 21:45:36
103 Tweet
1,1K Followers
310 Following

Tagging Maximilian Holsman who led this great work!




Backtracking allows reasoning models to go back and correct mistakes in their solution attempts. What sorts of tasks benefit from this behavior? And can we boost it using SFT? Hongyi James Cai 's new preprint answers these questions and more -- check it out!


