Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile
Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

CEO @SophontAI |
PhD at 19 (2023) |
Founder, ex CEO @MedARC_AI |
ex Research Director Stability AI |
Biomed. engineer @ 14 |
TEDx talk➡bit.ly/3tpAuan

ID: 441465751

linkhttps://tanishq.ai calendar_today20-12-2011 03:45:50

16,16K Tweet

75,75K Followers

1,1K Following

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Reinforcing General Reasoning without Verifiers "we propose a verifier-free method (VeriFree) that bypasses answer verification and instead uses RL to directly maximize the probability of generating the reference answer. We compare VeriFree with verifier-based methods and

Reinforcing General Reasoning without Verifiers

"we propose a verifier-free method (VeriFree) that bypasses answer verification and instead uses RL to directly maximize the probability of generating the reference answer. We compare VeriFree with verifier-based methods and