Tanishq Mathew Abraham, Ph.D. (@iscienceluvr)'s Twitter Profile
Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

CEO @SophontAI |
PhD at 19 (2023) |
Founder, ex CEO @MedARC_AI |
ex Research Director Stability AI |
Biomed. engineer @ 14 |
TEDx talk➡bit.ly/3tpAuan

ID: 441465751

Website: https://tanishq.ai | Joined: 20-12-2011 03:45:50

16,16K Tweets

75,75K Followers

1,1K Following

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

"We introduce GFPO (Group Filtered Policy Optimization), which curbs this length explosion by sampling larger groups per problem during training and filtering responses to train on based on two