Jim Fan (@drjimfan) 's Twitter Profile
Jim Fan

@drjimfan

NVIDIA Sr. Research Manager. Co-Lead of GR00T (Humanoid Robotics) & GEAR Lab. Solving Physical AI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.

ID: 1007413134

linkhttps://jimfan.me calendar_today12-12-2012 22:11:27

3,3K Tweet

302,302K Followers

3,3K Following

Jim Fan (@drjimfan) 's Twitter Profile Photo

*If* GPT-4 is multimodal, we can predict with reasonable confidence what GPT-4 *might* be capable of, given Microsoft’s prior work Kosmos-1: - Visual IQ test: yes, the ones that humans take! - OCR-free reading comprehension: input a screenshot, scanned document, street sign, or

*If* GPT-4 is multimodal, we can predict with reasonable confidence what GPT-4 *might* be capable of, given Microsoft’s prior work Kosmos-1:

- Visual IQ test: yes, the ones that humans take!
- OCR-free reading comprehension: input a screenshot, scanned document, street sign, or