Dhruv Batra (@dhruvbatradb) 's Twitter Profile
Dhruv Batra

@dhruvbatradb

Co-founder & Chief Scientist @yutori_ai. Prev: Senior Director leading FAIR Embodied AI @MetaAI and Professor @GeorgiaTech.

ID: 4855602651

linkhttp://dhruvbatra.com calendar_today28-01-2016 01:31:27

2,2K Tweet

18,18K Followers

532 Following

Dhruv Batra (@dhruvbatradb) 's Twitter Profile Photo

I have been working on vision+language models (VLMs) for a decade. And every few years, this community re-discovers the same lesson -- that on difficult tasks, VLMs regress to being nearly blind! Visual content provides minor improvement to a VLM over an LLM, even when these

I have been working on vision+language models (VLMs) for a decade. 

And every few years, this community re-discovers the same lesson -- that on difficult tasks, VLMs regress to being nearly blind! 

Visual content provides minor improvement to a VLM over an LLM, even when these