Vinam Arora (@vinam_arora) 's Twitter Profile
Vinam Arora

@vinam_arora

Curr: PhD in ML at Georgia Tech // Deep Learning, Math, Chip Design, and Music

ID: 826713221423718400

linkhttp://vinam.dev calendar_today01-02-2017 08:46:06

129 Tweet

156 Followers

244 Following

Google AI (@googleai) 's Twitter Profile Photo

Introducing SANPO, a multi-attribute video dataset for outdoor human egocentric scene understanding composed of both real-world and synthetic data, including depth maps and video panoptic masks with a wide variety of semantic class labels. Read more → goo.gle/3ZISInU

Ekdeep Singh Lubana (@ekdeepl) 's Twitter Profile Photo

So how exactly does fine-tuning change a model's pretraining capabilities? Find out with 73 pages of detailed investigation on well-defined procedural tasks! From instruction fine-tuning to jailbreaking, there's something for everyone! :D Collab with Samyak & Robert Kirk!

Vinam Arora (@vinam_arora) 's Twitter Profile Photo

Sometimes, during mid-project code optimization, you find yourself in a groove, spotting potential efficiency everywhere. It's easy to forget the main goal - a functioning project, not one that's fully optimized.

Jascha Sohl-Dickstein (@jaschasd) 's Twitter Profile Photo

Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Blueish colors correspond to hyperparameters for which training converges, redish colors to hyperparameters for which training diverges.

Vinam Arora (@vinam_arora) 's Twitter Profile Photo

Check out our new preprint! GraphFM - A framework for learning from multiple graph datasets simultaneously. Led by the awesome Divyansha

jack morris (@jxmnop) 's Twitter Profile Photo

language modeling is not unsupervised learning. it is not (and please stop saying this) self-supervised learning. next-token prediction is textbook _supervised_ learning, and i will die on this hill