Alex Nichol (@unixpickle) 's Twitter Profile
Alex Nichol

@unixpickle

Code, AI, and 3D printing. Opinions are mostly my own, sometimes my computer's. Husband of @thesamnichol. Co-creator of DALL-E 2. Researcher @openai.

ID: 376494033

linkhttps://aqnichol.com calendar_today20-09-2011 00:13:54

6,6K Tweet

9,9K Followers

403 Following

Alex Nichol (@unixpickle) 's Twitter Profile Photo

Figuring out if you can eat a poisonous berry 10,000 years ago: "hey guys, have you tried these berries?" Random guy: "don't eat that, it made me sick." Today: "Hey gork can I eat this?" Grok: "Yeah that looks yummy, you're good to go. Berries are safe in general."

Alex Nichol (@unixpickle) 's Twitter Profile Photo

Sure the TSP is NP, but if you have any practical problem of a reasonable size (e.g. <10000) you can _probably_ find the optimal solution in like 5 minutes.

Alex Nichol (@unixpickle) 's Twitter Profile Photo

Has anybody tried training a neural network where each layer's weight matrix is used in a constraint Ax < c? The input would be some vector x and output is min x*y subject to Ax<c.

Alex Nichol (@unixpickle) 's Twitter Profile Photo

A Markov model and a transformer can both achieve the same loss on the training set. But only the transformer is smart enough to be useful for other tasks. This invalidates the claim that "all transformers are doing is memorizing their training data".