Arpit Bansal (@arpitbansal297) Twitter Tweets • TwiCopy

Arpit Bansal

@arpitbansal297

+ Follow

Currently interning with Llama @Meta, PhD Candidate @UMDCS. Past @AmazonScience, @IITKgp.
A brick in the creation of Artificial General Intelligence.

ID: 1411463405823725575

linkhttps://arpitbansal297.github.io/ calendar_today03-07-2021 23:14:47

217 Tweet

1,1K Followers

835 Following

Satya Nadella

@satyanadella

2 years ago

Congratulations to Australia on winning the World Cup! Great run to the finals, India.

thumb_up_off_alt39,39K

chat_bubble_outline450

repeat1,1K

shareShare

I'll be at #NeurIPS2023 next week! Very excited to present Tree-ring (poster session 1, #126) and PEZ (poster session 4, #606)! Feel free to DM me if you would like to chat about ML privacy and security or even just want to grab some hotpot🍲 together!

thumb_up_off_alt27

chat_bubble_outline2

repeat3

shareShare

Arpit Bansal

@arpitbansal297

2 years ago

I will be at Neurips'23, presenting our work on Cold Diffusion. Excited for a week packed with learning, awesome discussions, and yes, jazz tunes to end the days. #NeurIPS2023

thumb_up_off_alt39

chat_bubble_outline0

repeat3

shareShare

Furong Huang

@furongh

2 years ago

🔍 Do diffusion models have to reply on reversing ‘Gaussian degradation’? Not necessarily. Check out ‘Cold Diffusion’, which reverses deterministic image degradations. It paves the way for generalized diffusion models that invert arbitrary processes. #ML #AI #DiffusionModels

thumb_up_off_alt14

chat_bubble_outline0

repeat2

shareShare

Salva Rühling Cachay

@salvarc7

2 years ago

We train the networks in two stages. Our design follows the generalized diffusion model framework from Cold Diffusion Arpit Bansal Tom Goldstein, so we can directly use their sampling algorithm. Essentially, this leads to an alternation of forecasting and interpolation. 3/7

We train the networks in two stages. Our design follows the generalized diffusion model framework from Cold Diffusion <a href="/arpitbansal297/">Arpit Bansal</a> <a href="/tomgoldsteincs/">Tom Goldstein</a>, so we can directly use their sampling algorithm.

Essentially, this leads to an alternation of forecasting and interpolation. 3/7

thumb_up_off_alt3

chat_bubble_outline1

repeat1

shareShare

Arpit Bansal

@arpitbansal297

2 years ago

Will be presenting at the evening poster session from 5-7 pm in Hall B1 + B2, at booth #1918. Feel free to come by for a discussion! #NeurIPS23

thumb_up_off_alt13

chat_bubble_outline0

repeat0

shareShare

Arpit Bansal

@arpitbansal297

2 years ago

Our work on "Universal Guidance for Diffusion Models" has been accepted to ICLR2024. TLDR - The tweet features representations of my identity in various styles. #ICLR2024

thumb_up_off_alt63

chat_bubble_outline7

repeat2

shareShare

Peyman Milanfar

@docmilanfar

2 years ago

the math in most diffusion papers

thumb_up_off_alt886

chat_bubble_outline33

repeat77

shareShare

Soham De

@sohamde_

a year ago

Just got back from vacation, and super excited to finally release Griffin - a new hybrid LLM mixing RNN layers with Local Attention - scaled up to 14B params! arxiv.org/abs/2402.19427 My co-authors have already posted about our amazing results, so here's a 🧵on how we got there!

thumb_up_off_alt308

chat_bubble_outline12

repeat63

shareShare

Micah Goldblum

@micahgoldblum

a year ago

We show how to make data poisoning and backdoor attacks way more potent by synthesizing them from scratch with guided diffusion. 🧵 1/8 Paper: arxiv.org/abs/2403.16365

thumb_up_off_alt59

chat_bubble_outline1

repeat10

shareShare

AK

@_akhaliq

a year ago

Transformers Can Do Arithmetic with the Right Embeddings The poor performance of transformers on arithmetic tasks seems to stem in large part from their inability to keep track of the exact position of each digit inside of a large span of digits. We mend this problem by

thumb_up_off_alt543

chat_bubble_outline13

repeat88

shareShare

Arpit Bansal

@arpitbansal297

a year ago

In this paper, we investigate arithmetic to enhance and expand the generalization capabilities of transformers. Our study demonstrates excellent OOD performance, marking a significant effort towards building neural networks that can outperform their training data.

thumb_up_off_alt17

chat_bubble_outline1

repeat0

shareShare

Micah Goldblum

@micahgoldblum

a year ago

We often determine whether a neural network is over or under parameterized by counting parameter. In practice, how much data we can fit depends on many factors: architecture, optimizer, etc. So just how flexible are neural networks in practice? 🧵 Paper: arxiv.org/abs/2406.11463

thumb_up_off_alt260

chat_bubble_outline10

repeat53

shareShare

Sean McLeish

@seanmcleish

a year ago

We're presenting Abacus Embeddings 🧮 today from 10:10-11:10 in Lehar 3 at the ICML Workshop on Large Language Models and Cognition. Drop by to see how we can improve your language model.

thumb_up_off_alt18

chat_bubble_outline0

repeat1

shareShare

Micah Goldblum

@micahgoldblum

10 months ago

📢I’ll be admitting multiple PhD students this winter to Columbia University 🏙️ in the most exciting city in the world! If you are interested in dissecting modern deep learning systems to probe how they work, advancing AI safety, or automating data science, apply to my group.

thumb_up_off_alt559

chat_bubble_outline6

repeat146

shareShare

Arpit Bansal

@arpitbansal297

10 months ago

I wish everyone a very Happy Diwali 2024! I hope it brings peace to everyone and fills it with lights and colors. A throwback to my 2019 Diwali in India.