virat(@virattt) 's Twitter Profileg
virat

@virattt

Exploring multimodal AI models and sharing what I learn along the way • previously @AirbnbEng

ID:137086701

calendar_today25-04-2010 19:12:53

4,5K Tweets

6,5K Followers

76 Following

Follow People
virat(@virattt) 's Twitter Profile Photo

I am diving into LLM fine-tuning.

There is a lack of deep tech content on fine-tuning:

• how it works
• why it works
• what it does to an LLM, etc.

There is a ton of high-level stuff, however.

I want to grok the first principles of fine-tuning.

If you have an excellent…

account_circle
virat(@virattt) 's Twitter Profile Photo

I studied word embeddings today.

Mainly, how LLMs like GPT-4 convert input text into input embeddings.

It’s simpler than I expected.

There are five key steps:

1. Convert input text to input tokens.

2. Map tokens to token IDs. Common vocab size is ~50K tokens.

3. Create…

I studied word embeddings today. Mainly, how LLMs like GPT-4 convert input text into input embeddings. It’s simpler than I expected. There are five key steps: 1. Convert input text to input tokens. 2. Map tokens to token IDs. Common vocab size is ~50K tokens. 3. Create…
account_circle