Hamel Husain (@HamelHusain)'s Twitter Profile
Hamel Husain

@HamelHusain

Researcher focusing on LLMs: https://t.co/iVZDFdIQiE Previously, dev tools and infra for ML. Ex @Github, @Airbnb. @fastdotai core contributor.

ID:825766640

Link: http://hamel.dev
Joined: 15-09-2012 18:45:02

9.7K Tweets

23.4K Followers

1.8K Following

jason liu (@jxnlco)

If someone ever asks me about fine-tuning language models, Hamel is the guy I would invite.

Just from the guest speakers, you'd get a five-figure workshop's worth of value for something that basically fits under your learning & development budget.

maven.com/parlance-labs/…

Jonathan Whitaker (@johnowhitaker)

Aran stays on top of current research in lots of areas. If you need someone 'in the know', I'm sure he'd be an excellent person to talk to!

Hamel Husain (@HamelHusain)

AI Dev tools: if I can’t swipe a credit card, read your docs and get started myself - ngmi

Developers cannot take a 30 min meeting. That costs 10-100x more than a one month subscription to your tool.

Hamel Husain (@HamelHusain)

Does anyone have LLM evaluation tools/vendors that they really like and that are useful in domain-specific contexts?

I'm looking for vendors that have ALL of the below:

1. Observability
2. Allow you to write your own assertions and LLM judges
3. Bootstrap the creation of #2…

Jan-Hendrik Müller (@kolibril13)

This is tldraw in Jupyter with a sketch of a Blender scene. Hitting the 'MakeReal' button will produce Python Code that generates the Blender scene:

Zach Mueller (@TheZachMueller)

It's Hugging Face Accelerate release time and there are a TON of exciting features to get through: new optimizers, FP8 fixes, DataLoader improvements, documentation, and so much more!

For a quick read, check out the full notes: github.com/huggingface/ac…

Otherwise let's dig in🧵

Gagan Biyani 🏛 (@gaganbiyani)

Maven's top AI course just added a ton of new guest speakers.

Incredible talent convening on Maven to teach LLM Fine-Tuning:

- Wing Lian: Creator of Axolotl library for LLM fine-tuning
- Shreya Shankar: LLMOps and LLM Evaluations researcher
- Zachary Mueller: Lead maintainer…

Nate Raw (@_nateraw)

brushed up my personal site and brain dumped a post on 🎶musicgen-songstarter-v0.2🎶

It covers:
- 🧠my thought process/motivation behind it
- ✏️notes on my previous experiments over the last 9 months
- 👀 training deets, Weights & Biases logs w/ hparams

nateraw.com/posts/training…

Hamel Husain (@HamelHusain)

A classic example of overfitting to the validation set with LLMs: when I started working with Phillip Carter, I found few-shot examples from the validation set in the prompt (we fixed it!).

There are lots of reasons for a separate eval set. Overfitting can come in many forms.
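
One way to catch the kind of leakage described above is to check whether any few-shot example in the prompt also appears in the evaluation data. The sketch below is only an illustration under stated assumptions: the function names (`normalize`, `find_leaked_examples`) and the sample strings are hypothetical and do not come from the original thread, and it only detects exact matches after simple whitespace and case normalization, not paraphrases.

```python
def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences don't hide a match."""
    return " ".join(text.lower().split())


def find_leaked_examples(few_shot_inputs, eval_inputs):
    """Return the few-shot inputs that also appear (after normalization) in the eval set."""
    eval_seen = {normalize(x) for x in eval_inputs}
    return [x for x in few_shot_inputs if normalize(x) in eval_seen]


# Hypothetical data, for illustration only.
few_shot = ["Show slow requests by endpoint", "Count errors  per service"]
validation = ["count errors per service", "Latency p99 over the last hour"]

leaked = find_leaked_examples(few_shot, validation)
print(leaked)  # any non-empty result means the prompt overlaps with the eval data
```

Running a check like this before each evaluation is cheap, and a non-empty result suggests the scores on that set are optimistic.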

Hamel Husain (@HamelHusain)

Has someone created materials on “fundamentals of ML for AI Engineers”, focused not on building models but on things like evaluations, error analysis, etc.?

Maybe something already exists? I don’t want to do it lol - looking for a resource I can share with people

Hugo Bowne-Anderson (@hugobowne)

📺Tune in next week as Sebastian Raschka and I riff on 'Developing and Training LLMs From Scratch' in a live podcast recording for Vanishing Gradients Podcast 💫

lu.ma/build-llms-fro…

This will likely be a sprawling convo in which we tell you everything you need to know about LLMs, but were too…
