Hamel Husain(@HamelHusain) 's Twitter Profileg
Hamel Husain

@HamelHusain

Researcher focusing on LLMs: https://t.co/iVZDFdIQiE

Previously, dev tools and infra for ML. Ex @Github, @Airbnb, @DataRobot. @fastdotai core contributor.

ID:825766640

linkhttp://hamel.dev calendar_today15-09-2012 18:45:02

9,5K Tweets

22,7K Followers

1,7K Following

Hamel Husain(@HamelHusain) 's Twitter Profile Photo

The bugs I encounter most with LLMs in production are related to data drift.

This is acute for LLMs b/c of all the moving parts: prompts, RAG, functions, etc. There is a classic ML technique that works for detecting drift. I explain it in this post: hamel.dev/blog/posts/dri…

account_circle
Chris Levy(@cleavey1985) 's Twitter Profile Photo

Finished a blog post on using Axolotl for the first time. drchrislevy.github.io/posts/intro_fi…
Thanks Jeremy Howard for recommending the tool, Hamel Husain for docs, Wing Lian (caseus) and others for the great library, Maxime Labonne for your blog posts which I used as a guide.

account_circle
Emmanuel Ameisen(@mlpowered) 's Twitter Profile Photo

We've reached the point where models are becoming as persuasive as human writers.

There's a clear model size x persuasion scaling trend, and it doesn't look like it is plateauing.

account_circle
Hamel Husain(@HamelHusain) 's Twitter Profile Photo

In my experience Automated prompt optimizers are in the same class as off the shelf automated evals

It can become a dangerous comfort blanket to make you feel like you are making progress but in the long term it becomes tech debt

Take your prompts and run.

account_circle
Morgante(@morgantepell) 's Twitter Profile Photo

With $10k on the line, you have to ask why no magical prompt optimizer solved this instead of a human manually grinding. Surely it would be great advertising. 🤔

account_circle
Doug Turnbull(@softwaredoug) 's Twitter Profile Photo

Getting excited about our Haystack talk about integrating LTR into Reddit search.

Hope to see you there!

haystackconf.com/us2024/talk-4/

account_circle
Josh W. Comeau(@JoshWComeau) 's Twitter Profile Photo

😮 This is so friggin’ cool. As the circles reach the horizon, they trigger a kalimba sample. Settings allow us to tweak the motion, and generate trippy polyrhythms.

polyrhythmic-rings.vercel.app

(🎵 Sound on for video!)

account_circle
Andrew Reed(@andrewrreed) 's Twitter Profile Photo

It cannot be over-stated that evaluation driven development is key to building robust LLM products, quickly.

Great blog post by Hamel Husain that not only explains 'why', but gives practical tips on 'how'

hamel.dev/blog/posts/eva…

account_circle
Chris Levy(@cleavey1985) 's Twitter Profile Photo

Early days for me in training some models with Axolotl . Thanks Hamel Husain for starting more documentation. I was struggling to figure out what was in last_run_prepared directory for the pre-processed dataset, until I read this. openaccess-ai-collective.github.io/axolotl/docs/i…

Early days for me in training some models with @axolotl_ai . Thanks @HamelHusain for starting more documentation. I was struggling to figure out what was in last_run_prepared directory for the pre-processed dataset, until I read this. openaccess-ai-collective.github.io/axolotl/docs/i…
account_circle
Omar Sanseviero(@osanseviero) 's Twitter Profile Photo

New open model by Cohere is here! C4AI Command R+ 🔥

128K context length
104b parameters
Multilingual
Conversational Tool support and RAG capabilities

Model hf.co/CohereForAI/c4…
Cohere playground: dashboard.cohere.com/playground/chat
Demo: huggingface.co/spaces/CohereF…

account_circle
Zach Mueller(@TheZachMueller) 's Twitter Profile Photo

A resource I keep coming back to often is the Hugging Face Trainer example zoo I made awhile back. Want a quick way to just 'run the trainer' and verify your setup works? Or just see minimal working examples that can be substituted for any model? Check it out…

A resource I keep coming back to often is the @huggingface Trainer example zoo I made awhile back. Want a quick way to just 'run the trainer' and verify your setup works? Or just see minimal working examples that can be substituted for any model? Check it out…
account_circle
Eugene Yan(@eugeneyan) 's Twitter Profile Photo

I've been trying several evals to find those that correlate well with use cases and discriminative enough for prod.

Here's an opinionated take on what works, focusing on classification, summarization, translation, copyright regurgitation, and toxicity.

eugeneyan.com/writing/evals/

account_circle
kache (dingboard.com)(@yacineMTB) 's Twitter Profile Photo

after using what I consider to be AGI to help me code for over a year now; I am not really afraid of too large disruptive effects caused by AGI

account_circle