anton

@abacaj

Software engineer. Hacking on large language models

31-08-2009 22:06:04

10.8K Tweets

36.1K Followers

518 Following


Fine-tuning works. It's not even hard to do, and it works. You can use LoRA, QLoRA, or full weights, whatever you have the resources for, and it will work. You can even fine-tune on top of an existing fine-tune (like the flan-t5 models), and that will work too.
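For readers who have not seen LoRA, here is a rough, dependency-free sketch of the idea, not the code behind the tweet. It assumes the standard LoRA formulation: the pretrained weight `W` stays frozen and only a low-rank pair `A`, `B` is trained, with `B` starting at zero. The helper names (`matvec`, `make_lora`, `lora_forward`) and the toy sizes are made up for illustration. A real run would use a training library rather than plain lists.

```python
import random

def matvec(M, x):
    """Multiply matrix M (list of rows) by vector x."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]

def make_lora(d_out, d_in, r, seed=0):
    """Create the adapter: A is small random (r x d_in), B is zeros (d_out x r)."""
    rng = random.Random(seed)
    A = [[rng.gauss(0.0, 0.01) for _ in range(d_in)] for _ in range(r)]
    B = [[0.0] * r for _ in range(d_out)]
    return A, B

def lora_forward(W, A, B, x, alpha, r):
    """y = W x + (alpha / r) * B (A x). W is frozen; only A and B would be trained."""
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    scale = alpha / r
    return [b + scale * u for b, u in zip(base, update)]

# Toy sizes (hypothetical): one 64x64 layer with a rank-4 adapter.
d_out, d_in, r, alpha = 64, 64, 4, 8
rng = random.Random(1)
W = [[rng.gauss(0.0, 0.02) for _ in range(d_in)] for _ in range(d_out)]
A, B = make_lora(d_out, d_in, r)
x = [rng.gauss(0.0, 1.0) for _ in range(d_in)]

full_params = d_out * d_in        # weights updated by full fine-tuning
lora_params = r * (d_in + d_out)  # weights updated by LoRA

print(f"full fine-tune params: {full_params}")
print(f"LoRA trainable params: {lora_params}")
# Because B starts at zero, the adapted layer initially matches the base layer.
print("matches base at init:", lora_forward(W, A, B, x, alpha, r) == matvec(W, x))
```

The point of the sketch is the parameter count: the adapter trains `r * (d_in + d_out)` values instead of `d_out * d_in`, which is why the choice between LoRA, QLoRA, and full weights mostly comes down to available resources.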


Some guides
- rentry.org/llm-training
- anyscale.com/blog/fine-tuni…
- asmirnov.xyz/doppelganger
- magazine.sebastianraschka.com/p/practical-ti…
