Patrick Devaney
@patrickbdevaney
ID: 1595527208998690819
23-11-2022 21:18:48
108 Tweets
82 Followers
175 Following
You can finetune Llama-3.2-Vision-11B for free on Colab now! Unsloth finetunes VLMs 2x faster, with 50% less VRAM, 6x longer context - with no accuracy loss. Documentation: docs.unsloth.ai GitHub: github.com/unslothai/unsl… Finetuning Colab: colab.research.google.com/drive/1j0N4XTY…
Vision finetuning is finally in 🦥 Unsloth AI! It took a while, but Llama 3.2 Vision, Pixtral, Qwen2 VL & all Llava variants now work! 1. QLoRA / LoRA is 1.3x to 2x faster for each 2. 30-70% less VRAM usage 3. 3 examples - Radiography, LaTeX, Q&A Extra stuff: 1. Pixtral chat
H Company released Holo-1: 3B and 7B GUI Action Vision Language Models for various web and computer agent tasks 🤗 Holo-1 has an Apache 2.0 license and Hugging Face transformers support 🔥 More details in their blog post (next ⤵️)