Lexi
@orenguteng_ai
AI Innovation - (un)alignment research
huggingface.co/Orenguteng
ID: 1783705442817830913
26-04-2024 03:51:49
55 Tweets
73 Followers
25 Following
I'm glad you liked it bruh, Adam Grant! Stay tuned for V3 and the 70B model, coming very soon!! Appreciate your feedback
Fixed a bug which caused all training losses to diverge for large gradient accumulation sizes. 1. First reported by Benjamin Marie: GA is supposed to be mathematically equivalent to full-batch training, but the losses did not match. 2. We reproduced the issue, and further investigation…
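A minimal numeric sketch of the kind of mismatch this tweet describes: with variable-length sequences, averaging each mini-batch's mean loss is not the same as averaging over all tokens at once. The token counts and loss values below are made up purely for illustration, not taken from the actual bug report.

```python
import torch

# Per-token losses for two mini-batches of different lengths.
losses_a = torch.tensor([2.0, 1.0, 3.0])  # 3 tokens
losses_b = torch.tensor([4.0])            # 1 token

# Full-batch reference: mean over ALL tokens at once.
full_batch = torch.cat([losses_a, losses_b]).mean()  # (2+1+3+4)/4 = 2.5

# Naive GA: mean of per-mini-batch means -- diverges from full batch.
naive_ga = (losses_a.mean() + losses_b.mean()) / 2   # (2.0+4.0)/2 = 3.0

# Corrected GA: sum per-token losses, divide by the TOTAL token count.
fixed_ga = (losses_a.sum() + losses_b.sum()) / (len(losses_a) + len(losses_b))

print(full_batch.item(), naive_ga.item(), fixed_ga.item())  # 2.5 3.0 2.5
```

The naive average only matches full-batch training when every accumulation step contains the same number of tokens, which is rarely true for packed or padded LLM batches.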
Quantizing a model to 4 bits can sometimes break it entirely! Unsloth AI now has a dynamic 4-bit quant format which chooses some parameters to stay in 16-bit! We find that: 1. You need to check both activation and weight quantization errors; relying on just one of the two does not work. 2. …
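A hedged sketch of the selection idea the tweet points at: measure both weight and activation quantization error per layer, and keep the worst offenders in 16-bit. The toy model, the `quantize_4bit_absmax` helper, and the 0.05 threshold are all illustrative assumptions, not Unsloth's actual implementation.

```python
import torch
import torch.nn as nn

def quantize_4bit_absmax(w: torch.Tensor) -> torch.Tensor:
    """Simulate symmetric 4-bit absmax quantization, dequantized back to float."""
    scale = w.abs().max() / 7.0  # signed 4-bit range is [-8, 7]
    return torch.clamp(torch.round(w / scale), -8, 7) * scale

model = nn.Sequential(nn.Linear(64, 64), nn.ReLU(), nn.Linear(64, 64))
calib = torch.randn(32, 64)  # small calibration batch

keep_in_16bit = []
x = calib
for name, layer in model.named_children():
    if isinstance(layer, nn.Linear):
        w_q = quantize_4bit_absmax(layer.weight.data)
        # Signal 1: relative weight quantization error.
        weight_err = (layer.weight.data - w_q).norm() / layer.weight.data.norm()
        # Signal 2: relative activation error on calibration data. Per the
        # tweet, relying on only one of these two signals is not enough.
        out_fp = x @ layer.weight.data.T + layer.bias.data
        out_q = x @ w_q.T + layer.bias.data
        act_err = (out_fp - out_q).norm() / out_fp.norm()
        if max(weight_err.item(), act_err.item()) > 0.05:  # illustrative threshold
            keep_in_16bit.append(name)
    x = layer(x)

print("layers kept in 16-bit:", keep_in_16bit)
```

Layers flagged this way would be stored in 16-bit while the rest of the model is quantized to 4 bits, trading a small amount of memory for much better output fidelity.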
Train your own reasoning LLM using DeepSeek's GRPO algorithm with our free notebook! You'll transform Llama 3.1 (8B) into a chain-of-thought reasoner. Unsloth makes GRPO use 80% less VRAM. Guide: docs.unsloth.ai/basics/reasoni… GitHub: github.com/unslothai/unsl… Colab: colab.research.google.com/github/unsloth…
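A minimal sketch of the recipe the tweet advertises, assuming the public APIs of unsloth's FastLanguageModel and trl's GRPOTrainer/GRPOConfig. The reward function, dataset, and hyperparameters here are illustrative stand-ins; the linked Colab notebook has the actual settings.

```python
from unsloth import FastLanguageModel  # import unsloth before trl
from trl import GRPOConfig, GRPOTrainer
from datasets import load_dataset

# 4-bit base weights plus LoRA adapters keep VRAM low during GRPO.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Meta-Llama-3.1-8B-Instruct",  # Llama 3.1 (8B), per the tweet
    max_seq_length=1024,
    load_in_4bit=True,
)
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

def reward_len(completions, **kwargs):
    """Toy reward: mildly prefer longer completions (illustrative only)."""
    return [min(len(c) / 200.0, 1.0) for c in completions]

trainer = GRPOTrainer(
    model=model,
    processing_class=tokenizer,
    reward_funcs=[reward_len],
    args=GRPOConfig(
        output_dir="grpo-llama31-8b",
        per_device_train_batch_size=8,
        num_generations=8,        # group size: completions sampled per prompt
        max_completion_length=256,
        max_steps=100,
    ),
    train_dataset=load_dataset("trl-lib/tldr", split="train"),  # any prompt dataset
)
trainer.train()
```

GRPO scores a group of sampled completions per prompt with the reward functions and reinforces the above-average ones, so a verifiable reward (e.g. checking a math answer) is what actually elicits chain-of-thought; the length reward above is only a placeholder.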