matt (@mattjcly)'s Twitter Profile
matt

@mattjcly

Systems/ML Software Engineer @ LM Studio

ID: 1760464029300981760

Link: https://www.linkedin.com/in/matthewjrclayton/ · Joined: 22-02-2024 00:38:38

23 Tweets

42 Followers

53 Following

LM Studio (@lmstudio):

Happy 2025! Introducing LM Studio 0.3.6 🚀

- New vision models: Qwen2VL and QVQ (both GGUF + MLX) 🤩
- Function Calling API (in beta) 🧰
- New installer on Windows: choose drive (finally 😮‍💨)
- In-app updates are much smaller & have a progress bar! 🟩🟩⬜️⬜️
- Update your …
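
The Function Calling API follows the OpenAI chat-completions convention, so the standard openai client can talk to the local server. A minimal sketch, assuming the server is running on its default port (1234) and a tool-capable model is loaded; the model name and the get_weather tool are placeholders, not values from the release:

```python
# Hedged sketch: function calling (beta) against LM Studio's OpenAI-compatible
# local server. Base URL is the app's default; model/tool names are made up.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool, for illustration only
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

resp = client.chat.completions.create(
    model="qwen2-vl-7b-instruct",  # placeholder for whatever model you loaded
    messages=[{"role": "user", "content": "What's the weather in Oslo?"}],
    tools=tools,
)
# If the model decided to call a tool, the parsed call(s) appear here.
print(resp.choices[0].message.tool_calls)
```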

LM Studio (@lmstudio):

DeepSeek R1 Distilled models are now available in LM Studio! 1.5B, 7B, 8B, 14B, 32B, and 70B variants. Please update to LM Studio 0.3.7 first. Let's go! 🚀

JetBrains (@jetbrains):

✨You can now connect AI chat to locally hosted models via LM Studio in JetBrains AI Assistant!
Configure the connection in AI Assistant's settings to gain full control of your AI experience.
Check out this and other model updates here: jb.gg/rowjbz

LM Studio (@lmstudio):

LM Studio 0.3.10 is here with 🔮 Speculative Decoding! This provides inference speedups, in some cases 2x or more, with no degradation in quality.

- Works for both GGUF/llama.cpp and MLX models!
- Easily experiment with different draft models
- Visualize accepted draft token …

yags (@yagilb):

We just released LM Studio 0.3.10 with Speculative Decoding support! It's a technique that pairs a large model with a more efficient (smaller) "draft" model that can generate tokens fast (draft generation). The larger model then verifies/rejects the draft tokens. If …
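
In other words: the draft model proposes a short run of tokens, and the large model checks them all in a single forward pass, keeping the prefix that matches what it would have generated itself. A toy sketch of the greedy variant follows; the model objects and their greedy_next / greedy_next_batch methods are stand-ins (not LM Studio, mlx, or llama.cpp APIs), and production implementations accept/reject by comparing probabilities rather than argmax tokens:

```python
# Toy greedy speculative decoding step, for intuition only.
def speculative_step(target, draft, tokens, k=4):
    # 1) Draft phase: the small model proposes k tokens autoregressively.
    #    This is cheap because the draft model is small.
    proposed = []
    ctx = list(tokens)
    for _ in range(k):
        t = draft.greedy_next(ctx)   # stand-in: argmax next token
        proposed.append(t)
        ctx.append(t)

    # 2) Verify phase: ONE forward pass of the big model over the drafted
    #    sequence yields its own next-token choice at each of the k positions.
    expected = target.greedy_next_batch(tokens, proposed)  # stand-in, len k

    # 3) Accept draft tokens until the first disagreement. On a mismatch the
    #    big model's own token is emitted instead, so every step still makes
    #    progress; when all k match, this step produced k tokens for roughly
    #    the cost of one big-model forward pass.
    out = list(tokens)
    for p, e in zip(proposed, expected):
        out.append(e)                # equals p whenever the draft was accepted
        if p != e:
            break
    return out
```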

LM Studio (@lmstudio):

🚢 LM Studio 0.3.14 is out now with new powerful controls for Multi-GPU setups!

> Enable/disable specific GPUs
> Choose which GPUs to fill first, second, etc.
> Limit offload to Dedicated GPU memory
> Bug fixes and more!

See it in action in 🧵👇

LM Studio (@lmstudio):

✨LM Studio 0.3.15 is out now.

- Support for NVIDIA RTX 50-series (CUDA 12.8)
- GLM-4 enabled in llama.cpp and MLX
- New system prompt editor UI 📝
- tool_choice param in OpenAI-like API
... and more 👾

Release notes: lmstudio.ai/blog/lmstudio-…

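The tool_choice parameter mentioned above follows OpenAI's convention: "auto" lets the model decide, "none" disables tool calls, and naming a function forces that specific call. A hedged sketch against the local server; the endpoint default is the app's usual one, while the model name and web_search tool are assumptions for illustration:

```python
# Sketch: forcing a specific tool call via tool_choice (LM Studio 0.3.15+).
from openai import OpenAI

client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

resp = client.chat.completions.create(
    model="glm-4-9b-chat",  # placeholder for whatever model is loaded
    messages=[{"role": "user", "content": "Find the GLM-4 release notes."}],
    tools=[{
        "type": "function",
        "function": {
            "name": "web_search",  # hypothetical tool
            "description": "Search the web for a query.",
            "parameters": {
                "type": "object",
                "properties": {"query": {"type": "string"}},
                "required": ["query"],
            },
        },
    }],
    # "auto" and "none" are also accepted; this form forces a web_search call.
    tool_choice={"type": "function", "function": {"name": "web_search"}},
)
print(resp.choices[0].message.tool_calls)
```
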
yags (@yagilb):

matt fixed an mlx-engine bug that caused undesired prompt reprocessing. The difference in performance is huge, and the fix is pretty interesting. Check out the PR to see how this issue relates to repetition penalty: github.com/lmstudio-ai/ml…
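
The gist of "undesired prompt reprocessing": if the engine fails to recognize that a new request shares a prefix with the previous one, it re-runs the entire prompt through the model instead of reusing the cached KV entries for the shared part. A conceptual sketch of prefix reuse, purely illustrative and not the actual mlx-engine code or the fix in the PR:

```python
# Illustrative only: determine how much of the cached prompt can be reused so
# that only the new suffix needs a forward pass.
def tokens_needing_processing(cached_tokens, new_tokens):
    shared = 0
    for a, b in zip(cached_tokens, new_tokens):
        if a != b:
            break
        shared += 1
    # KV-cache entries for the first `shared` tokens are reused as-is;
    # only the remaining suffix is actually run through the model.
    return new_tokens[shared:]
```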

LM Studio (@lmstudio):

Qwen3 is available on LM Studio in GGUF and MLX! Sizes: 0.6B, 1.7B, 4B, 8B, 14B, 30B MoE, 32B, and 235B MoE Happy Qwen day to all who celebrate! 🎉

LM Studio (@lmstudio):

Engine update: LM Studio llama.cpp/1.29.0

- Qwen2.5VL now supported in GGUF (h/t Xuan-Son Nguyen)
- Support for Nomic AI's new MoE embedding model

🚂 Cmd/Ctrl + Shift + R → Update
