Andrej Baranovskij (@andrejusb) 's Twitter Profile
Andrej Baranovskij

@andrejusb

👨‍💻 ML & Oracle Developer @katana_ml 👨‍💼 Founder @katana_ml 📖 GitHub github.com/katanaml

ID: 120748736

linkhttp://katanaml.io/ calendar_today07-03-2010 12:26:59

4,4K Tweet

6,6K Followers

150 Following

Ivan Fioravanti ᯅ (@ivanfioravanti) 's Twitter Profile Photo

Boom! Ernie 4.5 MoE is working too on MLX! PR from Gökdeniz Gülmez is still a WIP but it works! Here ERNIE-4.5-21B-A3B-PT fp16 running on M3 Ultra 512GB. Generation ~10 t/s WIP Let's try The Beast ERNIE-4.5-300B-A47B-PT 🔥

Andrej Baranovskij (@andrejusb) 's Twitter Profile Photo

Vision LLM with MLX: Extracting Electric Meter Data in Production💡 I share my experience using the MLX backend to run Vision LLM (with MLX-VLM) for structured data extraction in a production environment. Thanks to MLX team Awni Hannun and Prince Canuma Try Sparrow:

Awni Hannun (@awnihannun) 's Twitter Profile Photo

Latest mlx-lm is out! pip install -U mlx-lm Bunch of new models: - SmolLM3 (Hugging Face) - Ernie family (Baidu) - BitNet (Microsoft) - Falcon-E (TII) - Text-only Gemma3n (Google) - MiniCPM4 (OpenBMB) - AFM (Apple) +Performance improvements for DWQ, dynamic quantization, and

Latest mlx-lm is out!

pip install -U mlx-lm

Bunch of new models:
- SmolLM3 (Hugging Face)
- Ernie family (Baidu)
- BitNet (Microsoft)
- Falcon-E (TII)
- Text-only Gemma3n (Google)
- MiniCPM4 (OpenBMB)
- AFM (Apple)

+Performance improvements for DWQ, dynamic quantization, and
Mistral AI (@mistralai) 's Twitter Profile Photo

Introducing Devstral Small and Medium 2507! This latest update offers improved performance and cost efficiency, perfectly suited for coding agents and software engineering tasks.

Introducing Devstral Small and Medium 2507! This latest update offers improved performance and cost efficiency, perfectly suited for coding agents and software engineering tasks.
Andrej Baranovskij (@andrejusb) 's Twitter Profile Photo

Structured Data Query with Sparrow AI Agent Sparrow comes with option to extract stuctured data with query. In this video I explain how you can define such query to fetch array and field data. Video: youtube.com/watch?v=sGKvVX… GitHub: github.com/katanaml/sparr… Try Sparrow:

Structured Data Query with Sparrow AI Agent

Sparrow comes with option to extract stuctured data with query. In this video I explain how you can define such query to fetch array and field data.

Video: youtube.com/watch?v=sGKvVX…

GitHub: github.com/katanaml/sparr… 

Try Sparrow:
Qwen (@alibaba_qwen) 's Twitter Profile Photo

>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves

>>> Qwen3-Coder is here! ✅

We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves
Andrej Baranovskij (@andrejusb) 's Twitter Profile Photo

Sparrow is getting good traction in the last few days 🚀 📟 Sparrow UI visits 🔝 🧭 Sparrow GitHub traffic 🔝 🤩 Sparrow GitHub stars 🔝 Sparrow UI: sparrow.katanaml.io Sparrow GitHub: github.com/katanaml/sparr…

Sparrow is getting good traction in the last few days 🚀

📟 Sparrow UI visits 🔝

🧭 Sparrow GitHub traffic 🔝

🤩 Sparrow GitHub stars 🔝

Sparrow UI: sparrow.katanaml.io

Sparrow GitHub: github.com/katanaml/sparr…
Awni Hannun (@awnihannun) 's Twitter Profile Photo

The latest MLX has a CUDA back-end! To get started: pip install "mlx[cuda]" With the same codebase you can develop locally, run your model on Apple silicon, or in the cloud on Nvidia GPUs. MLX is designed around Apple silicon - which has a unified memory architecture. It uses

Andrej Baranovskij (@andrejusb) 's Twitter Profile Photo

Building new pipeline functionality in Sparrow github.com/katanaml/sparr… - AI assistant for code migration from Oracle Forms to APEX. Using Open Source LLMs, such as Mistral Codestral and Qwen Coder. This will run on local machine, completely private, no cloud dependency.

Building new pipeline functionality in Sparrow github.com/katanaml/sparr… - AI assistant for code migration from Oracle Forms to APEX. Using Open Source LLMs, such as Mistral Codestral and Qwen Coder. This will run on local machine, completely private, no cloud dependency.
Prince Canuma (@prince_canuma) 's Twitter Profile Photo

mlx-vlm v0.3.3 is here gr New models: - Liquid AI LFM2-VL - Z.ai GLM-4.5V - cohere Command-A-Vision Changes: - New kernel for grid_sample - Fix bicubic interpolate kernel compatibility with macOS < 15 - Fix config inheritance Thank you very much to all the amazing

mlx-vlm v0.3.3 is here gr

New models:
- <a href="/LiquidAI_/">Liquid AI</a> LFM2-VL
- <a href="/Zai_org/">Z.ai</a> GLM-4.5V
- <a href="/cohere/">cohere</a> Command-A-Vision

Changes:
- New kernel for grid_sample
- Fix bicubic interpolate kernel compatibility with macOS &lt; 15
- Fix config inheritance

Thank you very much to all the amazing