Andrej Baranovskij (@andrejusb) Twitter Tweets • TwiCopy

Andrej Baranovskij

@andrejusb

+ Follow

👨‍💻 ML & Oracle Developer @katana_ml 👨‍💼 Founder @katana_ml 📖 GitHub github.com/katanaml

ID: 120748736

linkhttp://katanaml.io/ calendar_today07-03-2010 12:26:59

4,4K Tweet

6,6K Followers

150 Following

Ivan Fioravanti ᯅ

@ivanfioravanti

5 months ago

Boom! Ernie 4.5 MoE is working too on MLX! PR from Gökdeniz Gülmez is still a WIP but it works! Here ERNIE-4.5-21B-A3B-PT fp16 running on M3 Ultra 512GB. Generation ~10 t/s WIP Let's try The Beast ERNIE-4.5-300B-A47B-PT 🔥

thumb_up_off_alt110

chat_bubble_outline6

repeat10

shareShare

Robert Scoble

@scobleizer

5 months ago

Did Grok 4 just turn on? My ForYou feed just got a lot more interesting.

thumb_up_off_alt303

chat_bubble_outline61

repeat15

shareShare

Andrej Baranovskij

@andrejusb

5 months ago

Vision LLM with MLX: Extracting Electric Meter Data in Production💡 I share my experience using the MLX backend to run Vision LLM (with MLX-VLM) for structured data extraction in a production environment. Thanks to MLX team Awni Hannun and Prince Canuma Try Sparrow:

thumb_up_off_alt20

chat_bubble_outline1

repeat0

shareShare

Awni Hannun

@awnihannun

5 months ago

Latest mlx-lm is out! pip install -U mlx-lm Bunch of new models: - SmolLM3 (Hugging Face) - Ernie family (Baidu) - BitNet (Microsoft) - Falcon-E (TII) - Text-only Gemma3n (Google) - MiniCPM4 (OpenBMB) - AFM (Apple) +Performance improvements for DWQ, dynamic quantization, and

thumb_up_off_alt251

chat_bubble_outline17

repeat35

shareShare

Mistral AI

@mistralai

5 months ago

Introducing Devstral Small and Medium 2507! This latest update offers improved performance and cost efficiency, perfectly suited for coding agents and software engineering tasks.

thumb_up_off_alt2,2K

chat_bubble_outline87

repeat326

shareShare

Andrej Baranovskij

@andrejusb

5 months ago

Detecting table columns for better structured data extraction 👀

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Andrej Baranovskij

@andrejusb

5 months ago

Structured Data Query with Sparrow AI Agent Sparrow comes with option to extract stuctured data with query. In this video I explain how you can define such query to fetch array and field data. Video: youtube.com/watch?v=sGKvVX… GitHub: github.com/katanaml/sparr… Try Sparrow:

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Andrej Baranovskij

@andrejusb

5 months ago

Thanks Prince. It was great to meet you in person! See you next time 💯

thumb_up_off_alt4

chat_bubble_outline1

repeat0

shareShare

Qwen

@alibaba_qwen

5 months ago

>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves

thumb_up_off_alt8,8K

chat_bubble_outline264

repeat1,1K

shareShare

Andrej Baranovskij

@andrejusb

4 months ago

Sparrow is getting good traction in the last few days 🚀 📟 Sparrow UI visits 🔝 🧭 Sparrow GitHub traffic 🔝 🤩 Sparrow GitHub stars 🔝 Sparrow UI: sparrow.katanaml.io Sparrow GitHub: github.com/katanaml/sparr…

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare

Awni Hannun

@awnihannun

4 months ago

The latest MLX has a CUDA back-end! To get started: pip install "mlx[cuda]" With the same codebase you can develop locally, run your model on Apple silicon, or in the cloud on Nvidia GPUs. MLX is designed around Apple silicon - which has a unified memory architecture. It uses

thumb_up_off_alt407

chat_bubble_outline25

repeat65

shareShare

Mistral AI

@mistralai

4 months ago

Meet the new Codestral 25.08 and the Complete Mistral Coding Stack for Enterprises! mistral.ai/news/codestral…

thumb_up_off_alt794

chat_bubble_outline26

repeat103

shareShare

Awni Hannun

@awnihannun

4 months ago

pip install -U mlx

thumb_up_off_alt112

chat_bubble_outline7

repeat5

shareShare

Andrej Baranovskij

@andrejusb

4 months ago

Building new pipeline functionality in Sparrow github.com/katanaml/sparr… - AI assistant for code migration from Oracle Forms to APEX. Using Open Source LLMs, such as Mistral Codestral and Qwen Coder. This will run on local machine, completely private, no cloud dependency.

thumb_up_off_alt10

chat_bubble_outline2

repeat3

shareShare

Awni Hannun

@awnihannun

4 months ago

Gemma 3 270m 4-bit generates text at over 650 (!) tok/sec on an M4 Max with mlx-lm and uses < 200MB: Not sped up:

thumb_up_off_alt630

chat_bubble_outline31

repeat46

shareShare

Prince Canuma

@prince_canuma

4 months ago

mlx-vlm v0.3.3 is here gr New models: - Liquid AI LFM2-VL - Z.ai GLM-4.5V - cohere Command-A-Vision Changes: - New kernel for grid_sample - Fix bicubic interpolate kernel compatibility with macOS < 15 - Fix config inheritance Thank you very much to all the amazing

mlx-vlm v0.3.3 is here gr

New models:
- <a href="/LiquidAI_/">Liquid AI</a> LFM2-VL
- <a href="/Zai_org/">Z.ai</a> GLM-4.5V
- <a href="/cohere/">cohere</a> Command-A-Vision

Changes:
- New kernel for grid_sample
- Fix bicubic interpolate kernel compatibility with macOS < 15
- Fix config inheritance

Thank you very much to all the amazing

thumb_up_off_alt104

chat_bubble_outline3

repeat20

shareShare