John T Davies 🇺🇦🇪🇺🌍 (@jtdavies) Twitter Tweets • TwiCopy

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

+ Follow

Entrepreneur, CTO in Gen-AI, investor, father to 3 grown boys, husband to Rachel, astrophysicist, keen photographer, cyclist, über-geek, travelled a lot.

ID: 15784290

linkhttp://www.johntdavies.com calendar_today08-08-2008 23:20:02

3,3K Tweet

1,1K Takipçi

531 Takip Edilen

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

a month ago

This is very impressive, on the leaderboard with a 15B, 128k context reasoning model. I shall be testing this today, nice work guys!

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

a month ago

This will be right up Prince Canuma’s street for MLX!

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

a month ago

Tuesday at Devoxx Belgium, James Ward, Rod Johnson and I had a workshop on Agentic AI using Embabel. I have my last talk later this afternoon on local private LLMs.

Tuesday at <a href="/Devoxx/">Devoxx</a> Belgium, <a href="/JamesWard/">James Ward</a>, <a href="/springrod/">Rod Johnson</a> and I had a workshop on Agentic AI using Embabel. I have my last talk later this afternoon on local private LLMs.

thumb_up_off_alt4

chat_bubble_outline1

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

a month ago

Full house today at Rod Johnson’s Embabel talk at Devoxx Belgium.

Full house today at <a href="/springrod/">Rod Johnson</a>’s Embabel talk at <a href="/Devoxx/">Devoxx</a> Belgium.

thumb_up_off_alt10

chat_bubble_outline0

repeat1

shareShare

Rod Johnson

@springrod

a month ago

John Davies giving an important talk on local LLMs. Currently describing how Incept5 has built a banking workflow on Embabel using only local models. Java John T Davies 🇪🇺 Incept5

thumb_up_off_alt56

chat_bubble_outline6

repeat9

shareShare

Travelling back from a great week at Devoxx. I love travelling by train, this is the relatively new Frecciarossa service from Paris to Marseille. Working with a full meal service and an office at 320km/h (over 200mph), cheaper than a flight and very civilised!

Travelling back from a great week at <a href="/Devoxx/">Devoxx</a>. I love travelling by train, this is the relatively new Frecciarossa service from Paris to Marseille. Working with a full meal service and an office at 320km/h (over 200mph), cheaper than a flight and very civilised!

thumb_up_off_alt12

chat_bubble_outline2

repeat2

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

a month ago

I fed this image (below) from Prince Canuma's ongoing work on MLX-VLM to support Qwen3-VL and batch. Qwen's Qwen3-VL-30B-a3B-Instruct-4bit (MLX) with the following prompt. For me on an M4 it was... Prompt: 895 tokens, 711.575 tokens-per-sec Generation: 134 tokens,

I fed this image (below) from <a href="/Prince_Canuma/">Prince Canuma</a>'s ongoing work on MLX-VLM to support Qwen3-VL and batch.
<a href="/Alibaba_Qwen/">Qwen</a>'s Qwen3-VL-30B-a3B-Instruct-4bit (MLX) with the following prompt. For me on an M4 it was...

Prompt: 895 tokens, 711.575 tokens-per-sec
Generation: 134 tokens,

thumb_up_off_alt3

chat_bubble_outline1

repeat1

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

a month ago

This will be incredibly useful in parallel agentic processes. You rock Prince!

thumb_up_off_alt3

chat_bubble_outline1

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

a month ago

They should make a Breaking Bad episode on this guy, he sure can cook!!!

thumb_up_off_alt1

chat_bubble_outline2

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

a month ago

So it’s just a souped-up Raspberry Pi. Get a 48 or 64GB Mac Mini for half the price and it will run the same models way faster, mine runs the same qwen3-32b-8bit at over 30tps and that’s not even with MLX.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

a month ago

Apple M5, the pin to burst the bubble? linkedin.com/posts/jdavies_…

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

a month ago

Prince, king of MLX joins the big-boy club!

thumb_up_off_alt18

chat_bubble_outline2

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

21 days ago

OK, I'm now browsing on ChatGPT-Atlas, no way is it getting access to my machine but so far, it's pretty cool.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

21 days ago

Just downloaded and the MLX version (lmstudio-community/Qwen3-VL-2B-Instruct-MLX-bf16), great image decoding and over 100 tokens/second. Next the 32B versions.

thumb_up_off_alt1

chat_bubble_outline0

repeat1

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

16 days ago

Back in Germany, all 3 trains were late, even the one we took because we missed the other. However, they do serve beer on the train!

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

16 days ago

310 Tokens / second with an unquantised (bf16) model using MLX, this is CRAZY fast! I've just tried the new MLX-VML beta from Prince Canuma (using an M4 Max). DeepSeek-OCR (3B) on MLX is a game-changer.

310 Tokens / second with an unquantised (bf16) model using MLX, this is CRAZY fast!
I've just tried the new MLX-VML beta from <a href="/Prince_Canuma/">Prince Canuma</a> (using an M4 Max). DeepSeek-OCR (3B) on MLX is a game-changer.

thumb_up_off_alt70

chat_bubble_outline1

repeat5

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

12 days ago

Excellent news for Ollama users, one of the most powerful models out there is (finally) supported by the Ollama team. Nice work guys!

thumb_up_off_alt11

chat_bubble_outline0

repeat1

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

12 days ago

This looks very interesting, a perfect size too... moonshotai/Kimi-Linear-48B-A3B-Instruct On MMLU-Pro (4k context length), Kimi Linear achieves 51.0 performance with similar speed as full attention. Offering significant speedups at long sequence lengths (1M tokens).

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

12 days ago

In stunning Krakow this evening. Hoping to catch up with Prince Canuma tomorrow.

In stunning Krakow this evening. Hoping to catch up with <a href="/Prince_Canuma/">Prince Canuma</a> tomorrow.

thumb_up_off_alt18

chat_bubble_outline2

repeat0

shareShare

John T Davies 🇺🇦🇪🇺🌍

@jtdavies

11 days ago

Wow!!! We spoke over lunch for 6 hours, so many ideas, Prince Canuma is going to take over the AI world. Remember this day folks, it started in Krakow! Top secret for now, co-investors, apply now!

Wow!!! We spoke over lunch for 6 hours, so many ideas, <a href="/Prince_Canuma/">Prince Canuma</a> is going to take over the AI world. Remember this day folks, it started in Krakow!
Top secret for now, co-investors, apply now!

thumb_up_off_alt28

chat_bubble_outline1

repeat1

shareShare