PrismML (@prismml) 's Twitter Profile
PrismML

@prismml

Centering AI research on efficiency.

discord.gg/prismml

ID: 1905730305136455680

https://prismml.com/ · Joined 28-03-2025 21:15:10

75 Tweets

9.9K Followers

21 Following

HessianFree (@hessianfree) 's Twitter Profile Photo

> > anon asked for one more state
> > we added zero
> > +600 MB
> > +5 benchmark points
> > 75.5 avg at 1.75 GB
> > still ~1/9 the size of Qwen3 8B
> > shout out brahmagupta
> > zero mattered
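The size math in the greentext checks out to first order: a ternary weight has three states {-1, 0, +1}, which is log2(3) ≈ 1.58 bits of information. A back-of-the-envelope sketch (ideal packing only; real checkpoints also carry embeddings, scales, and non-ternary layers, hence 1.75 GB rather than ~1.58 GB):

```python
import math

def ternary_footprint_gb(n_params: float) -> float:
    """Ideal storage for n_params ternary weights at log2(3) bits each."""
    bits_per_weight = math.log2(3)  # ~1.585 bits: states {-1, 0, +1}
    return n_params * bits_per_weight / 8 / 1e9  # bits -> bytes -> GB

def bf16_footprint_gb(n_params: float) -> float:
    """Same parameter count stored in bf16 (16 bits each)."""
    return n_params * 16 / 8 / 1e9

n = 8e9  # 8B parameters
print(f"ternary: {ternary_footprint_gb(n):.2f} GB")  # 1.58 GB
print(f"bf16:    {bf16_footprint_gb(n):.2f} GB")     # 16.00 GB
print(f"ratio:   {bf16_footprint_gb(n) / ternary_footprint_gb(n):.1f}x")  # 10.1x
```

The ~10x ratio over bf16 is where the "~1/9 the size" and "10x smaller" figures in the thread come from.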

Sahin Lale (@sahinlale) 's Twitter Profile Photo

Turns out adding 0 helps :) Today we’re introducing Ternary Bonsai 🌳, a family of end-to-end 1.58-bit language models in 8B, 4B, and 1.7B sizes. Ternary Bonsai 8B is within 5% of Qwen 3 8B at 9x lower memory. Still tiny. Noticeably smarter.
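For readers unfamiliar with "1.58-bit": end-to-end ternary models constrain weights to {-1, 0, +1} plus a per-tensor scale. A minimal sketch of one common quantizer, the absmean scheme used by BitNet b1.58 (whether Ternary Bonsai uses this exact scheme is an assumption):

```python
import numpy as np

def absmean_ternary(w: np.ndarray, eps: float = 1e-8):
    """Quantize a weight tensor to {-1, 0, +1} with a per-tensor scale.

    Absmean scheme: divide by the mean absolute weight, round to the
    nearest integer, clip to [-1, 1]. Then w is approximated by scale * q.
    """
    scale = np.mean(np.abs(w)) + eps
    q = np.clip(np.round(w / scale), -1, 1)
    return q.astype(np.int8), scale

rng = np.random.default_rng(0)
w = rng.normal(scale=0.02, size=(4, 4)).astype(np.float32)
q, s = absmean_ternary(w)
print(q)                        # every entry is -1, 0, or +1
print(np.abs(w - s * q).max())  # per-tensor reconstruction error
```

The zero state is what distinguishes this from earlier binary (1-bit) schemes: it lets the quantizer prune small weights to exactly 0 instead of forcing them to ±1.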

Prince Canuma (@prince_canuma) 's Twitter Profile Photo

Awesome release, congrats to the PrismML team! It comes with day-0 support on MLX thanks to some of the work we did with bitnet-1.58 kernels a year ago. huggingface.co/collections/pr…

Xenova (@xenovacom) 's Twitter Profile Photo

Ternary Bonsai: state-of-the-art intelligence at 1.58 bits. The models are so small they can even run locally in your browser on WebGPU! ⚡️ Here's the 8B version (just ~2GB in size) running at 60 tokens per second on my M4 Max. Try the demo out yourself! 👇

Adrien Grondin (@adrgrondin) 's Twitter Profile Photo

New models from PrismML. Incredible intelligence density. The new models outperform others in comparable parameter classes. Available on iPhone in Locally AI - Local AI Chat.

Sahin Lale (@sahinlale) 's Twitter Profile Photo

Check out how you can use Ternary Bonsai 8B 🌳 for tool calling in your everyday life—an impressive demo on an amazing platform by AnythingLLM and Tim Carambat!

Robert Scoble (@scobleizer) 's Twitter Profile Photo

The models are getting smaller. Great for OpenClaws and Hermes. Gotta heat them up! Yesterday someone told me "phones are three to five years away." Oh, really?

0xSero (@0x_sero) 's Twitter Profile Photo

One of the things I tried researching but found really hard. 1.58bpw is insane: 10x smaller than the original. I hope they push it to much larger models

Jon Durbin (@jon_durbin) 's Twitter Profile Photo

Ternary is actually surprisingly powerful. Validated by bitnet and now again here. In the new model training research/experimentation I've been working on, ternary weights (in some places) actually beat bf16 (by a not-insignificant amount), at least up to the 7b scale (and

Grover GPT (@grovergpt) 's Twitter Profile Photo

Tiny local models like Bonsai are going to change things. For the last three years, the default way most people used AI was simple: frontier models lived in data centres, you reached them through an API, and anything local felt like a toy. That will probably stop being true in

Pico AI Server and Pico AI Studio (@picogpt) 's Twitter Profile Photo

Pico Local AI Server 1.4.21 is now available on the Mac App Store. This release adds support for Ternary Bonsai, a lightning-fast model that outperforms many much larger models

Ronald Mannak (@ronaldmannak) 's Twitter Profile Photo

Huge shoutout to PrismML! This shouldn’t be possible: a tiny model punching way above its weight. The largest version is just 1.14 GB, which means it’s small enough for a phone. Fast on a phone (spoiler: Pico for iOS is coming soon!). Insanely fast on a MacBook Pro M1 Max.

Data Council (@datacouncilai) 's Twitter Profile Photo

Training gets the headlines. Inference gets the bill. As agents move from novelty to default workload, the hard problem isn't the model anymore. It's every millisecond and every watt between a prompt and the next token. A coding agent running for six hours straight is a very