Deepseek R1 (3bit) running with EXO Labs on 2 x M2 Ultra Mac Studios w/ 192GB RAM each, connected via Thunderbolt. 8.5tok/s average.
Shoutout to Alex Cheema - e/acc for getting on a live call with me to live-code some optimizations and get this working reliably!