Alex (@alex_tra_memory)'s Twitter Profile
Alex

@alex_tra_memory

@fluidinference, 1000+ ⭐ @ github.com/FluidInference…

ID: 1947288873018531841

Joined: 21-07-2025 13:33:56

90 Tweets

67 Followers

267 Following

SYMPHONICS (@symphocell)

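A Swift library for speaker diarization. It does transcription too. Speaker identification doesn't seem possible out of the box, so some modification will probably be needed. | FluidInference/FluidAudio: Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS. github.com/FluidInference…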

Hamza Qayyum (@hamza_q_)

The CAM++ CoreML conversion I did myself, but the segmentation-3.0 model on CoreML is made possible thanks to the amazing FluidAudio project by FluidInference: github.com/FluidInference…

Alex (@alex_tra_memory)

a few things i have learned about custom vocab support for STT:
- the idea sounds promising and useful, however custom vocab support is quite limited in usage. you can't just create a universal custom vocab list; it needs to be tailored to specific fields
- this is all relatively…

vik (@vikhyatk)

opus 4.5 marked the jump from sde1 to sde2. i’m having a lot of fun and getting a lot of shit done but i’m also realizing we’re going to be the last generation of software engineers

Alex (@alex_tra_memory)

I think Gemini 3 is the best so far. For ML applications it's able to think a problem through over hours and very long context windows without any issues, even on parts where it failed multiple times. Google might have outplayed themselves with their TPU. Getting the models to be…

Alex (@alex_tra_memory)

the most ambiguous part about understanding models is whether you can explain things conceptually or if it was just happenstance. parakeet eou has 17 encoder layers not for any conceptual reason but simply because test results suggested 17 is the sweet spot.

Alex (@alex_tra_memory)

currently running on an M2 2022 mac, gemini3 is too damn slow and eats up so much of my ram. it doesn't know when to quit, which is a flaw because it will spend hours on something and get nothing working. i think opus 4.5 might be better; it gaslights less now.

Alex (@alex_tra_memory)

using coding agents is a lot like being a consultant. you don't do a lot of actual work, but the project can only go as far as what you know.