slm tokens (@tulkenss)'s Twitter Profile
slm tokens

@tulkenss

🦑 Machine learning biped

ID: 3020046891

Link: https://stephantul.github.io
Joined: 05-02-2015 16:43:13

1.1K Tweets

245 Followers

554 Following

slm tokens (@tulkenss):

This is interesting to me because static models are probably the fastest and worst models you can make. There is a market for very fast + bad: some tasks you can only do if you are very, very fast. That is the value proposition for static models.
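(For context, a minimal sketch of why static-model inference is so fast, assuming the usual setup of a pre-computed token-embedding table pooled by a mean. The vocabulary, dimensionality, and values below are made up for illustration.)

```python
import numpy as np

# Toy vocabulary and embedding table; in a real static model these are
# pre-computed (e.g. distilled from a transformer) and loaded from disk.
vocab = {"static": 0, "models": 1, "are": 2, "fast": 3}
embeddings = np.random.randn(len(vocab), 256).astype(np.float32)

def encode(tokens: list[str]) -> np.ndarray:
    """'Inference' is a table lookup plus a mean: no attention and no
    per-layer matmuls, so it runs in microseconds on CPU."""
    ids = [vocab[t] for t in tokens if t in vocab]
    return embeddings[ids].mean(axis=0)

print(encode(["static", "models", "are", "fast"]).shape)  # (256,)
```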

slm tokens (@tulkenss):

Obvious observation perhaps, but if your agent has trouble writing tests for your code, you haven't yet hit the right abstraction level. Instead of letting the agent persevere, or asking it to refactor, just take a look. (this is a "look at your data" post)

slm tokens (@tulkenss):

I'm curious about batching in services. I've always done batch size 1 inference for requests (i.e., using FastAPI + a model). Inefficient, but easy. So it seems to me that actual latency for transformers on GPUs is much higher per instance than is typically claimed. Anyone?
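(A minimal sketch of the setup described above, i.e. batch-size-1 inference behind FastAPI. The model name and route are illustrative, not taken from the tweet.)

```python
from fastapi import FastAPI
from pydantic import BaseModel
from sentence_transformers import SentenceTransformer

app = FastAPI()

# Any encoder works here; this model name is just an example.
model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")

class EmbedRequest(BaseModel):
    text: str

@app.post("/embed")
def embed(req: EmbedRequest) -> list[float]:
    # Batch size 1: each request gets its own forward pass. Simple,
    # but the GPU sits mostly idle, and per-request latency is whatever
    # a single-sequence forward pass costs.
    return model.encode([req.text])[0].tolist()
```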

slm tokens (@tulkenss):

The first good ModernBERT static model: huggingface.co/stephantulkens… Still need to push the scores, but I'm simply too stoked!
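(The Hugging Face link in the tweet is truncated, so as a hedged guess at usage: if the model follows the model2vec StaticModel format, loading it would look roughly like this. The repo id below is a hypothetical placeholder, not the actual model id.)

```python
from model2vec import StaticModel

# Placeholder repo id -- substitute the real id from the (truncated) link.
model = StaticModel.from_pretrained("stephantulkens/modernbert-static-placeholder")
embeddings = model.encode(["a static model distilled from ModernBERT"])
print(embeddings.shape)
```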