Arthur Zucker (@art_zucker)'s Twitter Profile
Arthur Zucker

@art_zucker

Head of transformers @huggingface 🤗

ID: 1444622906756063235

Joined: 03-10-2021 11:18:53

893 Tweets

4.4K Followers

465 Following

merve (@mervenoyann)'s Twitter Profile Photo

stop writing CUDA kernels yourself

we have launched Kernel Hub: easy optimized kernels for all models on Hugging Face Hub 🔥 use them right away!
it's where the community populates optimized kernels 🤝
keep reading 😏
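As a rough sketch of what "use them right away" looks like with the `kernels` package: fetch a pre-built kernel from the Hub and call it instead of compiling CUDA yourself. The repo and op names below are illustrative, and this needs a CUDA GPU plus `pip install kernels` to actually run.

```python
import torch
from kernels import get_kernel

# Download a community-optimized kernel from the Hugging Face Hub
# (repo name "kernels-community/activation" is an illustrative example).
activation = get_kernel("kernels-community/activation")

# Apply a fused GeLU kernel in place of a hand-written CUDA implementation
# (op name assumed from the repo's exposed functions).
x = torch.randn(8, 128, device="cuda", dtype=torch.float16)
out = torch.empty_like(x)
activation.gelu_fast(out, x)
```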
Pablo Montalvo (@m_olbap)'s Twitter Profile Photo

Ever wondered how models actually see an image? Been playing with some visualizations of patch extraction and token layouts, and how they affect predictions. Planning a short visual deep dive comparing how different models process images. Would love thoughts before I go on.
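The patch extraction step those visualizations revolve around can be sketched in a few lines: a ViT-style model cuts the image into fixed-size non-overlapping patches and flattens each one into a token. A pure-Python sketch (no real model or image library involved):

```python
def extract_patches(image, patch_size):
    """Split an H x W image (nested lists of pixel values) into
    non-overlapping patch_size x patch_size patches, row-major,
    each flattened into a single list (one "token" per patch)."""
    h, w = len(image), len(image[0])
    assert h % patch_size == 0 and w % patch_size == 0
    patches = []
    for top in range(0, h, patch_size):
        for left in range(0, w, patch_size):
            patch = [image[top + r][left + c]
                     for r in range(patch_size)
                     for c in range(patch_size)]
            patches.append(patch)
    return patches

# A 4x4 "image" with distinct pixel values becomes four 2x2 patches.
img = [[r * 4 + c for c in range(4)] for r in range(4)]
patches = extract_patches(img, 2)
# patches[0] == [0, 1, 4, 5]  (top-left patch, flattened row-major)
```

Varying `patch_size` here is exactly the knob that changes the token layout a model sees, which is why different models "see" the same image differently.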

Marc Sun (@_marcsun)'s Twitter Profile Photo

🚀 SGLang now supports Hugging Face Transformers as a backend!

Run any transformers-compatible model with fast, production-grade inference — no native support needed. Just plug and play 🥳

Blogpost: huggingface.co/blog/transform…
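As a sketch of the plug-and-play claim, the Transformers backend can be selected when launching the SGLang server; the model name is illustrative and the flag is an assumption based on the announcement, so check the blog post for the exact CLI.

```shell
# Launch an SGLang server that loads the model through Transformers
# instead of a native SGLang implementation (flag name assumed).
python -m sglang.launch_server \
  --model-path Qwen/Qwen3-4B \
  --model-impl transformers \
  --host 0.0.0.0 --port 30000
```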
Stas Bekman (@stasbekman)'s Twitter Profile Photo

My first project at Snowflake AI Research is complete!

I present to you Arctic Long Sequence Training (ALST) 

Paper: arxiv.org/abs/2506.13996
Blog: snowflake.com/en/engineering…

ALST is a set of modular, open-source techniques that enable training on sequences up to 15 million
Arthur Zucker (@art_zucker)'s Twitter Profile Photo

Sometimes I wish AI were much better than it is now, and I work hard for that. Then I realize I would be jobless, miserable, and sad...

Lysandre (@lysandrejik)'s Twitter Profile Photo

"The great unbloating" of transformers continues.

Over the past few weeks, 10+ PRs were merged, aiming to simplify code across the library.

This brought in refactors for Attention and the Cache, plus a new linter. We're improving type hints everywhere and evaluating type checkers.
Nathan (@nathanhabib1011)'s Twitter Profile Photo

Evaluation was just made easier 💯

We merged a huge refactor of lighteval, making it easier to add:
🔄 Multiturn tasks
🖼️ Multimodal tasks
📝 Plus unified logs for thorough benchmark analysis

Benchmark folks, what evals would you like to see added?

Omar Sanseviero (@osanseviero)'s Twitter Profile Photo

I’m so excited to announce Gemma 3n is here! 🎉

🔊Multimodal (text/audio/image/video) understanding
🤯Runs with as little as 2GB of RAM
🏆First model under 10B with an lmarena.ai score of 1300+

Available now on Hugging Face, Kaggle, llama.cpp, ai.dev, and more
PaddlePaddle (@paddlepaddle)'s Twitter Profile Photo

🚀Excited to announce that the ERNIE 4.5 series models are officially open-sourced today!

🙌ERNIE 4.5 models achieved state-of-the-art performance across multiple text and multimodal benchmarks, especially in instruction following, world knowledge memorization, visual
Lysandre (@lysandrejik)'s Twitter Profile Photo

BOOOM! transformers now has a baked-in HTTP server with an OpenAI-spec-compatible API.

Launch it with `transformers serve` and connect your favorite apps. Here I'm running 👋 Jan with local transformers and hot-swappable models.

There is preliminary tool call support as well!
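Since the server speaks the OpenAI chat-completions spec, any OpenAI-style client can talk to it. A stdlib-only sketch of building such a request (the port and endpoint path are assumptions taken from common OpenAI-compatible defaults, not from `transformers serve` docs):

```python
import json
from urllib import request

def build_chat_request(model: str, prompt: str,
                       base_url: str = "http://localhost:8000"):
    """Build an OpenAI-spec chat-completions request aimed at a local
    `transformers serve` instance (port and path are assumptions)."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }
    return request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request("Qwen/Qwen3-4B", "Hello!")
# Sending it would be `request.urlopen(req)`, which needs the server running.
```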

Arthur Zucker (@art_zucker)'s Twitter Profile Photo

I am actually quite baffled by this.... You add a model to transformers, even if you do it locally just before a release, and you can already run it.... BTW, 3x speedups are coming for this 👀

João Gante (@joao_gante)'s Twitter Profile Photo

LET'S GO! Cursor using local 🤗 transformers models!

You can now test ANY transformers-compatible LLM against your codebase. From hacking to production, it takes only a few minutes: anything `transformers` runs, you can serve into your app 🔥

Here's a demo with Qwen3 4B:
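The wiring is a config change rather than code: point an OpenAI-compatible client at the local server started by `transformers serve`. A sketch of what that looks like in an editor like Cursor (the setting names and port are assumptions about the UI, not verified):

```
# In the editor's model settings, override the OpenAI base URL
Base URL: http://localhost:8000/v1
API key:  any-non-empty-string   # a local server typically ignores it
Model:    Qwen/Qwen3-4B          # the model being served locally
```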