diffusers (@diffuserslib) 's Twitter Profile
diffusers

@diffuserslib

Yes you're speaking with the ๐Ÿค— @huggingface ๐Ÿงจ diffusers library personally

ID: 1588477838390329344

linkhttps://github.com/huggingface/diffusers calendar_today04-11-2022 10:27:09

128 Tweet

2,2K Followers

22 Following

camenduru (@camenduru) 's Twitter Profile Photo

MagicAnimate: ๐Ÿ’ƒ Temporally Consistent Human Image Animation using Diffusion Model ๐Ÿ•บโ€ Colab ๐Ÿฅณ Thanks to Zhongcong Xu โค Jianfeng Zhang โค Jun Hao Liew โค Hanshu Yan โค Jia-Wei Liu โค Chenxu Zhang โค Jiashi Feng โค Mike Shou โค ๐ŸŒpage: showlab.github.io/magicanimate/ ๐Ÿ“„paper:

Sanchit Gandhi (@sanchitgandhi99) 's Twitter Profile Photo

Introducing the smallest Distil-Whisper model yet! distil-small.en is over 10x smaller, 5x faster and within 3% WER of large-v2 ๐ŸŽฏ At just 166M parameters, it's is perfect for low-memory environments, such as on-device or mobile ๐Ÿ“ž Get started here: huggingface.co/distil-whisperโ€ฆ

Sayak Paul (@risingsayak) 's Twitter Profile Photo

๐Ÿงจ diffusers reached 20k stars on GitHub ๐Ÿ’ซ But like many others, I am not a firm believer in this metric. So, let's also consider the number of repos that rely on it and the SUM of their stars. This gives a better view point about the library ๐Ÿค— Thanks to all our contributors

๐Ÿงจ diffusers reached 20k stars on GitHub ๐Ÿ’ซ

But like many others, I am not a firm believer in this metric. So, let's also consider the number of repos that rely on it and the SUM of their stars. This gives a better view point about the library ๐Ÿค—

Thanks to all our contributors
diffusers (@diffuserslib) 's Twitter Profile Photo

Google's MUSE: muse-model.github.io reproduced โœ… Why token-base image generationโ“ - very under explored research - needed for true multi-modal models - better at style transfer ๐Ÿ‘‰ huggingface.co/spaces/amused/โ€ฆ

Google's MUSE: muse-model.github.io reproduced โœ…

Why token-base image generationโ“
- very under explored research
- needed for true multi-modal models
- better at style transfer

๐Ÿ‘‰ huggingface.co/spaces/amused/โ€ฆ
apolinario ๐ŸŒ (@multimodalart) 's Twitter Profile Photo

The first open Stable Diffusion 3-like architecture model is JUST out ๐Ÿ’ฃ - but it is not SD3! ๐Ÿค” It is HunyuanDiT by Tencent, a 1.5B parameter DiT (diffusion transformer) text-to-image model ๐Ÿ–ผ๏ธโœจ In the paper they claim to be SOTA open source! I'm working on a Hugging Face demo

The first open Stable Diffusion 3-like architecture model is JUST out ๐Ÿ’ฃ - but it is not SD3! ๐Ÿค”

It is HunyuanDiT by Tencent, a 1.5B parameter DiT (diffusion transformer) text-to-image model ๐Ÿ–ผ๏ธโœจ

In the paper they claim to be SOTA open source! I'm working on a <a href="/huggingface/">Hugging Face</a> demo
apolinario ๐ŸŒ (@multimodalart) 's Twitter Profile Photo

Demo for the first open SD3-like architecture model, HunyuanDiT Hugging Face Spaces demo is out! ๐ŸŽจ First impressions: - Image quality seems very good! - Chunky and the research code isn't super optimized for inference speed (๐Ÿ‘‹ diffusers ๐Ÿ‘€) โ–ถ๏ธ huggingface.co/spaces/multimoโ€ฆ

diffusers (@diffuserslib) 's Twitter Profile Photo

we just dropped an insane new release ๐Ÿฃ - support to new pipelines: audio ๐Ÿ”Š, video ๐ŸŽฌ and image ๐Ÿ–ผ๏ธ models (FLUX, Stable Audio, CogVideoX, Kolors, AuraFlow and moar!) - native PAG support for image quality boost ๐Ÿ’จ - AnimatedDiff ๐Ÿค SparseCtrl github.com/huggingface/diโ€ฆ

Ahn Donghoon (@donghoon_ahn) 's Twitter Profile Photo

Now, PAG is officially supported by Diffusers in the stable version! Try it out๐Ÿฅฐ Use cases: huggingface.co/docs/diffusersโ€ฆ Supported pipelines: huggingface.co/docs/diffusersโ€ฆ We would like to extend our gratitude to the amazing team at Hugging Face for their incredible work. Special thanks

apolinario ๐ŸŒ (@multimodalart) 's Twitter Profile Photo

CogVideoX just released the weights for its 5B model! ๐ŸŽฅ โœจ It's the best open weights text-to-video model - competitive with Runway /ย Luma /ย Pika. With ๐Ÿงจdiffusers, it fits on < 10GB VRAM ๐Ÿค (ah, and they changed the smaller 2B model license to Apache 2.0 ๐Ÿ”ฅ)

Sayak Paul (@risingsayak) 's Twitter Profile Photo

We now support loading and inferencing with two non-diffusers Flux LoRAs 1> X-Labs 2> Kohya (Kohya Tech) Thanks to apolinario ๐ŸŒ for jamming on this with me! github.com/huggingface/diโ€ฆ

diffusers (@diffuserslib) 's Twitter Profile Photo

we now support any FLUX LoRA you send our way: trained with Kohya, X-Labs, Simple-Tuner, AI-Toolkit, Replicate, FAL, Hugging Face, CivitAI, ComfyUI, diffusers (duh!)? No problem, we support it!

apolinario ๐ŸŒ (@multimodalart) 's Twitter Profile Photo

Video-to-video is now available in the official CogVideoX-5B Space ๐Ÿ”ฅ Try it out ๐ŸŽฅ โžก๏ธ๐ŸŽฅ huggingface.co/spaces/THUDM/Cโ€ฆ

diffusers (@diffuserslib) 's Twitter Profile Photo

the goated Aryan V S added support for video-to-video in the diffusers pipeline ๐Ÿ you can try it locally or now on the official space

apolinario ๐ŸŒ (@multimodalart) 's Twitter Profile Photo

The Logo in Context Spaces demo + ๐Ÿงจ diffusers implementation is here! ๐Ÿ–ผ๏ธ๐Ÿท๏ธ In-Context LoRA + Image-to-Image + Inpainting โ†’ allow you to apply your logos to anything huggingface.co/spaces/multimoโ€ฆ

The Logo in Context Spaces demo + ๐Ÿงจ diffusers implementation is here! ๐Ÿ–ผ๏ธ๐Ÿท๏ธ

In-Context LoRA + Image-to-Image + Inpainting โ†’ allow you to apply your logos to anything

huggingface.co/spaces/multimoโ€ฆ