apolinario (@multimodalart)'s Twitter Profile
apolinario

@multimodalart

ML for Art and Creativity, working @HuggingFace ([email protected])

ID: 1415428329210105859

Link: http://multimodal.art
Joined: 14-07-2021 21:50:22

3.3K Tweets

13.13K Followers

519 Following

Sayak Paul (@risingsayak)

We present HeadHunter, a framework for principled analysis of perturbed attention guidance 🤖

Consequently, it enables deeply fine-grained control over the generation quality & visual attributes.

Join in 🧵 for insights and "guidance".

1/12
apolinario (@multimodalart)

this would make sense in so many dimensions, for both krea and the community imo - the open image community is looking for a "next gen" model, after Imagen 4, GPT-Image-1, Recraft v3, Reve, etc. - opening it up would foster a customization community: controlnets, fine-tuning

apolinario (@multimodalart)

Who's gonna take the next-generation open image generation crown? 👑 Both Reve and Recraft could be leading image generation now and be the top-of-mind imo. They could've 10-100x the impact of their "red panda" 🐼 and "halfmoon" 🌙 leading the charts... if they had open

apolinario (@multimodalart)

this is not a drill 🚨, real-time open source video generation is here 🔥 Self-Forcing - a real-time video distilled model from Wan 2.1 by Adobe is out, and they open sourced it. I've built a live real time demo on Hugging Face Spaces 📹💨

LAION (@laion_ai)

LAION proudly presents 2 state-of-the-art emotion detection models for voice and face, surpassing Gemini 2.5 Pro and Hume API. They are completely open under a CC BY 4.0 license, alongside a ~5,000-hour voice-acting dataset & 2 expert-annotated benchmarks. laion.ai/blog/do-they-s…

Omar Sanseviero (@osanseviero)

So excited to welcome Google's model #1000 at Hugging Face: Magenta Real Time! 🤯

🎷 Music generation model
⚡️ Real-time
👀 Permissive license
800 million parameters

Model: hf.co/google/magenta…
Blog: magenta.withgoogle.com/magenta-realti…