Kyle Huang (@kylehuang16) Twitter Tweets • TwiCopy

Rudy Gilman

a year ago

The attention layers in the VAEs for FLUX, Stable Diffusion 3.5, and SDXL don't do anything. You can ablate them with almost no effect. At first I thought they might be involved in some clever circuitry—maybe moving global information—but no they're just flailing around doing

thumb_up_off_alt813

chat_bubble_outline23

repeat61

shareShare

John J. Vastola

@johnjvastola

a year ago

There's a lot more to say, but I'll stop here for now. Check out the paper! openreview.net/forum?id=7lUdo…

thumb_up_off_alt179

chat_bubble_outline2

repeat14

shareShare

🍓🍓🍓

@iruletheworldmo

a year ago

it’s over turns out the rl victory lap was premature. new tsinghua paper quietly shows the fancy reward loops just squeeze the same tired reasoning paths the base model already knew. pass@1 goes up, sure, but the model’s world actually shrinks. feels like teaching a kid to ace

thumb_up_off_alt2,2K

chat_bubble_outline167

repeat265

shareShare

Kevin Grajeda

@k_grajeda

a year ago

RampenSau by David Aerne meodai.github.io/rampensau/

thumb_up_off_alt19

chat_bubble_outline1

repeat2

shareShare

Fotographer AI

@fotographerai

a year ago

We are pretty excited to announce the latest updates on our space on Hugging Face 🔥. Previously, subject consistency would sometimes break when generating the subject in various angles. Now do it with different backgrounds while keeping its consistency throughout angle changes

We are pretty excited to announce the latest updates on our space on <a href="/huggingface/">Hugging Face</a> 🔥. Previously, subject consistency would sometimes break when generating the subject in various angles. Now do it with different backgrounds while keeping its consistency throughout angle changes

thumb_up_off_alt11

chat_bubble_outline1

repeat5

shareShare

Maryam

@thedesignermrym

10 months ago

Gradient + Blur — Woww 💙 #figmaconfig2025

thumb_up_off_alt7,7K

chat_bubble_outline99

repeat419

shareShare

Ege

@egeberkina

10 months ago

GPT-4o + JSON = next-level visuals with precision and style! Prompt 👇

thumb_up_off_alt12,12K

chat_bubble_outline191

repeat774

shareShare

Kevin Grajeda

@k_grajeda

10 months ago

The same landing page, now with animations 👀✨

thumb_up_off_alt3,3K

chat_bubble_outline58

repeat96

shareShare

AK

@_akhaliq

10 months ago

Flow-GRPO Training Flow Matching Models via Online RL

thumb_up_off_alt272

chat_bubble_outline6

repeat37

shareShare

Marc Hemeon

@hemeon

10 months ago

AirBnB icons but make them Dieter Rams. Prompt below, I think you can load this in 4k too..

thumb_up_off_alt4,4K

chat_bubble_outline78

repeat195

shareShare

Charlie Clark

@clarkcharlie03

10 months ago

I've been quietly generating a collection of 3D icons for my WIP project Thiings over the last few weeks. It felt like a good time to open the collection up (*cough* airbnb *cough*) All icons available for download as PNGs with transparent backgrounds. Link below 👇

thumb_up_off_alt1,1K

chat_bubble_outline76

repeat66

shareShare

Kyle Huang

@kylehuang16

10 months ago

The problem with oAI: hiring product people has zero understanding of the problem. They try to maximizes vibe coding, targeting engineers who prefer full control and precision. Unless GenAI is 100% reliable, it isn’t suitable for professional tools. youtube.com/live/hhdpnbfH6…

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Nitish Khagwal

@nitishkmrk

10 months ago

run action interaction ⎯⟡°

thumb_up_off_alt767

chat_bubble_outline28

repeat39

shareShare

Kyle Huang

@kylehuang16

10 months ago

so...

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Black Forest Labs

@bfl_ml

9 months ago

NEW: Visit our Self-Serve Portal and get commercial licenses for our open weights models with only a few clicks. One portal. Transparent pricing. Retrieve a commercial license in minutes. Read more about FLUX.1 Kontext [Dev] in our deep dive bfl.ai/announcements/…

thumb_up_off_alt76

chat_bubble_outline2

repeat4

shareShare

Sayak Paul

@risingsayak

9 months ago

Had the honor to present diffusion transformers at CS25, Stanford. The place is truly magical. Slides: bit.ly/dit-cs25 Recording: youtu.be/vXtapCFctTI?si… Thanks to Steven Feng for making it happen!

thumb_up_off_alt1,1K

chat_bubble_outline15

repeat128

shareShare

Bao Pham

@baophamhq

8 months ago

During training, diffusion models are being taught to be effective denoisers, like Associative Memory systems. At what point do these models stop being denoisers and behaving like data generators? To learn about how these models arise from being Associative Memory systems to

thumb_up_off_alt428

chat_bubble_outline6

repeat64

shareShare

Google AI Developers

@googleaidevs

8 months ago

📢 The Gemini Embedding text model (gemini-embedding-001) is now generally available in the Gemini API via Google AI Studio. It supports 100+ languages and uses Matryoshka Representation Learning for flexible output dimensions, allowing devs to scale down from 3072 dimensions.

thumb_up_off_alt1,1K

chat_bubble_outline23

repeat143

shareShare

Theo Panagiotopoulos

@theopanag7

7 months ago

If you only want dominant colors - just do a quick k-means on the 25 pixels, add some contrast, and get a pretty good color palette 🎨 You can run the same algorithm on groups of images and use the output to influence your shaders 🧑‍🎨 (5/6)

thumb_up_off_alt29

chat_bubble_outline1

repeat2

shareShare