Kyle Huang (@kylehuang16) 's Twitter Profile
Kyle Huang

@kylehuang16

AI/UX | HCI researcher | Creative technologist | Nvidia | MSR
Interest: Generative UI + Dynamic Experience

ID: 4761559279

linkhttp://kylehuang.design calendar_today15-01-2016 05:42:49

2,2K Tweet

1,1K Followers

1,1K Following

Rudy Gilman (@rgilman33) 's Twitter Profile Photo

The attention layers in the VAEs for FLUX, Stable Diffusion 3.5, and SDXL don't do anything. You can ablate them with almost no effect. At first I thought they might be involved in some clever circuitry—maybe moving global information—but no they're just flailing around doing

🍓🍓🍓 (@iruletheworldmo) 's Twitter Profile Photo

it’s over turns out the rl victory lap was premature. new tsinghua paper quietly shows the fancy reward loops just squeeze the same tired reasoning paths the base model already knew. pass@1 goes up, sure, but the model’s world actually shrinks. feels like teaching a kid to ace

it’s over 

turns out the rl victory lap was premature. new tsinghua paper quietly shows the fancy reward loops just squeeze the same tired reasoning paths the base model already knew. pass@1 goes up, sure, but the model’s world actually shrinks. feels like teaching a kid to ace
Fotographer AI (@fotographerai) 's Twitter Profile Photo

We are pretty excited to announce the latest updates on our space on Hugging Face 🔥. Previously, subject consistency would sometimes break when generating the subject in various angles. Now do it with different backgrounds while keeping its consistency throughout angle changes

We are pretty excited to announce the latest updates on our space on <a href="/huggingface/">Hugging Face</a> 🔥. Previously, subject consistency would sometimes break when generating the subject in various angles. Now do it with different backgrounds while keeping its consistency throughout angle changes
Charlie Clark (@clarkcharlie03) 's Twitter Profile Photo

I've been quietly generating a collection of 3D icons for my WIP project Thiings over the last few weeks. It felt like a good time to open the collection up (*cough* airbnb *cough*) All icons available for download as PNGs with transparent backgrounds. Link below 👇

Kyle Huang (@kylehuang16) 's Twitter Profile Photo

The problem with oAI: hiring product people has zero understanding of the problem. They try to maximizes vibe coding, targeting engineers who prefer full control and precision. Unless GenAI is 100% reliable, it isn’t suitable for professional tools. youtube.com/live/hhdpnbfH6…

Black Forest Labs (@bfl_ml) 's Twitter Profile Photo

NEW: Visit our Self-Serve Portal and get commercial licenses for our open weights models with only a few clicks. One portal. Transparent pricing. Retrieve a commercial license in minutes. Read more about FLUX.1 Kontext [Dev] in our deep dive bfl.ai/announcements/…

Sayak Paul (@risingsayak) 's Twitter Profile Photo

Had the honor to present diffusion transformers at CS25, Stanford. The place is truly magical. Slides: bit.ly/dit-cs25 Recording: youtu.be/vXtapCFctTI?si… Thanks to Steven Feng for making it happen!

Bao Pham (@baophamhq) 's Twitter Profile Photo

During training, diffusion models are being taught to be effective denoisers, like Associative Memory systems. At what point do these models stop being denoisers and behaving like data generators? To learn about how these models arise from being Associative Memory systems to

During training, diffusion models are being taught to be effective denoisers, like Associative Memory systems. At what point do these models stop being denoisers and behaving like data generators? 

To learn about how these models arise from being Associative Memory systems to
Google AI Developers (@googleaidevs) 's Twitter Profile Photo

📢 The Gemini Embedding text model (gemini-embedding-001) is now generally available in the Gemini API via Google AI Studio. It supports 100+ languages and uses Matryoshka Representation Learning for flexible output dimensions, allowing devs to scale down from 3072 dimensions.

📢 The Gemini Embedding text model (gemini-embedding-001) is now generally available in the Gemini API via Google AI Studio. It supports 100+ languages and uses Matryoshka Representation Learning for flexible output dimensions, allowing devs to scale down from 3072 dimensions.
Theo Panagiotopoulos (@theopanag7) 's Twitter Profile Photo

If you only want dominant colors - just do a quick k-means on the 25 pixels, add some contrast, and get a pretty good color palette 🎨 You can run the same algorithm on groups of images and use the output to influence your shaders 🧑‍🎨 (5/6)