Shelly Golan (@shelly_golan1) 's Twitter Profile
Shelly Golan

@shelly_golan1

ID: 1799855277258272768

calendar_today09-06-2024 17:25:25

5 Tweet

7 Followers

17 Following

Elad Richardson (@eladrichardson) 's Twitter Profile Photo

Really impressive results for human-object interaction. They use a two-phase process where they optimize the diffusion noise, instead of the motion itself, to get to sub-centimeter precision while staying on manifold ๐Ÿง  HOIDiNi - hoidini.github.io

Really impressive results for human-object interaction. They use a two-phase process where they optimize the diffusion noise, instead of the motion itself, to get to sub-centimeter precision while staying on manifold ๐Ÿง 

HOIDiNi - hoidini.github.io
Guy Tevet (@guytvt) 's Twitter Profile Photo

1/ Can we teach a motion model to "dance like a chicken" Or better: Can LoRA help motion diffusion models learn expressive, editable styles without forgetting how to move? Led by Haim Sawdayee, Chuan Guo, we explore this in our latest work. ๐ŸŽฅ haimsaw.github.io/LoRA-MDM/ ๐Ÿงต๐Ÿ‘‡

Roi Bar-On (@roibar_on) 's Twitter Profile Photo

1/9 Excited to share EditP23! ๐ŸŽจ Finally, a single tool for ALL your 3D editing needs: โœ… Pose & Geometry Changes โœ… Object Additions โœ… Global Style Transformations โœ… Local Modifications All driven by one simple 2D image edit. It's mask-free โœจ and works in seconds โšก๏ธ. ๐Ÿงต

1/9
Excited to share EditP23! ๐ŸŽจ
Finally, a single tool for ALL your 3D editing needs:
โœ… Pose & Geometry Changes
โœ… Object Additions
โœ… Global Style Transformations
โœ… Local Modifications
All driven by one simple 2D image edit. It's mask-free โœจ and works in seconds โšก๏ธ.
๐Ÿงต
Omer Dahary (@omerdahary) 's Twitter Profile Photo

Everyone uses CFG with w = 7.5โ€ฆ but why?? ๐Ÿค” For non-trivial prompts, choosing w is hard: too low โ†’ weak alignment, too high โ†’ artifacts. We show w shouldnโ€™t be fixed โ€” it should adapt! โœจ We present a tiny but smart MLP to adjust w over time โ†’ Better images/alignment ๐Ÿš€ 1/n

Everyone uses CFG with w = 7.5โ€ฆ but why?? ๐Ÿค”
For non-trivial prompts, choosing w is hard:
too low โ†’ weak alignment, too high โ†’ artifacts.
We show w shouldnโ€™t be fixed โ€” it should adapt! โœจ
We present a tiny but smart MLP to adjust w over time
โ†’ Better images/alignment ๐Ÿš€

1/n
Guy Ohayon (@guy__ohayon) 's Twitter Profile Photo

The Mahalanobis distance is the natural metric for Gaussian signals. But how can it be generalized to arbitrary probability densities? And how should a solution be tested? We address these questions in a new paper with Pierre-ร‰tienne Fiquet Florentin Guth Jona Ballรฉ, and Eero Simoncelli

The Mahalanobis distance is the natural metric for Gaussian signals. But how can it be generalized to arbitrary probability densities? And how should a solution be tested? We address these questions in a new paper with <a href="/pe_fiquet/">Pierre-ร‰tienne Fiquet</a> <a href="/FlorentinGuth/">Florentin Guth</a> Jona Ballรฉ, and <a href="/EeroSimoncelli/">Eero Simoncelli</a>
Rishubh Parihar (@rishubhparihar) 's Twitter Profile Photo

โ€œMake it red.โ€ โ€œNo! More red!โ€ โ€œUghhโ€ฆ slightly less red.โ€ โ€œPerfect!โ€ โ™ฅ๏ธ ๐ŸŽš๏ธKontinuous Kontext adds slider-based control over edit strength to instruction-based image editing, enabling smooth, continuous transformations!

Zarloya Vinzot (@nirgoren) 's Twitter Profile Photo

The initial noise in diffusion models is surprisingly correlated with the final image. Our NoisePrints paper exploits this to provide a lightweight, distortion-free, cryptographically secure watermark for proving authorship of generated images & videos, requiring no model access.

The initial noise in diffusion models is surprisingly correlated with the final image.
Our NoisePrints paper exploits this to provide a lightweight, distortion-free, cryptographically secure watermark for proving authorship of generated images &amp; videos, requiring no model access.
Nupur Kumari (@nupurkmr9) 's Twitter Profile Photo

๐Ÿš€ New preprint! We present NP-Edit, a framework for training an image editing diffusion model without paired supervision. We use differentiable feedback from Vision-Language Models (VLMs) combined with distribution-matching loss (DMD) to learn editing directly. webpage:

Guy Tevet (@guytvt) 's Twitter Profile Photo

(1/4) [HOIDiNi] hoidini.github.io ๐Ÿงต: Diffusion models are great at generating free-form human motion but tend to break down when objects enter the scene. Humanโ€“object interaction demands millimetric precision, and even tiny errors cause hands to float or penetrate surfaces

Mor Ventura (@mor_ventura95) 's Twitter Profile Photo

โ€œWhat big teeth you have!โ€ said Red.๐Ÿ‘ฉโ€๐Ÿฆฐ โ€œAll because my model suffers from semantic leakage,โ€ said the Wolf.๐Ÿบ When Text-to-Image models blur boundaries, identities collapse. Meet ๐ƒ๐ž๐‹๐ž๐š๐ค๐ž๐ซ, a lightweight inference-time fix that mitigates semantic leakage! ๐Ÿ‘‡

โ€œWhat big teeth you have!โ€ said Red.๐Ÿ‘ฉโ€๐Ÿฆฐ
 โ€œAll because my model suffers from semantic leakage,โ€ said the Wolf.๐Ÿบ

When Text-to-Image models blur boundaries, identities collapse.
Meet ๐ƒ๐ž๐‹๐ž๐š๐ค๐ž๐ซ, a lightweight inference-time fix that mitigates semantic leakage!
๐Ÿ‘‡
Shai Yehezkel (@yehezkelshai) 's Twitter Profile Photo

Visual Diffusion Models are Geometric Solvers We cast geometry as images: a plain diffusion model denoises into valid solutions. It is simple, general and effective. Shown on Inscribed Square, Steiner Tree, and Maximum Area Polygonization - all classic hard problems.

Visual Diffusion Models are Geometric Solvers

We cast geometry as images: a plain diffusion model denoises into valid solutions. It is simple, general and effective.
Shown on Inscribed Square, Steiner Tree, and Maximum Area Polygonization - all classic hard problems.