TL;DR - we improve text-to-image output quality by tuning an LLM to predict ComfyUI workflows tailored to each generation prompt
Project page: comfygen-paper.github.io
Paper: arxiv.org/abs/2410.01731
[1/4]
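For a concrete picture of how prompt-conditioned workflow prediction could be wired up, here is a minimal, hypothetical sketch. It assumes a causal LM fine-tuned to emit a ComfyUI workflow graph in API-JSON format (the checkpoint name below is a placeholder, not a released model), and uses ComfyUI's standard /prompt endpoint to queue the generated workflow.

```python
import json
import requests
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder checkpoint name: assumes an LLM fine-tuned to map a text-to-image
# prompt to a ComfyUI workflow graph in API-JSON format.
MODEL_ID = "your-org/comfy-workflow-predictor"

tok = AutoTokenizer.from_pretrained(MODEL_ID)
llm = AutoModelForCausalLM.from_pretrained(MODEL_ID)

def predict_workflow(prompt: str) -> dict:
    """Generate a workflow graph tailored to this generation prompt."""
    inputs = tok(f"Prompt: {prompt}\nWorkflow:", return_tensors="pt")
    out = llm.generate(**inputs, max_new_tokens=2048, do_sample=False)
    gen = tok.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
    return json.loads(gen)

def run_in_comfyui(workflow: dict, host: str = "http://127.0.0.1:8188") -> dict:
    # ComfyUI queues a workflow posted as {"prompt": <api-format graph>}.
    return requests.post(f"{host}/prompt", json={"prompt": workflow}).json()

if __name__ == "__main__":
    wf = predict_workflow("a cozy cabin in a snowy forest, golden hour, 35mm photo")
    print(run_in_comfyui(wf))
```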
[NEW Preprint] 🔔🔔 CLoSD embeds real-time Motion Diffusion into a multi-task RL agent. Performing a task is as easy as describing it with a text prompt!
Want to move to the next task? Just change the prompt on the fly😁 [1/4]🧵
guytevet.github.io/CLoSD-page/
Ever wondered how a SINGLE token represents all subject regions in personalization? Many methods use this token in cross-attention, meaning all semantic parts share the same single attention value. We present Nested Attention, a mechanism that generates localized attention values.
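Roughly, the idea can be sketched as an inner attention that produces a query-dependent value for the subject token, which then replaces that token's shared value in the outer cross-attention. The code below is an illustration of that mechanism under assumed shapes and projection weights, not the paper's exact implementation.

```python
import torch
import torch.nn.functional as F

def nested_values(img_queries, subject_feats, w_k, w_v):
    """Inner attention: each image query attends over subject-image features,
    yielding its own value vector for the single personalization token."""
    k = subject_feats @ w_k                      # (n_subj, d)
    v = subject_feats @ w_v                      # (n_subj, d)
    attn = F.softmax(img_queries @ k.T / k.shape[-1] ** 0.5, dim=-1)
    return attn @ v                              # (n_img, d): per-query values

def cross_attention_with_nested_token(img_queries, txt_keys, txt_vals,
                                      token_idx, per_query_vals):
    """Outer cross-attention where one text token's shared value is swapped
    for a localized, per-query value."""
    attn = F.softmax(img_queries @ txt_keys.T / txt_keys.shape[-1] ** 0.5, dim=-1)
    out = attn @ txt_vals                                        # shared values
    w = attn[:, token_idx:token_idx + 1]                         # weight on the token
    return out - w * txt_vals[token_idx] + w * per_query_vals    # localized value

# Toy shapes: 64 image queries, 8 subject tokens, 16 text tokens, dim 32.
d = 32
q, subj = torch.randn(64, d), torch.randn(8, d)
tk, tv = torch.randn(16, d), torch.randn(16, d)
pv = nested_values(q, subj, torch.randn(d, d), torch.randn(d, d))
out = cross_attention_with_nested_token(q, tk, tv, token_idx=5, per_query_vals=pv)
```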
[1/5] Rethinking SVGs, the implicit way — meet NeuralSVG! 🎨✨
An implicit neural representation for generating layered SVGs from text prompts.
💧 Powered by SDS and nested-dropout for ordered shapes (sketched below)
🖌️ Enables inference-time editing like color palette & aspect ratio
Read more on
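As a rough illustration of the nested-dropout idea for ordered shapes (the function name and probability below are assumptions, not the paper's exact training schedule): during training, only a random prefix of the shape list is rendered, which pushes early shapes to carry the coarse structure and later shapes to add detail.

```python
import random

def nested_dropout_prefix(shapes, p_keep_all=0.5):
    """Keep either the full ordered shape list or a random prefix of it.

    Rendering only a prefix during training encourages an ordering where
    early shapes capture coarse structure and later shapes add detail.
    """
    if random.random() < p_keep_all:
        return shapes
    return shapes[: random.randint(1, len(shapes))]
```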
What if you could compose videos - merging multiple clips, even capturing complex athletic moves where video models struggle - all while preserving motion and context?
And yes, you can still edit them with text after! Stay tuned for more results.
#AI #VideoGeneration #SnapResearch
🚀 Meet DiP: our newest text-to-motion diffusion model!
✨ Ultra-fast generation
♾️ Creates endless, dynamic motions
🔄 Seamlessly switch prompts on the fly
Best of all, it's now available in the MDM codebase:
github.com/GuyTevet/motio…
[1/3]
🚀 New preprint! 🚀
Check out AnyTop 🤩
✅ A diffusion model that generates motion for arbitrary skeletons 🦴
✅ Using only a skeletal structure as input
✅ Learns semantic correspondences across diverse skeletons 🦅🐒🪲
🔗 Arxiv: arxiv.org/abs/2502.17327
Vectorization into a neat SVG!🎨✨
Instead of generating a messy SVG (left), we produce a structured, compact representation (right) - enhancing usability for editing and modification. Accepted to #CVPR2025!
Ever stared at a set of shapes and thought: 'These could be something… but what?'
Designed for visual ideation, PiT takes a set of concepts and interprets them as parts within a target domain, assembling them together while also sampling missing parts.
eladrich.github.io/PiT/
🔔just landed: IP Composer🎨
semantically mix & match visual concepts from images
❌ text prompts can't always capture visual nuances
❌ visual-input-based methods often need training / don't allow fine-grained control over *which* concepts to extract from our input images
So👇
Excited to share that "TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space" got accepted to SIGGRAPH 2025!
It tackles disentangling complex visual concepts from as little as a single image and re-composing concepts across multiple images into a coherent generation.
🔔Excited to announce that #AnyTop has been accepted to #SIGGRAPH2025!🥳
✅ A diffusion model that generates motion for arbitrary skeletons
✅ Using only a skeletal structure as input
✅ Learns semantic correspondences across diverse skeletons
🌐 Project: anytop2025.github.io/Anytop-page
Excited to share that "IP-Composer: Semantic Composition of Visual Concepts" got accepted to #SIGGRAPH2025!🥳
We show how to combine visual concepts from multiple input images by projecting them into CLIP subspaces - no training, just neat embedding math✨
Really enjoyed working
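To make the "embedding math" concrete, here is a hedged sketch of that kind of composition. It assumes you already have CLIP image embeddings plus a set of CLIP text embeddings describing variations of a concept; the rank and the SVD-based subspace construction are illustrative assumptions rather than the exact recipe, and the composed embedding would then condition an image-prompt adapter such as IP-Adapter.

```python
import numpy as np

def concept_subspace(concept_text_embs: np.ndarray, rank: int = 8) -> np.ndarray:
    """Orthonormal basis (rank x d) for a concept, built from CLIP text
    embeddings of many textual variations of that concept."""
    X = concept_text_embs - concept_text_embs.mean(axis=0, keepdims=True)
    _, _, vt = np.linalg.svd(X, full_matrices=False)
    return vt[:rank]

def compose(ref_img_emb: np.ndarray, concept_img_emb: np.ndarray,
            basis: np.ndarray) -> np.ndarray:
    """Swap the reference image's component inside the concept subspace with
    the concept image's component; everything outside the subspace is kept."""
    proj = lambda e: basis.T @ (basis @ e)
    return ref_img_emb - proj(ref_img_emb) + proj(concept_img_emb)

# Toy example with random stand-ins for CLIP embeddings (dim 768).
d = 768
texts = np.random.randn(50, d)          # embeddings of concept descriptions
basis = concept_subspace(texts, rank=8)
mixed = compose(np.random.randn(d), np.random.randn(d), basis)
```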