Andrew Brown (@andrew__brown__) Twitter Tweets • TwiCopy

Andrew Brown

@andrew__brown__

+ Follow

Research Scientist GenAI NY @AIatMeta working on video generation (Meta Movie Gen) | PhD @Oxford_VGG with Andrew Zisserman, Previously @oxengsci

ID: 1188775737115009034

calendar_today28-10-2019 11:13:30

350 Tweet

2,2K Takipçi

480 Takip Edilen

good girl

@goodgirlxsz

5 hours ago

🔥Telegram İfşa

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

New paper! We cast reward fine-tuning as stochastic control. 1. We prove that a specific noise schedule *must* be used for fine-tuning. 2. We propose a novel algorithm that is significantly better than the adjoint method*. (*this is an insane claim) arxiv.org/abs/2409.08861

thumb_up_off_alt315

chat_bubble_outline9

repeat62

shareShare

Wei-Ning Hsu

@mhnt1580

a year ago

As a speech/audio researcher, I think it’s a big breakthrough in advancing *human-level audio generation*. Why? Because this is NOT just a video-to-audio model that generates what it sees in the physical world... but a model that learns to *DESIGN* sounds like a human🤯 Here

thumb_up_off_alt69

chat_bubble_outline1

repeat8

shareShare

Kevin Chih-Yao Ma

@chihyaoma

a year ago

Movie Gen claims to be the state-of-the-art in text-to-video generation, outperforming Sora, Kling, Gen3, and more. But how can you trust the results? Today, we're releasing 1003 videos and their prompts - no cherry-picking allowed. Our goal? To set a new standard for evaluating

thumb_up_off_alt53

chat_bubble_outline2

repeat8

shareShare

Rohit Girdhar

@_rohitgirdhar_

a year ago

MovieGen is now on arXiv, with some interesting new tidbits! I’m particularly excited about this scaling analysis, where we find that the optimal FLOPs/params for MovieGen lie on the Llama3 scaling law, suggesting that LLM scaling laws might even work for media generation models!

thumb_up_off_alt78

chat_bubble_outline1

repeat8

shareShare

Andrew Brown

@andrew__brown__

a year ago

🚨 Internship in Meta GenAI NYC 🚨 I have an open PhD internship position for 2025! Interested in exploring visual generative models (or any other exciting ideas) inside the team that brought you Movie Gen and Emu Video? 📩 Send me DM with CV, website, and GScholar profile

thumb_up_off_alt267

chat_bubble_outline3

repeat27

shareShare

Andrew Brown

@andrew__brown__

a year ago

Ever wondered why current SOTA gen models learn a mapping all the way from noise to the target distribution? Well, turns out that this is a constraint of diffusion models, but NOT flow matching models! We explore this in cross-flow.github.io ! really enjoyed this one 😄

thumb_up_off_alt75

chat_bubble_outline0

repeat8

shareShare

Yuge Shi (Jimmy)

@yugeten

10 months ago

✨New blog post✨: my attempt as a vision researcher at finally understanding RLHF -- a deep dive into PPO & DeepSeek's GRPO! No hot take, I promise. yugeten.github.io/posts/2025/01/…

thumb_up_off_alt1,1K

chat_bubble_outline26

repeat173

shareShare

Rohit Girdhar

@_rohitgirdhar_

10 months ago

Super excited to share some recent work that shows that pure, text-only LLMs, can see and hear without any training! Our approach, called "MILS", uses LLMs with off-the-shelf multimodal models, to caption images/videos/audio, improve image generation, style transfer, and more!

thumb_up_off_alt247

chat_bubble_outline7

repeat38

shareShare

Hila Chefer

@hila_chefer

10 months ago

VideoJAM is our new framework for improved motion generation from AI at Meta We show that video generators struggle with motion because the training objective favors appearance over dynamics. VideoJAM directly adresses this **without any extra data or scaling** 👇🧵