Weichen FAN (@w30259893) 's Twitter Profile
Weichen FAN

@w30259893

Ph.D. Student, MMLab@NTU, Nanyang Technological University

ID: 1239748103126650881

calendar_today17-03-2020 02:59:23

35 Tweet

95 Followers

39 Following

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers abs: arxiv.org/abs/2405.05945 code: github.com/Alpha-VLLM/Lumโ€ฆ Introduces the Lumina-T2X family of models, with the largest being a 7B DiT. Able to generate images,

Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

abs: arxiv.org/abs/2405.05945
code: github.com/Alpha-VLLM/Lumโ€ฆ

Introduces the Lumina-T2X family of models, with the largest being a 7B DiT. Able to generate images,
Chenyang Si (@scy994) 's Twitter Profile Photo

Thanks AK for sharing! ๐ŸŽ‰ ๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅWe've released Vchitect-2.0, a 2B video generation model, supporting up to 720x480 resolution and 5-20 second generation. ๐Ÿ‘‰ Website: vchitect.intern-ai.org.cn ๐Ÿ‘‰ Code: github.com/Vchitect/Vchitโ€ฆ ๐Ÿ‘‰ Demos Hugging Face: huggingface.co/spaces/Vchitecโ€ฆ

Ziwei Liu (@liuziwei7) 's Twitter Profile Photo

๐Ÿ“ขHigh-Quality Text-to-Video (T2V) Dataset๐Ÿ“ข We are now releasing ๐ŸŽฌVchitect-T2V-Dataverse๐ŸŽฌ, a large-scale T2V database with *2M high-quality video clips* as well as *fine-grained textual captions* - Code: github.com/Vchitect/Vchitโ€ฆ - Dataset Hugging Face: huggingface.co/datasets/Vchitโ€ฆ

Ziwei Liu (@liuziwei7) 's Twitter Profile Photo

๐Ÿ”ฅTime to Upgrade Your Classifier-Free Guidance๐Ÿ”ฅ ๐ŸŒ CFG-Zero*๐ŸŒ  offers consistently better *visual quality* and *text alignment* on text-to-image/video - Project: weichenfan.github.io/webpage-cfg-zeโ€ฆ - Code: github.com/WeichenFan/CFGโ€ฆ - Demo Gradio: huggingface.co/spaces/weepiesโ€ฆ . Thanks AK!

Alessandro Perilli ๐Ÿ‡บ๐Ÿ‡ฆ (@giano) 's Twitter Profile Photo

While the world is clamoring for the new OpenAI 4o image generation model, the open AI community makes yet another step towards SOTA image quality with CFG Zero Star technique ๐Ÿ˜ฑ Now added to the upcoming APW 13.0 EA2 for ComfyUI.

While the world is clamoring for the new OpenAI 4o image generation model, the open AI community makes yet another step towards SOTA image quality with CFG Zero Star technique ๐Ÿ˜ฑ Now added to the upcoming APW 13.0 EA2 for ComfyUI.
Weichen FAN (@w30259893) 's Twitter Profile Photo

๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅWaN 2.1-14B T2V is now officially supported in our repo! ๐ŸŽ‰ With 4% steps of zero-init, frame quality sees a significant boost. ๐Ÿ”— github.com/WeichenFan/CFGโ€ฆ

Alessandro Perilli ๐Ÿ‡บ๐Ÿ‡ฆ (@giano) 's Twitter Profile Photo

FLUX.1 Dev and SD3.5 are not the only ones to benefit from the new CFG Zero Star technique. Even WanVideo 2.1 benefits from it. APW 13.0 EA2 for ComfyUI will support this use case, too.

Weichen FAN (@w30259893) 's Twitter Profile Photo

๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ We now officially support wan2.1-14B for Image-to-Video generation! Check out the latest update in our repo: ๐Ÿ‘‰ github.com/WeichenFan/CFGโ€ฆ

Weichen FAN (@w30259893) 's Twitter Profile Photo

Our method โœจCFG-Zero*โœจworks well with LoRA too! Generated using Flux with the ๐Ÿ‘ปDeath Stranding๐Ÿ‘ป LoRA applied. ๐Ÿ‘‰Check out our repo: github.com/WeichenFan/CFGโ€ฆ LoRA: civitai.com/models/46080?mโ€ฆ #DeathStranding #VirtualPhotography #KojimaProductions

Our method โœจCFG-Zero*โœจworks well with LoRA too!
Generated using Flux with the ๐Ÿ‘ปDeath Stranding๐Ÿ‘ป LoRA applied.

๐Ÿ‘‰Check out our repo: github.com/WeichenFan/CFGโ€ฆ

LoRA: civitai.com/models/46080?mโ€ฆ

#DeathStranding 
#VirtualPhotography
#KojimaProductions
Weichen FAN (@w30259893) 's Twitter Profile Photo

๐ŸŒŸ CFG-Zero* now supports Qwen2.5-Omni! ๐Ÿ”— github.com/QwenLM/Qwen2.5โ€ฆ Big thanks to Wan2.1 for the mention! ๐Ÿ”— github.com/Wan-Video/Wan2โ€ฆ Check out our latest updates: ๐Ÿ‘‰ github.com/WeichenFan/CFGโ€ฆ ๐ŸŽง Generated audio samples below โฌ‡๏ธ

Weichen FAN (@w30259893) 's Twitter Profile Photo

โœจ CFG-Zeroโ˜… is now supported in ๐Ÿค— Diffusers(github.com/huggingface/diโ€ฆ)! Use it with just one line: "guidance = ClassifierFreeZeroStarGuidance(guidance_scale=5.0, zero_init_steps=1)" ๐Ÿ“ฆ Repo: github.com/WeichenFan/CFGโ€ฆ The comparisons produced by Diffusers:

โœจ CFG-Zeroโ˜… is now supported in ๐Ÿค— Diffusers(github.com/huggingface/diโ€ฆ)!

Use it with just one line:
"guidance = ClassifierFreeZeroStarGuidance(guidance_scale=5.0, zero_init_steps=1)"

๐Ÿ“ฆ Repo: github.com/WeichenFan/CFGโ€ฆ

The comparisons produced by Diffusers:
Weichen FAN (@w30259893) 's Twitter Profile Photo

โœจCFG-Zero* supports EasyControl(github.com/Xiaojiu-z/Easyโ€ฆ Jiaming Liu) now! ๐Ÿ“ทCheck our repo for the lastest update on Ghibli-style generation: github.com/WeichenFan/CFGโ€ฆ #ghiblistyle

โœจCFG-Zero* supports EasyControl(github.com/Xiaojiu-z/Easyโ€ฆ  <a href="/228229753James/">Jiaming Liu</a>) now!    

๐Ÿ“ทCheck our repo for the lastest update on Ghibli-style generation: github.com/WeichenFan/CFGโ€ฆ

#ghiblistyle
Jiaming Liu (@228229753james) 's Twitter Profile Photo

CFG Zero star has been integrated into the EasyControl-Ghibli space! Try it out and have fun. huggingface.co/spaces/jamesliโ€ฆ Thanks Weichen FAN Ziwei Liu

Ziqi Huang (@ziqi_huang_) 's Twitter Profile Photo

๐ŸŽฌ ๐—–๐—ฉ๐—ฃ๐—ฅ ๐Ÿฎ๐Ÿฌ๐Ÿฎ๐Ÿฑ ๐—ง๐˜‚๐˜๐—ผ๐—ฟ๐—ถ๐—ฎ๐—น ๐™๐™ง๐™ค๐™ข ๐™‘๐™ž๐™™๐™š๐™ค ๐™‚๐™š๐™ฃ๐™š๐™ง๐™–๐™ฉ๐™ž๐™ค๐™ฃ ๐™ฉ๐™ค ๐™’๐™ค๐™ง๐™ก๐™™ ๐™ˆ๐™ค๐™™๐™š๐™ก ๐Ÿš€ Hosted by MMLab@NTU ร— Kuaishou, etc ๐Ÿ“… June 11 | Nashville ๐Ÿ”— world-model-tutorial.github.io ๐Ÿง  Video is just the start. World modeling is the goal. #CVPR2025 #WorldModel

๐ŸŽฌ ๐—–๐—ฉ๐—ฃ๐—ฅ ๐Ÿฎ๐Ÿฌ๐Ÿฎ๐Ÿฑ ๐—ง๐˜‚๐˜๐—ผ๐—ฟ๐—ถ๐—ฎ๐—น
๐™๐™ง๐™ค๐™ข ๐™‘๐™ž๐™™๐™š๐™ค ๐™‚๐™š๐™ฃ๐™š๐™ง๐™–๐™ฉ๐™ž๐™ค๐™ฃ ๐™ฉ๐™ค ๐™’๐™ค๐™ง๐™ก๐™™ ๐™ˆ๐™ค๐™™๐™š๐™ก

๐Ÿš€ Hosted by MMLab@NTU ร— Kuaishou, etc
๐Ÿ“… June 11 | Nashville
๐Ÿ”— world-model-tutorial.github.io
๐Ÿง  Video is just the start. World modeling is the goal.
#CVPR2025 #WorldModel
Ziwei Liu (@liuziwei7) 's Twitter Profile Photo

๐ŸŽฌ#CVPR2025 ๐“๐ฎ๐ญ๐จ๐ซ๐ข๐š๐ฅ ๐Ÿ—บ๏ธ๐‘ญ๐’“๐’๐’Ž ๐‘ฝ๐’Š๐’…๐’†๐’ ๐‘ฎ๐’†๐’๐’†๐’“๐’‚๐’•๐’Š๐’๐’ ๐’•๐’ ๐‘พ๐’๐’“๐’๐’… ๐‘ด๐’๐’…๐’†๐’ #CVPR2025 ๐Ÿ”—world-model-tutorial.github.io ๐Ÿ“…June 11 ๐Ÿš€Hosted by MMLab@NTU x Kling AI ๐Ÿง Incredible lineup of speakers: Jack Parker-Holder Hong-Xing "Koven" Yu Jiaming Song Pengfei Wan Angjoo Kanazawa Sherry Yang

๐ŸŽฌ#CVPR2025 ๐“๐ฎ๐ญ๐จ๐ซ๐ข๐š๐ฅ
๐Ÿ—บ๏ธ๐‘ญ๐’“๐’๐’Ž ๐‘ฝ๐’Š๐’…๐’†๐’ ๐‘ฎ๐’†๐’๐’†๐’“๐’‚๐’•๐’Š๐’๐’ ๐’•๐’ ๐‘พ๐’๐’“๐’๐’… ๐‘ด๐’๐’…๐’†๐’ <a href="/CVPR/">#CVPR2025</a>

๐Ÿ”—world-model-tutorial.github.io

๐Ÿ“…June 11
๐Ÿš€Hosted by <a href="/MMLabNTU/">MMLab@NTU</a> x <a href="/Kling_ai/">Kling AI</a> 
๐Ÿง Incredible lineup of speakers: <a href="/jparkerholder/">Jack Parker-Holder</a> <a href="/Koven_Yu/">Hong-Xing "Koven" Yu</a> <a href="/baaadas/">Jiaming Song</a> <a href="/wanfufeng/">Pengfei Wan</a> <a href="/akanazawa/">Angjoo Kanazawa</a> <a href="/sherryyangML/">Sherry Yang</a>
Zhaoxi Chen (@frozen_burning) 's Twitter Profile Photo

#Genie3 Awesome release by Google DeepMind! We finally reach an incredible moment for interactive world models! Please also checkout the recording and slides of our tutorial on #CVP2025 where Jack Parker-Holder shared his thoughts on scaling world models! world-model-tutorial.github.io

#Genie3 Awesome release by Google DeepMind! We finally reach an incredible moment for interactive world models! 

Please also checkout the recording and slides of our tutorial on #CVP2025 where <a href="/jparkerholder/">Jack Parker-Holder</a> shared his thoughts on scaling world models! 

world-model-tutorial.github.io