Jaehong Yoon (on the faculty job market) (@jaeh0ng_yoon)'s Twitter Profile
Jaehong Yoon (on the faculty job market)

@jaeh0ng_yoon

Postdoc @unccs & @uncnlp, working w/ @mohitban47 | Prev: @MLAI_KAIST, @MSFTResearch | Trustworthy and Continually-Adaptable Multimodal AI in an Evolving World

ID: 717323368924471296

Website: https://jaehong31.github.io | Joined: 05-04-2016 12:09:53

548 Tweets

849 Followers

804 Following

Jaehong Yoon (on the faculty job market) (@jaeh0ng_yoon):

Thrilled to share that I’ll be joining the College of Computing and Data Science at Nanyang Technological University (NTU Singapore) as an Assistant Professor, starting in August 2025 🇸🇬🥳

I’ll continue my research on building trustworthy and continually adaptable multimodal AI,
Jialu Li (@jialuli96):

🚨Check out our new video generation work EPiC!
🌟EPiC enables precise 3D camera trajectory control for both image-to-video and video-to-video generation!
💡Key highlights:
▶️ Efficient training within 16 GPU-hours
▶️ No need for paired video-camera trajectory data

Han Lin (@hanlin_hl):

Check out our new paper (EPiC) for video generation with camera control 🔥
Here are the two highlights for easy and efficient training:
➡️ The model can be trained directly on videos in the wild, without requiring extra camera trajectory annotations.
➡️ With a novel

Yue Zhang (@zhan1624):

🚀Check out our new paper EPiC for video generation with efficient and precise 3D camera control! Just 16 GPU-hours (vs. 200+), with higher-quality results!
We innovate at both the data and model level:
✅ Data: Visibility-based masking—no video-camera trajectory paired data needed

Jaehong Yoon (on the faculty job market) (@jaeh0ng_yoon):

🚨 New paper alert! EPiC: Video Generation with Precise 3D Camera Control 🎬
Tackles both I2V & V2V tasks with:
▶️ Visibility-based masking—no need for video-camera trajectories
▶️ Lightweight ControlNet, guided by an anchor video as a structural prior
Details in the thread! 👇

Jaemin Cho (on faculty job market) (@jmin__cho):

Introducing EPiC - precise & efficient camera control for video generation! 📽️⚙️
Previous methods had drawbacks:
❌ Noisy anchor videos from point cloud estimates
❌ Expensive camera pose annotations
❌ 200+ GPU hours to train
EPiC addresses this with:
✅ Visibility-based
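
An aside on the "visibility-based masking" idea these threads keep citing: the rough mechanic is to forward-warp source pixels into the target camera and mask out anything occluded or off-screen. Below is a minimal numpy sketch under that reading; the function interface and epsilon are my own assumptions, not EPiC's actual implementation.

```python
# Rough sketch of a visibility mask: forward-warp source pixels into the
# target camera and keep only points that land on-screen, in front of the
# camera, and survive a z-buffer occlusion test. All names are illustrative.
import numpy as np

def visibility_mask(depth, K, T_src_to_tgt, H, W):
    ys, xs = np.meshgrid(np.arange(H), np.arange(W), indexing="ij")
    pix = np.stack([xs, ys, np.ones_like(xs)], axis=-1).reshape(-1, 3).T  # 3 x HW
    pts = np.linalg.inv(K) @ pix * depth.reshape(1, -1)   # back-project to 3D
    pts_h = np.vstack([pts, np.ones((1, pts.shape[1]))])
    tgt = (T_src_to_tgt @ pts_h)[:3]                      # points in target frame
    z = np.maximum(tgt[2], 1e-6)
    proj = K @ tgt
    u, v = proj[0] / z, proj[1] / z                       # project to target pixels
    visible = (tgt[2] > 0) & (u >= 0) & (u < W) & (v >= 0) & (v < H)
    # Two-pass z-buffer: record the nearest depth per target pixel, then drop
    # points that are farther than the recorded surface (occluded).
    ui = np.clip(u.astype(int), 0, W - 1)
    vi = np.clip(v.astype(int), 0, H - 1)
    zbuf = np.full((H, W), np.inf)
    np.minimum.at(zbuf, (vi[visible], ui[visible]), tgt[2][visible])
    occluded = tgt[2] > zbuf[vi, ui] + 1e-3
    return (visible & ~occluded).reshape(H, W)            # per-source-pixel mask
```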

Zun Wang (@zunwang919):

Big congratulations to Prof. Jaehong Yoon on his new role at NTU Singapore! 🎉 He was an amazing mentor and advisor, always thoughtful, supportive, and full of sharp ideas. If you're thinking about applying for a PhD in multimodal AI and related areas, definitely reach out to him! Looking forward to

Minghao Wu (@wuminghao_nlp):

Excited to share that I’ll be joining UNC Computer Science and UNC NLP as a Postdoctoral Research Associate, working with the incredible Mohit Bansal! Can’t wait to collaborate with the amazing students and faculty there! 🎉

A huge thank you to my supervisor Reza Haffari, my colleagues at
Daeun Lee (@danadaeun):

Excited to share Video-Skill-CoT 🎬🛠️ – a new framework for domain-adaptive video reasoning with skill-aware Chain-of-Thought (CoT) supervision!
⚡️Key Highlights:
➡️ Automatically extracts domain-specific reasoning skills from questions and organizes them into a unified taxonomy,

Jaemin Cho (on faculty job market) (@jmin__cho):

Introducing Video-Skill-CoT 📽️, a new framework for domain-adaptive video understanding with skill-specific chain-of-thought reasoning!
✅ Automatically discovers reasoning skills from video data
✅ Trains skill-specific expert modules with skill-specific CoT rationales
✅

Jaehong Yoon (on the faculty job market) (@jaeh0ng_yoon):

🚨 New Release: Video-Skill-CoT! Domain-Adaptive, Skill-Based Video Reasoning 💡
✅ Automatically extracts domain-specific reasoning skills
✅ Generates tailored, skill-based CoT rationales
✅ Trains with skill-specific experts for stronger domain adaptation
🚀 Outperforms
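
To make the "automatically extracts reasoning skills" step concrete: one simple reading is to embed the training questions and cluster them, with each cluster acting as a skill that gets its own CoT rationales and expert module. A minimal sketch, assuming off-the-shelf sentence embeddings; N_SKILLS and all names are placeholders, not the paper's code.

```python
# Minimal sketch of the skill-discovery step: embed training questions and
# cluster them; each cluster stands in for one skill, which would get its own
# CoT rationales and expert module (e.g., a LoRA adapter).
import numpy as np
from sklearn.cluster import KMeans

N_SKILLS = 8  # assumed size of the skill taxonomy

def discover_skills(question_embeddings: np.ndarray) -> np.ndarray:
    """Return a skill-cluster id per training question."""
    km = KMeans(n_clusters=N_SKILLS, n_init=10, random_state=0)
    return km.fit_predict(question_embeddings)

# Routing sketch: at train/inference time, send each example to the expert
# module matching its skill id, e.g. experts[skill_ids[i]].
```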

Ziwei Liu (@liuziwei7):

🔥 High-Quality Video Generation Accelerator 🔥
⚡️ Dual-Expert Consistency Model (#DCM) ⚡️ brings 10× speedup to video generation models (from 1.3B to 13B) with no quality drop
- Now supports Hunyuan and Wan
- Page: vchitect.github.io/DCM/
- Code: github.com/Vchitect/DCM
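
The "dual-expert" part, as I read it: one consistency expert handles the high-noise steps (global semantics and layout) and a second handles the low-noise steps (fine detail). A toy sketch of that sampling loop, with the split point and expert call signature purely assumed, not taken from the DCM code:

```python
# Toy dual-expert consistency sampling loop. The 50/50 split and the expert
# interface are assumptions for illustration.
def dual_expert_sample(z, timesteps, semantic_expert, detail_expert, split=0.5):
    """Route high-noise steps to the semantic expert and low-noise steps to
    the detail expert; each expert performs one consistency-model step."""
    cutoff = timesteps[int(len(timesteps) * split)]
    for t in timesteps:                      # timesteps ordered high -> low noise
        expert = semantic_expert if t >= cutoff else detail_expert
        z = expert(z, t)
    return z
```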

Rohan Paul (@rohanpaul_ai):

This paper proposes VIDEO-SKILL-COT to improve domain adaptation using skill-aware Chain-of-Thought supervision and expert learning modules.

Methods 🔧:

→ The framework automatically constructs skill-based Chain-of-Thought annotations by extracting skills from questions,
elvis (@omarsar0):

How much do LLMs memorize?

Meta and collaborators suggest that they can estimate model capacity by measuring memorization.

"Models in the GPT family have an approximate capacity of 3.6 bits-per-parameter."

Once capacity fills, generalization begins!

More in my notes below:
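
A sketch of the measurement idea as I understand it: memorization is the extra compression the trained model achieves on a sequence beyond what a reference model (which never saw it) explains, measured in bits. The helper and model choices below are placeholders, and the paper's actual capacity estimator is more careful than this.

```python
# Hedged sketch of compression-based memorization: a sequence counts as
# "memorized" to the extent the trained model compresses it better than a
# reference model that never saw it.
import math

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def bits_under(model, tok, text: str) -> float:
    """Total code length of `text` in bits under `model` (negative log2-likelihood)."""
    ids = tok(text, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, labels=ids)
    n_predicted = ids.shape[1] - 1          # loss is averaged over shifted tokens
    return out.loss.item() * n_predicted / math.log(2)

def memorized_bits(trained, reference, tok, text: str) -> float:
    """Positive values suggest `trained` stores information about `text`
    beyond what generalization (the reference model) explains."""
    return bits_under(reference, tok, text) - bits_under(trained, tok, text)
```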
David Wan (@meetdavidwan):

Excited to share our new work, CLaMR! 🚀

We tackle multimodal content retrieval by jointly considering video, speech, OCR, and metadata. CLaMR learns to dynamically pick the right modality for your query, boosting retrieval by 25 nDCG@10 points over single-modality retrieval! 🧐
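
One way to picture the "dynamically pick the right modality" behavior: score each document by its best-matching modality embedding for the query. A minimal sketch with precomputed embeddings; this illustrates the idea, not CLaMR's actual late-interaction scoring.

```python
# Minimal sketch of per-query modality selection for retrieval, assuming
# precomputed unit-norm embeddings per modality.
import numpy as np

MODALITIES = ["video", "speech", "ocr", "metadata"]

def doc_score(query_emb: np.ndarray, doc_embs: dict) -> float:
    """Score a document by its best-matching modality, so the retriever
    effectively picks the right modality for each query."""
    return max(float(query_emb @ doc_embs[m]) for m in MODALITIES)

def retrieve(query_emb: np.ndarray, corpus: dict, k: int = 10) -> list:
    """Rank documents (id -> {modality: embedding}) by their best modality."""
    ranked = sorted(corpus, key=lambda d: doc_score(query_emb, corpus[d]),
                    reverse=True)
    return ranked[:k]
```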
Avi Schwarzschild (@a_v_i__s):

Big news! 🎉 I’m joining UNC-Chapel Hill as an Assistant Professor in Computer Science starting next year! Before that, I’ll be spending time at OpenAI working on LLM privacy.
UNC Computer Science UNC NLP
Jaehong Yoon (on the faculty job market) (@jaeh0ng_yoon):

🚨 Check out our new paper: Frame Guidance — a powerful, training-free framework for video control in diffusion models! 🎬
▶️ Supports multiple forms of control in video diffusion models, including keyframe-guided generation, stylization, video looping, color-block manipulation,
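
Training-free control of this flavor typically works by steering the sampler with the gradient of a frame-level loss. A hedged sketch of one guided denoising step; `denoiser`, `decode_frame`, `idx`, and the guidance scale are hypothetical stand-ins, not the paper's API.

```python
# Hedged sketch of one training-free guided denoising step: steer the latent
# with the gradient of a frame-level loss against a target keyframe.
import torch
import torch.nn.functional as F

def guided_step(latent, t, denoiser, decode_frame, keyframe, idx, scale=1.0):
    latent = latent.detach().requires_grad_(True)
    x0_pred = denoiser(latent, t)            # model's predicted clean latent
    frame = decode_frame(x0_pred, idx)       # decode only the guided frame
    loss = F.mse_loss(frame, keyframe)       # frame-level control objective
    grad = torch.autograd.grad(loss, latent)[0]
    return (latent - scale * grad).detach()  # nudge latent, then keep sampling
```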

Ziyang Wang (@ziyangw00):

Excited to present VideoTree 🌲 at #CVPR2025, Friday at 10:30 AM! VideoTree improves long-video QA via smart sampling:
- Query-adaptive: finds the parts of the video relevant to the query
- Coarse-to-fine structure: hierarchically organized to sample granularly from relevant segments
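
The sampling idea, roughly: a cheap uniform pass first, then extra frames around segments that look relevant to the query. A small sketch assuming CLIP-style unit-norm frame and query embeddings; thresholds and names are mine, not VideoTree's code.

```python
# Small sketch of query-adaptive coarse-to-fine frame sampling.
import numpy as np

def coarse_to_fine_sample(frame_embs, query_emb, coarse_k=16, expand=4, tau=0.3):
    """Uniformly sample a coarse set of frames, then densify around the
    frames whose similarity to the query exceeds `tau`."""
    n = len(frame_embs)
    coarse = np.linspace(0, n - 1, coarse_k, dtype=int)   # coarse uniform pass
    sims = frame_embs[coarse] @ query_emb
    picked = set(coarse.tolist())
    for i, s in zip(coarse, sims):
        if s > tau:                                       # relevant segment
            picked.update(range(max(0, i - expand), min(n, i + expand + 1)))
    return sorted(picked)
```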

Jaewoo Lee (@jaew00_lee):

🎉 Excited to share that I’ll be starting my CS PhD journey at UNC-Chapel Hill UNC Computer Science this fall! 🎓
I’ll be working with the renowned Mohit Bansal at UNC NLP — a dream come true! ✨
Huge thanks to everyone who's helped me get here. Can't wait to begin this new life and research journey! 🧳🚀