Jaemin Cho (on faculty job market) (@jmin__cho)'s Twitter Profile
Jaemin Cho (on faculty job market)

@jmin__cho

On faculty job market! PhD candidate @UNCCS | @Bloomberg PhD Fellow | Prev: @GoogleAI @MSFTResearch @AdobeResearch @Allen_AI | 🦋: jmincho.bsky.social

ID: 243126428

Link: https://j-min.io | Joined: 26-01-2011 10:49:24

987 Tweets

1.1K Followers

1.1K Following

Chong Zeng (@iam_ncj)'s Twitter Profile Photo

What if a Transformer could render?
Not text → image.
But mesh → image, with global illumination.

No rasterizers. No ray-tracers. Just a Transformer without per-scene training.

RenderFormer does exactly that.

#SIGGRAPH2025
🔗 microsoft.github.io/renderformer
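For readers wondering what "a Transformer that renders" could even look like, here is a toy PyTorch sketch of the general shape of the idea (per-triangle features in, image-patch pixels out). Every dimension, layer count, and token choice below is an invented placeholder; this is not RenderFormer's actual architecture.

```python
# Toy sketch: a Transformer that maps triangle tokens to image-patch tokens.
# All shapes, features, and hyperparameters are illustrative placeholders.
import torch
import torch.nn as nn

class ToyNeuralRenderer(nn.Module):
    def __init__(self, tri_feat_dim=16, d_model=256, n_patches=64, patch_pixels=16 * 16 * 3):
        super().__init__()
        self.tri_embed = nn.Linear(tri_feat_dim, d_model)                   # embed per-triangle features
        self.patch_queries = nn.Parameter(torch.randn(n_patches, d_model))  # learned output tokens
        layer = nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=4)
        self.to_pixels = nn.Linear(d_model, patch_pixels)                   # decode each patch token to pixels

    def forward(self, triangles):                                # triangles: (B, n_tris, tri_feat_dim)
        B = triangles.shape[0]
        tri_tokens = self.tri_embed(triangles)
        patch_tokens = self.patch_queries.expand(B, -1, -1)
        tokens = torch.cat([tri_tokens, patch_tokens], dim=1)    # attend jointly over scene + image tokens
        out = self.encoder(tokens)[:, -patch_tokens.shape[1]:]   # keep only the image-patch tokens
        return self.to_pixels(out)                               # (B, n_patches, patch_pixels)

# Example: "render" a random 100-triangle scene (untrained, so the output is noise).
img = ToyNeuralRenderer()(torch.randn(2, 100, 16))
print(img.shape)  # torch.Size([2, 64, 768])
```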
David Bau (@davidbau)'s Twitter Profile Photo

Dear MAGA friends, I have been worrying about STEM in the US a lot, because right now the Senate is writing new laws that cut 75% of the STEM budget in the US. Sorry for the long post, but the issue is really important, and I want to share what I know about it. The entire

Joykirat (@joykiratsingh)'s Twitter Profile Photo

I'm thrilled to share that I'll be joining the University of North Carolina at Chapel Hill for my CS PhD this fall!! 🎓💙 UNC-Chapel Hill I'll be working with the amazing Mohit Bansal at UNC NLP. Grateful to everyone who's supported me, excited for this new chapter! 🚀

Minghao Wu (@wuminghao_nlp)'s Twitter Profile Photo

Excited to share that I'll be joining UNC Computer Science and UNC NLP as a Postdoctoral Research Associate, working with the incredible Mohit Bansal! Can't wait to collaborate with the amazing students and faculty there! 🎉

A huge thank you to my supervisor Reza Haffari, my colleagues at
Daeun Lee (@danadaeun)'s Twitter Profile Photo

Excited to share Video-Skill-CoT 🎬🛠️ – a new framework for domain-adaptive video reasoning with skill-aware Chain-of-Thought (CoT) supervision! ⚡️ Key Highlights: ➡️ Automatically extracts domain-specific reasoning skills from questions and organizes them into a unified taxonomy,

Jaemin Cho (on faculty job market) (@jmin__cho)'s Twitter Profile Photo

Introducing Video-Skill-CoT 📽️, a new framework for domain-adaptive video understanding with skill-specific chain-of-thought reasoning! ✅ Automatically discovers reasoning skills from video data ✅ Trains skill-specific expert modules with skill-specific CoT rationales ✅

Jaehong Yoon (on the faculty job market) (@jaeh0ng_yoon)'s Twitter Profile Photo

🚨 New Release: Video-Skill-CoT! Domain-Adaptive, Skill-Based Video Reasoning 💡 ✅ Automatically extracts domain-specific reasoning skills ✅ Generates tailored, skill-based CoT rationales ✅ Trains with skill-specific experts for stronger domain adaptation 🚀 Outperforms

Zun Wang (@zunwang919)'s Twitter Profile Photo

🚨 Check my amazing labmate's latest work 🎬 Video-Skill-CoT 🛠️, a powerful and elegant framework for domain-adaptive video reasoning with skill-aware CoT 🧠✨, achieving strong results across multiple tasks! 📊🔥

Elias Stengel-Eskin (on the faculty job market) (@eliaseskin)'s Twitter Profile Photo

🚨 CLATTER treats entailment as a reasoning process, guiding models to follow concrete steps (decomposition, attribution/entailment, and aggregation). CLATTER improves hallucination detection via NLI, with gains on ClaimVerify, LFQA, and TofuEval especially on long-reasoning
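As a rough illustration of that three-step recipe (decompose, check entailment per claim, aggregate), here is a minimal Python sketch. It is not the CLATTER implementation: the sentence-level claim splitter and the entailment scorer below are placeholder stand-ins you would swap for an LLM-based decomposer and a real NLI model.

```python
# Sketch of decompose -> per-claim entailment -> aggregate hallucination scoring.
# split_into_claims() and the nli_entailment_prob callable are placeholders, not CLATTER's components.
from typing import Callable, List

def split_into_claims(response: str) -> List[str]:
    # Placeholder decomposition: one "claim" per sentence. A real system would
    # extract atomic, self-contained claims (e.g., with an LLM).
    return [s.strip() for s in response.split(".") if s.strip()]

def detect_hallucination(source: str, response: str,
                         nli_entailment_prob: Callable[[str, str], float],
                         threshold: float = 0.5) -> dict:
    claims = split_into_claims(response)                        # 1. decomposition
    scores = [nli_entailment_prob(source, c) for c in claims]   # 2. attribution / entailment per claim
    unsupported = [c for c, s in zip(claims, scores) if s < threshold]
    return {                                                    # 3. aggregation over claims
        "min_entailment": min(scores) if scores else 1.0,
        "unsupported_claims": unsupported,
        "hallucinated": bool(unsupported),
    }

# Usage with a dummy scorer that "entails" a claim only if it appears verbatim in the source.
dummy_nli = lambda premise, hypothesis: 1.0 if hypothesis.lower() in premise.lower() else 0.0
print(detect_hallucination("The cat sat on the mat", "The cat sat on the mat. The cat is blue", dummy_nli))
```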

Rohan Paul (@rohanpaul_ai)'s Twitter Profile Photo

This paper proposes VIDEO-SKILL-COT to improve domain adaptation using skill-aware Chain-of-Thought supervision and expert learning modules.

Methods 🔧:

→ The framework automatically constructs skill-based Chain-of-Thought annotations by extracting skills from questions,
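As a loose illustration of the skill-extraction step described above (not the paper's actual pipeline), one could cluster question embeddings into a small skill taxonomy and tag each question with a skill id; the embedding model, the example questions, and the cluster count below are arbitrary choices.

```python
# Sketch: group training questions into "skills" by clustering their embeddings.
# Model name and number of skills are arbitrary; the paper's pipeline may differ (e.g., LLM-extracted skills).
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans

questions = [
    "What emotion does the speaker show after the goal?",
    "How many players are on screen in the final scene?",
    "Why does the chef lower the heat before adding cream?",
    "What happens right after the car turns left?",
]

embedder = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = embedder.encode(questions)

n_skills = 2  # size of the skill taxonomy; tuned per domain in practice
skill_ids = KMeans(n_clusters=n_skills, n_init=10, random_state=0).fit_predict(embeddings)

# Each question is now tagged with a skill id; downstream, skill-specific CoT rationales
# and skill-specific expert modules would be built per cluster.
for q, sid in zip(questions, skill_ids):
    print(sid, q)
```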
David Wan (@meetdavidwan)'s Twitter Profile Photo

Excited to share our new work, CLaMR! 🚀

We tackle multimodal content retrieval by jointly considering video, speech, OCR, and metadata. CLaMR learns to dynamically pick the right modality for your query, boosting retrieval by 25 nDCG@10 over single modality retrieval!

Elias Stengel-Eskin (on the faculty job market) (@eliaseskin)'s Twitter Profile Photo

Excited to announce CLaMR, our new retriever for multimodal documents! Strong performance improvements (+25 nDCG@10) compared to both multimodal and unimodal retrieval baselines. CLaMR jointly encodes multiple modalities and selects the most relevant ones for each query. 🏋️‍♂️

Han Wang (@hanwang98)'s Twitter Profile Photo

How can a multimodal retriever accurately retrieve docs from massive online video content that spans multiple modalities? We introduce CLaMR, a contextualized late-interaction retriever that jointly encodes all modalities and dynamically selects those containing the relevant

Jaemin Cho (on faculty job market) (@jmin__cho)'s Twitter Profile Photo

Introducing CLaMR -- a late-interaction retriever for complex multimodal video content! 📽️📚 ➡️ Jointly encodes frames, speech, on-screen text, and metadata to answer diverse queries grounded across modalities ➡️ Trained with a new dataset we introduce, MultiVENT 2.0++, a

Ziyang Wang (@ziyangw00)'s Twitter Profile Photo

Excited to present VideoTree 🌲 at #CVPR2025 Fri at 10:30AM! VideoTree improves long-video QA via smart sampling: - Query-adaptive: finds the parts of the video relevant to the query - Coarse-to-fine structure: structured hierarchically to sample granularly from relevant segments
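A minimal sketch of that query-adaptive, coarse-to-fine idea (not VideoTree's actual algorithm): score coarse segments against the query, then sample frames more densely only inside the most relevant segments. The frame and query embeddings here are random placeholders standing in for a vision-language encoder.

```python
# Sketch: coarse-to-fine, query-adaptive frame sampling for long-video QA.
import numpy as np

rng = np.random.default_rng(0)
frame_embs = rng.normal(size=(600, 128))   # one embedding per frame of a long video (placeholder)
query_emb = rng.normal(size=128)           # query embedding (placeholder)

def cosine(a, b):
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8)

# Coarse level: split the video into fixed segments and score each against the query.
n_segments, seg_len = 20, len(frame_embs) // 20
seg_scores = [cosine(frame_embs[i*seg_len:(i+1)*seg_len].mean(0), query_emb) for i in range(n_segments)]

# Fine level: sample densely inside the top-scoring segments, sparsely elsewhere.
top_segments = set(np.argsort(seg_scores)[-3:])
sampled = []
for i in range(n_segments):
    stride = 2 if i in top_segments else 15   # more frames from query-relevant segments
    sampled.extend(range(i * seg_len, (i + 1) * seg_len, stride))
print(f"kept {len(sampled)} of {len(frame_embs)} frames")
```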

Omar Khattab (@lateinteraction)'s Twitter Profile Photo

Wow I missed this extra fancy ColBERT model. > A late-interaction retriever which jointly encodes/contextualizes information from many modalities, allowing for fine-grained matching with the query and implicitly finding the most relevant modality.
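For readers unfamiliar with late interaction, here is a minimal MaxSim-style scoring sketch over per-modality token embeddings. It illustrates the general mechanism referenced above, not CLaMR's model; all embeddings below are random placeholders standing in for a learned encoder's output.

```python
# Sketch of ColBERT-style late interaction over tokens drawn from several modalities.
import numpy as np

rng = np.random.default_rng(0)
dim = 32
query_tokens = rng.normal(size=(6, dim))                  # token embeddings of the query (placeholder)

# A "document" is the union of token embeddings from each modality, tagged by source.
doc_modalities = {
    "frames": rng.normal(size=(40, dim)),
    "speech": rng.normal(size=(25, dim)),
    "ocr":    rng.normal(size=(10, dim)),
    "meta":   rng.normal(size=(5, dim)),
}

def normalize(x):
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + 1e-8)

q = normalize(query_tokens)
doc_tokens = normalize(np.concatenate(list(doc_modalities.values())))
sources = np.repeat(list(doc_modalities), [len(v) for v in doc_modalities.values()])

sim = q @ doc_tokens.T                     # (query_tokens, doc_tokens) similarity matrix
best = sim.argmax(axis=1)                  # MaxSim: each query token picks its best document token
score = sim.max(axis=1).sum()              # document relevance = sum of per-query-token maxima
print("relevance score:", round(float(score), 3))
print("modality chosen per query token:", [str(s) for s in sources[best]])
```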

Mohit Bansal (@mohitban47)'s Twitter Profile Photo

Welcome Jaewoo to the MURGe-Lab + UNC NLP + UNC Computer Science family & the beautiful Chapel Hill + Research Triangle area! 🎉 Looking forward to the exciting research and fun together in your PhD journey 💙

hyunji amy lee (@hyunji_amy_lee)'s Twitter Profile Photo

🚨 Want models to better utilize and ground on the provided knowledge? We introduce Context-INformed Grounding Supervision (CINGS)! Training LLMs with CINGS significantly boosts grounding abilities in both text and vision-language models compared to standard instruction tuning.
