Chunyuan Li (@chunyuanli) 's Twitter Profile
Chunyuan Li

@chunyuanli

@xAI | Previous: @MSFTResearch; PhD @DukeU

ID: 2745113140

linkhttp://chunyuan.li calendar_today17-08-2014 06:04:38

310 Tweet

3,3K Followers

624 Following

Yuanhan (John) Zhang (@zhang_yuanhan) 's Twitter Profile Photo

📽️📽️ LLaVA-Video (formerly LLaVA-NeXT-Video) has undergone a major upgrade! We are excited to release LLaVA-Video-178K, a high-quality synthetic dataset for video instruction tuning. This dataset includes: - 178,510 caption entries - 960,792 open-ended Q&A pairs -

📽️📽️ LLaVA-Video (formerly LLaVA-NeXT-Video) has undergone a major upgrade! 

We are excited to release LLaVA-Video-178K, a high-quality synthetic dataset for video instruction tuning. This dataset includes:

  - 178,510 caption entries
  - 960,792 open-ended Q&A pairs
  -
Tianyi Xiong (@tianyixiong23) 's Twitter Profile Photo

🚀🔥Introducing LLaVA-Critic--the first open-source large multimodal model designed to assess model performance across diverse multimodal tasks! LLaVA-Critic excels in two primary scenarios: - 👨‍⚖️LMM-as-a-Judge: It provides pointwise scores and pairwise rankings that closely

🚀🔥Introducing LLaVA-Critic--the first open-source large multimodal model designed to assess model performance across diverse multimodal tasks!

LLaVA-Critic excels in two primary scenarios:
- 👨‍⚖️LMM-as-a-Judge: It provides pointwise scores and pairwise rankings that closely
Joseph Pollack #Ï 🎗️ (@josephpollack) 's Twitter Profile Photo

🙋🏻‍♂️hey there folks , 🌋🌋did you know that Llava recieved a major update? Yuanhan (John) Zhang & Chunyuan Li from llms lab & team just released a new video understanding models & new datasets! collection : huggingface.co/collections/lm… @gradio demo on Hugging Face : huggingface.co/spaces/Tonic/L…

Li Junnan (@lijunnan0409) 's Twitter Profile Photo

Side note on Google Scholar: It’s been a year, and InstructBLIP still doesn’t have its own entry—it’s been merged with LLaVA (which is a fantastic paper). Any help resolving this would be much appreciated!

Yuanhan (John) Zhang (@zhang_yuanhan) 's Twitter Profile Photo

Fine-grained temporal understanding is fundamental for any video understanding model. Excited to see LLaVA-Video showing promising results on TemporalBench, Mu Cai @ not at ICLR! Yet, there remains a significant gap between the best model and human-level performance. The journey continues!

Fine-grained temporal understanding is fundamental for any video understanding model. Excited to see LLaVA-Video showing promising results on TemporalBench, <a href="/MuCai7/">Mu Cai @ not at ICLR</a>! Yet, there remains a significant gap between the best model and human-level performance. The journey continues!
Mu Cai (@mucai7) 's Twitter Profile Photo

Now TemporalBench is fully public! See how your video understanding model performs on TemporalBench before CVPR! 🤗 Dataset: huggingface.co/datasets/micro… 📎 Integrated to lmms-eval (systematic eval): github.com/EvolvingLMMs-L… (great work by Chunyuan Li Yuanhan (John) Zhang ) 📗 Our

Now TemporalBench is fully public! See how your video understanding model performs on TemporalBench before CVPR! 

🤗 Dataset: huggingface.co/datasets/micro…
📎 Integrated to lmms-eval (systematic eval): github.com/EvolvingLMMs-L… (great work by <a href="/ChunyuanLi/">Chunyuan Li</a> <a href="/zhang_yuanhan/">Yuanhan (John) Zhang</a> )
📗 Our
lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

BREAKING: @xAI early version of Grok-3 (codename "chocolate") is now #1 in Arena! 🏆 Grok-3 is: - First-ever model to break 1400 score! - #1 across all categories, a milestone that keeps getting harder to achieve Huge congratulations to @xAI on this milestone! View thread 🧵

BREAKING: @xAI early version of Grok-3 (codename "chocolate") is now #1 in Arena! 🏆

Grok-3 is:
- First-ever model to break 1400 score!
- #1 across all categories, a milestone that keeps getting harder to achieve

Huge congratulations to @xAI on this milestone! View thread 🧵
xAI (@xai) 's Twitter Profile Photo

This is it: The world’s smartest AI, Grok 3, now available for free (until our servers melt). Try Grok 3 now: x.com/i/grok X Premium+ and SuperGrok users will have increased access to Grok 3, in addition to early access to advanced features like Voice Mode

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

📰More exciting news today: xAI's latest Grok-3 tops the Arena leaderboard! 🔥 This is the newest, production model, grok-3-preview-02-24 With over 3k votes, this model is tied for #1 overall, and across Hard Prompts, Coding, Math, Creative Writing, Instruction Following, and

📰More exciting news today: <a href="/xai/">xAI</a>'s latest Grok-3 tops the Arena leaderboard! 🔥

This is the newest, production model, grok-3-preview-02-24

With over 3k votes, this model is tied for #1 overall, and across Hard Prompts, Coding, Math, Creative Writing, Instruction Following, and
Elon Musk (@elonmusk) 's Twitter Profile Photo

@xAI has acquired X in an all-stock transaction. The combination values xAI at $80 billion and X at $33 billion ($45B less $12B debt). Since its founding two years ago, xAI has rapidly become one of the leading AI labs in the world, building models and data centers at