Mu Cai (@mucai7) Twitter Tweets • TwiCopy

Mu Cai

@mucai7

8 months ago

I am thrilled to join Google DeepMind as a Research Scientist and continue working on multimodal research!

I am thrilled to join <a href="/GoogleDeepMind/">Google DeepMind</a> as a Research Scientist and continue working on multimodal research!

thumb_up_off_alt1,1K

chat_bubble_outline62

repeat47

shareShare

Advait Bopardikar

@advaitonline

8 months ago

Gemini Deep Research now uses Gemini 2.5 Pro, its pretty good. You should try it out on Google Gemini App

Gemini Deep Research now uses Gemini 2.5 Pro, its pretty good. You should try it out on <a href="/GeminiApp/">Google Gemini App</a>

thumb_up_off_alt1,1K

chat_bubble_outline27

repeat83

shareShare

Gemini 2.5 Flash just dropped. ⚡ As a hybrid reasoning model, you can control how much it ‘thinks’ depending on your 💰 - making it ideal for tasks like building chat apps, extracting data and more. Try an early version in Google AI Studio → ai.dev

thumb_up_off_alt1,1K

chat_bubble_outline56

repeat221

shareShare

Mu Cai

@mucai7

8 months ago

Totally agree. Models like #OpenAI 's #o3, #o4mini still can not figure out the basic geometry problems. If visual perception is wrong, then ``reasoning" part is meaningless. Huge room for improvement!

thumb_up_off_alt35

chat_bubble_outline0

repeat3

shareShare

Mu Cai

@mucai7

8 months ago

#OpenAI's #o3 #o4mini just again demonstrate the power of visual prompting in ViP-LLaVA(CVPR 2024)vip-llava.github.io In 2023, we proved that, drawing hints visually is more effective that elaborating in text, especially for object level understanding. Go for VisualThinking!

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

Xiang Li

@xiangli54505720

7 months ago

Hi everyone! I hope you had a great time in Singapore🇸🇬. Though I could not be there in person, I'm excited to share our poster schedule at #ICLR2025. Feel free to stop by, check out our work, and bring any questions you have to Kanchana Ranasinghe.

thumb_up_off_alt15

chat_bubble_outline1

repeat2

shareShare

Mu Cai

@mucai7

7 months ago

I am excited to announce that I am not at #ICLR presenting Matryoshka Multimodal Models matryoshka-mm.github.io. 😀 But rather, I am online at Bay Area. Ping me if you have any questions or ideas w.r.t paper! Feel free to read the poster at Hall 3 + Hall 2B #86 this morning!

thumb_up_off_alt123

chat_bubble_outline3

repeat8

shareShare

Shao-Hua Sun

@shaohua0116

7 months ago

#ICLR2025 “authors who review are slightly more harsh with scores on average”

thumb_up_off_alt80

chat_bubble_outline3

repeat4

shareShare

Mu Cai

@mucai7

7 months ago

Welcome everyone to submit your paper's performance on the TemporalBench challenge! temporalbench.github.io

thumb_up_off_alt10

chat_bubble_outline0

repeat0

shareShare

Google DeepMind

@googledeepmind

7 months ago

We’re releasing an updated Gemini 2.5 Pro (I/O edition) to make it even better at coding. 🚀 You can build richer web apps, games, simulations and more - all with one prompt. In Google Gemini App, here's how it transformed images of nature into code to represent unique patterns 🌱

thumb_up_off_alt3,3K

chat_bubble_outline121

repeat541

shareShare

Mu Cai

@mucai7

7 months ago

Thank you Yong Jae Lee! Without the support from you and our group members, it is impossible for me to have such works. I'll miss the days working in our group.

Thank you <a href="/yong_jae_lee/">Yong Jae Lee</a>! Without the support from you and our group members, it is impossible for me to have such works. I'll miss the days working in our group.

thumb_up_off_alt61

chat_bubble_outline5

repeat0

shareShare

Google DeepMind

@googledeepmind

7 months ago

Introducing AlphaEvolve: a Gemini-powered coding agent for algorithm discovery. It’s able to: 🔘 Design faster matrix multiplication algorithms 🔘 Find new solutions to open math problems 🔘 Make data centers, chip design and AI training more efficient across Google. 🧵

thumb_up_off_alt7,7K

chat_bubble_outline180

repeat1,1K

shareShare

Pushmeet Kohli

@pushmeet

7 months ago

Excited to announce AlphaEvolve A powerful AI coding agent developed by our team in Google DeepMind that is able to discover impactful new algorithms for important problems in Maths and Computing by combining the creativity of large language models with automated evaluators.

thumb_up_off_alt2,2K

chat_bubble_outline45

repeat331

shareShare

Demis Hassabis

@demishassabis

7 months ago

cooking up something tasty for tomorrow...

thumb_up_off_alt5,5K

chat_bubble_outline419

repeat298

shareShare

Logan Kilpatrick

@officiallogank

7 months ago

Google's progress in AI since last year: - The worlds strongest models, on pareto frontier - Gemini app: has over 400M monthly active users - We now process 480T tokens a month, up 50x YoY - Over 7M developers have built with the Gemini API (4x) Much more to come still!

thumb_up_off_alt2,2K

chat_bubble_outline112

repeat163

shareShare

Mu Cai

@mucai7

6 months ago

Cheers, Matryoshka!

thumb_up_off_alt25

chat_bubble_outline1

repeat0

shareShare

Mu Cai

@mucai7

6 months ago

Consider submitting your paper to our workshop!

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Mu Cai

@mucai7

6 months ago

See this interesting work on using RL (GRPO loss) to dramatically improve chart/math understanding for VLMs!

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Feng Yao

@fengyao1909

6 months ago

🔥 "Vibe coding" is everywhere—but is it really care-free? We introduce 𝐑𝐞𝐚𝐋, an RL framework that trains LLMs with automated program analysis feedback, enabling "vibe coding" to be not just fast—but 𝐯𝐮𝐥𝐧𝐞𝐫𝐚𝐛𝐢𝐥𝐢𝐭𝐲-𝐟𝐫𝐞𝐞 & 𝐩𝐫𝐨𝐝𝐮𝐜𝐭𝐢𝐨𝐧-𝐫𝐞𝐚𝐝𝐲 🛡️

thumb_up_off_alt135

chat_bubble_outline2

repeat37

shareShare

Kangwook Lee

@kangwook_lee

6 months ago

As a video gaming company, Krafton AI has secretly been cooking something big with NVIDIA AI for a while! 🥳 We introduce Orak, the first comprehensive video gaming benchmark for LLMs! arxiv.org/abs/2506.03610

As a video gaming company, <a href="/Krafton_AI/">Krafton AI</a> has secretly been cooking something big with <a href="/NVIDIAAI/">NVIDIA AI</a> for a while!

🥳 We introduce Orak, the first comprehensive video gaming benchmark for LLMs!

arxiv.org/abs/2506.03610

thumb_up_off_alt145

chat_bubble_outline3

repeat31

shareShare