Howard Zhou (@howardzzh) Twitter Tweets • TwiCopy

Howard Zhou

@howardzzh

I'm a Principal Software Engineer and Engineering Director at Google DeepMind, interested in Computer Vision, Machine Learning problems, and Computer Graphics.

ID: 317254344

Joined: 14-06-2011 17:22:16

14 Tweets

48 Followers

68 Following

Howard Zhou

@howardzzh

6 years ago

Work from our team, yeah!

Likes: 0, Replies: 0, Retweets: 0

Jon Barron

Training NeRFs per-scene is so 2020. Inspired by image based rendering, IBRNet does amortized inference for view synthesis by learning how to look at input images at render time. 15% drop in error, 80% fewer FLOPs than NeRF. Great work Qianqian Wang! ibrnet.github.io

Likes: 442, Replies: 2, Retweets: 81

Frank Dellaert

@fdellaert

4 years ago

In anticipation of the Intl. Conf. on Computer Vision (#ICCV2021) this week, I rounded up all papers that use Neural Radiance Fields (NeRFs) represented in the main #ICCV2021 conference here (1/N): dellaert.github.io/NeRF21

Likes: 318, Replies: 2, Retweets: 73

Jeff Dean

@jeffdean

4 years ago

New work from Google Research by @JHYUXM, Zirui Wang, Vijay Vasudevan, Legg Yeung, Mojtaba Seyedhosseini and Yonghui Wu: CoCa is a new way of combining image and text representations that achieves SOTA results on a large number of tasks of different kinds.

Likes: 103, Replies: 1, Retweets: 21

Frank Dellaert

@fdellaert

3 years ago

Andrew Marmon and I rounded up all #CVPR2022 papers on NeRF/Neural Radiance Fields we could find in a new blog post here: dellaert.github.io/NeRF22/

Likes: 497, Replies: 9, Retweets: 133

Jason Baldridge

@jasonbaldridge

3 years ago

We are excited to share our work on our Pathways Autoregressive Text-to-Image model, Parti! #Parti achieves high-fidelity photorealistic image generation and supports content-rich synthesis involving complex compositions and world knowledge. parti.research.google

Likes: 344, Replies: 10, Retweets: 77

AK

@_akhaliq

2 years ago

Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use

From content moderation to wildlife conservation, the number of applications that require models to recognize nuanced or subjective visual concepts is growing.

Likes: 43, Replies: 3, Retweets: 16

André Araujo

@andrefaraujo

a year ago

Want some TIPS? Well, then check out “Text-Image Pretraining with Spatial awareness” :) TIPS is a general-purpose image-text encoder, for off-the-shelf dense and image-level prediction. Finally image-text pretraining with spatially-aware representations! arxiv.org/abs/2410.16512

Likes: 49, Replies: 4, Retweets: 11

André Araujo

@andrefaraujo

8 months ago

Multimodal AI encoders often lack spatial understanding… but not anymore! Our #ICLR2025 TIPS model (Text-Image Pretraining with Spatial awareness) from Google DeepMind can help 💡🚀 Check out our strong & versatile image-text encoder 💪 Paper & code: arxiv.org/abs/2410.16512

Likes: 330, Replies: 6, Retweets: 68

lmarena.ai (formerly lmsys.org)

@lmarena_ai

8 months ago

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer

Likes: 2.2K, Replies: 75, Retweets: 421

Howard Zhou

@howardzzh

8 months ago

Last Call: Learn GenAI and help us break the GUINNESS WORLD RECORDS™ for Largest Virtual AI Conference! Join Google & Kaggle's GenAI Intensive: No cost, live sessions, hands-on labs. Registration closes this Friday! #GenAI #GuinnessWorldRecords #Kaggle #GoogleAI

Likes: 0, Replies: 0, Retweets: 0

Howard Zhou

@howardzzh

8 months ago

Please check out this cool work from Ian Huang and our team within Google DeepMind.

Website: fireplace3d.github.io

Paper: arxiv.org/abs/2503.04919

Likes: 4, Replies: 0, Retweets: 0
