Kai Zhang
@kaizhang9546
Research Scientist at Adobe pondering about 3Dβs role in the AGI story. Opinions are my own.
ID: 1346885093361471489
06-01-2021 18:23:38
115 Tweet
770 Followers
259 Following
Nice paper surveying Multimodal AI Architectures -- with a comprehensive taxonomy and analysis of their pros/cons & applications in any-to-any modality model development π ππ¨π¦π©π«ππ‘ππ§π¬π’π―π πππ±π¨π§π¨π¦π²: First work to explicitly identify and categorize four broad
Today along with 4 other models AI at Meta released Chameleon: 7B & 34B language models. This is based on AI at Meta 's brilliant paper released in May-2024. "Chameleon: Mixed-Modal Early-Fusion Foundation Models" π₯ π¨βπ§ The Problem this paper solves: Chameleon tackles the key
Exciting news - Chatbot Arena now supports image uploadsπΈ Challenge GPT-4o, Gemini, Claude, and LLaVA with your toughest questions. Plot to code, VQA, story telling, you name it. Let's get creative and have fun! Leaderboard coming soon. Credits to builders Christopher Chou
Exciting News! π My paper got accepted at #ECCV2024! Huge thanks to my Adobe and KAUST collaborators! π DATENeRF: Depth-Aware Text-based Editing of NeRFs π Sara Rojas Martinez, Julien Philip, Kai Zhang, Sai Bi, Fujun Luan,Bernard Ghanem , Kalyan Sunkavalli datenerf.github.io/DATENeRF/