
Paul Rubenstein
@paulkrubenstein
Multimodal LLMs at Google DeepMind in Zurich, views my own
ID: 891203132
19-10-2012 15:26:55
72 Tweet
356 Followers
188 Following








Happy to announce DreamFusion, our new method for Text-to-3D! dreamfusion3d.github.io We optimize a NeRF from scratch using a pretrained text-to-image diffusion model. No 3D data needed! Joint work w/ the incredible team of Ben Mildenhall Ajay Jain Jon Barron #dreamfusion


PaLM + AudioLM = AudioPaLM ! We start from PaLM pretrained on text and extend its vocab w/ audio tokens. This model can then be finetuned on a mix of any (speech, text) task e.g. ASR, TTS, MT and speech2speech translation in one's voice! š§µ1/4 google-research.github.io/seanet/audiopaā¦





