jaesung choe (@choe_jaesung) Twitter Tweets • TwiCopy

jaesung choe

@choe_jaesung

Research scientist @NVIDIA

ID: 1551841156388233218

Link: https://jaesung-choe.github.io/

Joined: 26-07-2022 08:05:50

11 Tweets

8 Followers

52 Following

NVIDIA AI Developer

@nvidiaaidev

2 years ago

📣 #NVIDIAResearch paper, XCube, generates highly-detailed #3D scenes with resolutions of up to 1024³ in under 30 seconds. 👀 Capable of leveraging voxel attributes such as normals, semantics, & TSDF, this model is pushing the boundary of 3D generation. nvda.ws/3TeXk3K

236 likes · 7 replies · 60 retweets

Jim Fan

@drjimfan

2 years ago

We are raising baby GR00T with ❤️ in a cozy lab x.com/DrJimFan/statu…

306 likes · 11 replies · 17 retweets

Jim Fan

@drjimfan

2 years ago

OpenAI is expected to demo a real-time voice assistant tomorrow. What does it take to deliver an immersive, or even magical experience? Almost all voice AI go through 3 stages: 1. Speech recognition or "ASR": audio -> text1, think Whisper; 2. LLM that plans what to say next:

3.3K likes · 131 replies · 539 retweets

Sergey Levine

@svlevine

2 years ago

Multi-turn RL with vision language models to enable RL-based training of vision-based policies.

107 likes · 2 replies · 24 retweets

Jim Fan

@drjimfan

2 years ago

What makes up the abstract concept of an apple? We read the word "apple" as a string, see 2D pictures online, 3D shape in real life, and moving apples in videos. We touch the apple, feel its geometry in our palms and texture through the rich tactile sensation on our fingers. Do

584 likes · 44 replies · 92 retweets

Yuke Zhu

@yukez

2 years ago

Excited to announce RoboCasa, a large-scale simulation framework of everyday tasks! We use generative AI tools to create diverse objects, scenes, and tasks. Simulation plays a pivotal role in our Data Pyramid for training generalist robots. Open-source at robocasa.ai

457 likes · 15 replies · 94 retweets

Guy Davidson

@guyd33

2 years ago

New preprint! How should we think about cognitive representations of goals? We argue that thinking about goals as a type of symbolic cognitive program makes a lot of sense -- and perform behavioral and computational work to support this idea: 1/n exps.gureckislab.org/guydav/goal_pr…

103 likes · 3 replies · 20 retweets

Nando de Freitas

@nandodf

2 years ago

Meta Chameleon and Gemini have also adopted the Gato ( arxiv.org/pdf/2205.06175 ) architecture. Is this going to be the ultimate approach for MIMO (multimodal input multimodal output models) or is there something else we should be trying? There’s been great progress in scaling

200 likes · 7 replies · 30 retweets

Allen T.

@mr_allent

2 years ago

MusePose comfyui node 🔥 MusePose is a pose driven image2video framework for virtual human generation Link to comfyui node in post below:

923 likes · 17 replies · 157 retweets

Radiance Fields

@radiancefields

2 years ago

The Three.js-based 3DGS implementation of Gaussian Splatting, GaussianSplats3D from Mark Kellogg has officially added 2DGS support with v0.4.3! Article: radiancefields.com/three-js-based… Github: github.com/mkkellogg/Gaus…

318 likes · 0 replies · 58 retweets

MrNeRF

@janusch_patas

2 years ago

Click-Gaussian: Interactive Segmentation to Any 3D Gaussians arxiv.org/abs/2407.11793 Project: seokhunchoi.github.io/Click-Gaussian/ Method ⬇️ 1 | 2

276 likes · 1 reply · 51 retweets