Junchen Liu (@junchenliu77)'s Twitter Profile
Junchen Liu

@junchenliu77

Research intern at @berkeley_ai

ID: 1572092935252111360

Website: https://junchenliu77.github.io · Joined: 20-09-2022 05:19:13

9 Tweets

68 Followers

357 Following

HU, Wenbo (@wbhu_cuhk)'s Twitter Profile Photo

Excited to share our #TrajectoryCrafter, a diffusion model for redirecting camera trajectories in monocular videos! Explore the world underlying your videos~ Page: trajectorycrafter.github.io Demo: huggingface.co/spaces/Doubiiu… Code: github.com/TrajectoryCraf…

Junyi Zhang (@junyi42)'s Twitter Profile Photo

Introducing St4RTrack!🖖 Feed-forward simultaneous 4D reconstruction and tracking in world coordinates, achieved just by changing the meaning of two pointmaps! st4rtrack.github.io

Arthur Allshire (@arthurallshire)'s Twitter Profile Photo

Our new system trains humanoid robots using data from cell-phone videos, enabling skills such as climbing stairs and sitting on chairs in a single policy (w/ Hongsuk Benjamin Choi, Junyi Zhang, David McAllister).

Chung Min Kim (@chungminkim)'s Twitter Profile Photo

Excited to introduce PyRoki ("Python Robot Kinematics"): easier IK, trajectory optimization, motion retargeting... with an open-source toolkit on both CPU and GPU

Ruilong Li (@ruilong_li)'s Twitter Profile Photo

🌟gsplat🌟 just integrated 3DGUT, which allows training and rendering 3DGS with *distorted* pinhole/fisheye cameras, as well as rolling-shutter effects! > Check out this NVIDIA tech blog: developer.nvidia.com/blog/revolutio… > Sweepstakes to win a 4090: nvidia.com/en-us/research…

Mingxuan Wu (@jackwal97390450)'s Twitter Profile Photo

Introducing POD! Predict-Optimize-Distill: a self-improving cycle for 4D object understanding! Inputs: a multi-view scan of an object plus casually captured, long-form monocular videos of human interaction (from your phone). Outputs: 3D part poses over time.

Ruilong Li (@ruilong_li)'s Twitter Profile Photo

For everyone interested in precise 📷camera control📷 in transformers (e.g., video or world models): stop settling for Plücker raymaps -- use camera-aware relative PE in your attention layers, like RoPE (for LLMs) but for cameras! Paper & code: liruilong.cn/prope/

David McAllister (@davidrmcall)'s Twitter Profile Photo

Excited to share Flow Matching Policy Gradients: expressive RL policies trained from rewards using flow matching. It’s an easy, drop-in replacement for Gaussian PPO on control tasks.
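The "drop-in replacement for Gaussian PPO" idea above can be sketched: where a Gaussian policy head samples actions from a normal distribution, a flow-matching policy draws noise and integrates a learned velocity field to produce an action. A minimal illustrative sketch, assuming a hypothetical `velocity_field(obs, a, t)` interface for the policy network (not the paper's actual API):

```python
import numpy as np

def sample_action(velocity_field, obs, act_dim, steps=8, rng=None):
    """Sample an action by integrating a learned flow from noise to action.

    A Gaussian PPO head draws a ~ N(mu(obs), sigma(obs)); a flow-matching
    policy instead starts from noise and follows a learned velocity field.
    `velocity_field(obs, a, t)` is an assumed interface, not the paper's API.
    """
    rng = np.random.default_rng(rng)
    a = rng.standard_normal(act_dim)            # a_0 ~ N(0, I)
    dt = 1.0 / steps
    for i in range(steps):
        t = i * dt
        a = a + dt * velocity_field(obs, a, t)  # Euler step along the flow
    return a
```

More integration steps trade compute for a more accurate sample from the learned action distribution; the Gaussian head's single reparameterized draw is the degenerate one-step case.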

Alexandre Morgand (@almorgand)'s Twitter Profile Photo

"Cameras as Relative Positional Encoding" TL;DR: a comparison of ways to condition transformers on cameras: token-level raymaps, attention-level relative pose encodings, and a new relative encoding, Projective Positional Encoding, which uses camera frustums (both intrinsics and extrinsics) for relative positional encoding.
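As a rough illustration of what "attention-level relative pose encoding" means: instead of attaching each camera's absolute pose to its token (as a Plücker raymap does), the attention computation itself sees only relative poses between query and key cameras. The toy sketch below biases attention logits with a simple feature of that relative pose; everything here (the per-camera tokens, the translation-norm bias) is a hypothetical simplification, not the actual PRoPE formulation:

```python
import numpy as np

def relative_pose(T_q, T_k):
    # Transform taking camera-k coordinates into camera-q coordinates.
    # (T_q G)(T_k G)^{-1} = T_q T_k^{-1}, so this cancels any global
    # change of world frame G (extrinsics T map world -> camera).
    return T_q @ np.linalg.inv(T_k)

def camera_relative_attention(q, k, v, extrinsics):
    """Toy single-head attention over per-camera tokens.

    q, k, v: (n, d) arrays, one token per camera.
    extrinsics: (n, 4, 4) world-to-camera transforms.
    Biases logits with a relative-pose feature; an illustrative stand-in
    only (PRoPE itself also incorporates intrinsics / frustums).
    """
    n, d = q.shape
    logits = q @ k.T / np.sqrt(d)
    for i in range(n):
        for j in range(n):
            rel = relative_pose(extrinsics[i], extrinsics[j])
            # Simple relative-pose feature: penalize far-apart cameras.
            logits[i, j] -= np.linalg.norm(rel[:3, 3])
    w = np.exp(logits - logits.max(axis=1, keepdims=True))
    w /= w.sum(axis=1, keepdims=True)
    return w @ v
```

Because `relative_pose` cancels the world frame, the attention output is unchanged when all cameras are re-expressed in a different world coordinate system; absolute encodings such as token-level raymaps do not have this invariance.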

Ruilong Li (@ruilong_li)'s Twitter Profile Photo

Genie3 is like magic! Curious about the best way to add a viewpoint-conditioning signal into a transformer? Check this out 👉 liruilong.cn/prope/