Sudhir Pratap Yadav (@sudhirpyadav) Twitter Tweets • TwiCopy

Sudhir Pratap Yadav

@sudhirpyadav

I speak what I feel is true -- Open to change mind and discuss on anything. My views keep changing with information and maturity of my mind.

ID: 2179986666

Joined: 07-11-2013 12:17:51

4.4K Tweets

43 Followers

494 Following

Stefano Albrecht

@s_albrecht

4 years ago

Visit our aamas2022 paper "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" with Lukas Schäfer Filippos Christianos Josiah Hanna. Talk & Q&As on 11 May 18:00-19:00 (1A5-3) and 13 May 9:00-10:00 (3C1-2); all BST. ➡️ Paper: arxiv.org/abs/2107.08966

Likes: 15 · Replies: 1 · Retweets: 9

Jason Liu

@jasonjzliu

a year ago

Low-cost teleop systems have democratized robot data collection, but they lack any force feedback, making it challenging to teleoperate contact-rich tasks. Many robot arms provide force information — a critical yet underutilized modality in robot learning. We introduce: 1. 🦾A

Likes: 805 · Replies: 26 · Retweets: 87

Davide Scaramuzza

@davsca1

a month ago

Check out our latest work, "Actor-Critic Model Predictive Control: Differentiable Optimization meets Reinforcement Learning for Agile Flight," published in the IEEE Transactions on Robotics, where we reconcile #OptimalControl and #ReinforcementLearning, achieving the same

Likes: 410 · Replies: 3 · Retweets: 61

Sudhir Pratap Yadav

@sudhirpyadav

a month ago

I still remember this vividly. We got it working with RL. #mujoco #sim2real

Likes: 0 · Replies: 0 · Retweets: 0

C's Robotics Paper Notes

@roboreading

a month ago

arxiv.org/pdf/2601.17895 Masked Depth Modeling for Spatial Perception. Get high-quality metric depth images with a reconstruction model, amazing!

Likes: 175 · Replies: 1 · Retweets: 19

Sudhir Pratap Yadav

@sudhirpyadav

a month ago

Video coming soon

Likes: 1 · Replies: 0 · Retweets: 0

Sudhir Pratap Yadav

@sudhirpyadav

a month ago

Guess the controller

Likes: 0 · Replies: 0 · Retweets: 0

Sudhir Pratap Yadav

@sudhirpyadav

a month ago

This is a simple walking gait; much remains to be optimised. #quadruped #rumi #sim2real

Likes: 0 · Replies: 0 · Retweets: 0

cw j

@cwj99770123

25 days ago

Can we bridge the Sim-to-Real gap in complex manipulation without explicit system ID? 🤖 Presenting Contact-Aware Neural Dynamics — a diffusion-based framework that grounds simulation with real-world touch. Implicit Alignment: No tedious parameter tuning. Tactile-Driven:

Likes: 305 · Replies: 5 · Retweets: 48

Sudhir Pratap Yadav

@sudhirpyadav

24 days ago

#sim2real Check out our gravity compensation on the Kinova arm

Likes: 0 · Replies: 0 · Retweets: 0

Brent Yi

@brenthyi

23 days ago

New project! Flow Policy Gradients for Robot Control tldr; a simple online RL recipe for training and fine-tuning flow policies for robots co-led w/ Hongsuk Benjamin Choi: hongsukchoi.github.io/fpo-control

Likes: 559 · Replies: 15 · Retweets: 96

Sudhir Pratap Yadav

@sudhirpyadav

21 days ago

Real progress looks slow and boring #sim2real

Likes: 1 · Replies: 0 · Retweets: 0

Zi-ang Cao

@ziang_cao

19 days ago

🚀 Introducing CHIP: Adaptive Compliance for Humanoid Control through Hindsight Perturbation! Current humanoids face a trade-off: they are either Agile & Stiff OR Slow & Soft. CHIP breaks this barrier. We enable on-the-fly switching between Compliant (wiping 🧼,

Likes: 201 · Replies: 8 · Retweets: 47

C Zhang

@chongzitazhang

18 days ago

imitate and then distill into a goal-reaching policy then finetune. It's kinda becoming a standard way to repurpose skills.

Likes: 57 · Replies: 1 · Retweets: 5

Russ Tedrake

@russtedrake

18 days ago

I've been saying for years that the biggest challenge for simulation in robotics is not actually the physics engine (although you do have to get that right). The real challenge is capturing the *diversity* of the real world. There was no doubt that generative AI had the potential

Likes: 318 · Replies: 9 · Retweets: 34

Ai2

@allen_ai

18 days ago

Introducing MolmoSpaces, a large-scale, fully open platform + benchmark for embodied AI research. 🤖 230k+ indoor scenes, 130k+ object models, & 42M annotated robotic grasps—all in one ecosystem.

Likes: 682 · Replies: 10 · Retweets: 98

Sudhir Pratap Yadav

@sudhirpyadav

16 days ago

Finally we achieved locomotion in sim; the main culprit was stiffness. If your policy is not working, try lowering your stiffness to just enough to support the weight. Soon going to transfer to the real robot. #sim2real

Likes: 1 · Replies: 0 · Retweets: 0
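
The stiffness tip in the last post can be made concrete with a rough sketch. It assumes each joint is driven by a PD position controller, tau = kp * (q_des - q) - kd * qd, which the post does not state. Under that assumption a joint holding a static gravity torque sags by tau_g / kp, so the lowest stiffness that still supports the weight follows from the sag you are willing to accept. The function names, the mass, the lever arm, and the sag tolerance below are all hypothetical and are not taken from the author's robot.

```python
G = 9.81  # gravitational acceleration, m/s^2


def pd_torque(kp: float, kd: float, q_des: float, q: float, qd: float) -> float:
    """Joint torque from a PD position controller (assumed controller form)."""
    return kp * (q_des - q) - kd * qd


def min_stiffness(gravity_torque_nm: float, max_sag_rad: float) -> float:
    """Smallest kp (N*m/rad) that holds a static load within max_sag_rad.

    At rest the PD torque kp * sag must balance the gravity torque,
    so sag = tau_g / kp and therefore kp >= tau_g / max_sag.
    """
    if max_sag_rad <= 0:
        raise ValueError("max_sag_rad must be positive")
    return abs(gravity_torque_nm) / max_sag_rad


if __name__ == "__main__":
    # Hypothetical numbers for one leg joint, for illustration only.
    supported_mass_kg = 2.0   # share of body mass carried by this joint
    lever_arm_m = 0.15        # horizontal distance from joint axis to load
    tau_g = supported_mass_kg * G * lever_arm_m

    kp = min_stiffness(tau_g, max_sag_rad=0.05)
    print(f"gravity torque: {tau_g:.3f} N*m, minimum kp: {kp:.2f} N*m/rad")

    # Sanity check: at the allowed sag, the PD torque balances gravity.
    holding = pd_torque(kp, kd=0.0, q_des=0.05, q=0.0, qd=0.0)
    print(f"holding torque at max sag: {holding:.3f} N*m")
```

A value like this is only a lower bound to start tuning from. Dynamic loads during walking, actuator torque limits, and the damping gain all shift the stiffness that works in practice.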