Bhargav (@42bhargav) Twitter Tweets • TwiCopy

Tom Silver

@tomssilver

a month ago

This was my experience in grad school, and now I've seen some evidence to suggest a trend 🤔

797 likes, 11 replies, 43 reposts

Jingxi Xu

Honored to present my PhD work at Ai2! The lack of data is one of robotics’ biggest hurdles — here’s how we’re tackling it in tactile manipulation and rehab robots. Watch the talk 👇 youtu.be/Qfq0BT6En4E?si…

40 likes, 2 replies, 4 reposts

Zhutian Yang

@zhutianyang_

a month ago

Nice work!

30 likes, 0 replies, 4 reposts

Toviah Moldwin

@tmoldwin

a month ago

Artificial neural networks were invented as models of the brain and now rapidly approach human-level cognition. Does this mean that the mystery of human intelligence has been solved? I grapple with this question here. dendwrite.substack.com/p/what-remains…

19 likes, 3 replies, 5 reposts

HMC

@47k254

a month ago

Do you know about Open Geospatial Solutions? The Open Geospatial Solutions (opengeos) GitHub organization hosts a collection of open-source geospatial software projects. The projects are developed by a community of geospatial software developers and researchers. Their collection

355 likes, 3 replies, 70 reposts

Timothy Nguyen

@iamtimnguyen

a month ago

Are neural networks Turing complete? This excellent blogpost by colleague Hessam Akhlaghpour goes carefully over the limitations and flaws of much of the literature for the case of transformers: lifeiscomputation.com/transformers-a… Note that the first proofs of Turing completeness (to my knowledge)

63 likes, 5 replies, 18 reposts

Tom Silver

@tomssilver

a month ago

This week's #PaperILike is "Reality Promises: Virtual-Physical Decoupling Illusions in Mixed Reality via Invisible Mobile Robots" (Kari & Abtahi, UIST 2025). This is some Tony Stark level stuff! XR + robots = future. Website: mkari.de/reality-promis… PDF: mkari.de/reality-promis…

16 likes, 0 replies, 2 reposts

Sepp Hochreiter

@hochreitersepp

a month ago

xLSTM for robotic manipulation systems via diffusion-based imitation learning: arxiv.org/abs/2510.20406 PMP leverages an xLSTM to denoise actions for robotics. “PMP not only achieves state-of-the-art performance but also offers significantly faster training and inference.”

177 likes, 3 replies, 23 reposts

SzymonOzog

@szymonozog_

a month ago

Blog link: szymonozog.github.io/posts/2025-10-…

28 likes, 0 replies, 3 reposts

The Residency

@_theresidency

a month ago

one day you will look back and hate that you stayed in a system you already knew was broken, right now you still have time to choose — to defect instead of comply, to build instead of wait, to live instead of defer, you can always go back to school later, you can’t rewind the

91 likes, 8 replies, 5 reposts

Calum E. Douglas

@calumdouglas1

a month ago

Advice to students, young engineers and inquisitive amateurs. Every major project I do, until and including today, follows this pattern, and never does the fear leave. ============ 1) Can you do "thing x" ? 2) No. 3) Go to ntrs.nasa.gov , download all papers pertaining

1.1K likes, 16 replies, 149 reposts

Maciej Kilian

@kilian_maciej

a month ago

is qwen3 doing depth-wise layer upcycling? i'm looking at average (over tokens) router probabilities for qwen3-30b-a3b and seeing this weird pattern where for highly active experts, it's also likely for the same expert index but 12 (and often 24) layers down to also be highly

162 likes, 8 replies, 10 reposts

Armen Aghajanyan

@armenagha

a month ago

We dive deep into every part of the stack that we didn't build. Here's a deep-dive into (potentially undisclosed?) MoE depth wise up-cycling that Qwen3 does. I have a lot of Qwen folks that follow me, would anyone like to clarify :)

149 likes, 4 replies, 6 reposts

Gabriele Berton

@gabriberton

a month ago

At ICCV I met people working on Image Matching who still don't know our image-matching-models GitHub repo. It allows you to use almost any IM model just by changing one parameter. Image matchers, try it out, it will save you countless hours. github.com/alexstoken/ima…

171 likes, 2 replies, 17 reposts

Yifan Zhang

@yifan_zhang_

a month ago

🚀 Another thing that needs to be quoted is the upcoming Multi-Token Prediction (MTP) by Meta (arxiv.org/abs/2404.19737) and speculative decoding, which are rapidly becoming de facto components of modern LLMs. SPECULATIVE DECODING IS COMPUTE-BOUND This trend suggests that

110 likes, 0 replies, 13 reposts

Vivek Galatage

@vivekgalatage

a month ago

What is SIMD and how to use it by Anılcan Gülkaya medium.com/@anilcangulkay… Similar to the quoted post on designing a SIMD algorithm from scratch, this article provides all the necessary elements to start with SIMD.

125 likes, 3 replies, 16 reposts

Hao Zhao

@haozhao_airsun

a month ago

If you’re excited by Tesla’s new world model, meet OmniNWM—our research take on panoramic, controllable driving world models • Ultra-long demos • precise camera control • RGB/semantics/depth/occupancy • intrinsic closed-loop rewards Arxiv: arxiv.org/abs/2510.18313 Watch:

179 likes, 5 replies, 27 reposts

Caglar

@caglar_ee

a month ago

Video lectures, Maryland UMD CMSC351 Introduction to Algorithms, by Mohammad Hajiaghayi youtube.com/playlist?list=…

58 likes, 1 reply, 9 reposts

Qingqing Zhao

@qingqing_zhao_

a month ago

We build world models grounded in physical reality, naturally generalizing across cars and robots 🤖

1.1K likes, 19 replies, 56 reposts

Weiyang Liu

@besteuler

a month ago

🤯 Merging many finetuned LLMs into one model, effectively? Introducing Functional Dual Anchor (FDA), a new framework for model merging. 🚀 Current merging works poorly due to the underlying parameter conflicts. FDA shifts knowledge integration to the input-representation space

216 likes, 3 replies, 39 reposts
