Ruchit Rawal (@rawalruchit) 's Twitter Profile
Ruchit Rawal

@rawalruchit

CS Grad Student @UMDCS | Past: MPI-SWS, IISc & NSIT | Working on multi-modal understanding, robustness, & synthetic data generation.

ID: 1131114814770745344

Link: https://ruchitrawal.github.io/ | Joined: 22-05-2019 08:29:21

292 Tweets

349 Followers

3.3K Following

Dayal Kalra (@dayal_kalra) 's Twitter Profile Photo

Excited to share our paper "Universal Sharpness Dynamics..." is accepted to #ICLR2025! Neural net training exhibits rich curvature (sharpness) dynamics (sharpness reduction, progressive sharpening, Edge of Stability)- but why?🤔 We show that a minimal model captures it all! 1/n

Micah Goldblum (@micahgoldblum) 's Twitter Profile Photo

Drop by our ICLR workshop tomorrow on Building Trust in LLMs and LLM Applications! We’ll cover topics ranging from guardrails to explainability to regulation and more. We'll be in Hall 4 #6: building-trust-in-llms.github.io/iclr-workshop/…

Sara Hooker (@sarahookr) 's Twitter Profile Photo

Following the release of our recent work, we have spent considerable time engaging with lmarena.ai over the last week. The organizers had concerns about the correctness of our work on the reliability of chatbot arena rankings.

Sayak Paul (@risingsayak) 's Twitter Profile Photo

Despite the rise in combining LLM and DiT architectures for T2I synthesis, its design remains severely understudied. We explore several architectural choices that affect this design. We provide an open & reproducible training recipe that works at scale. This was done long ago

Dana Arad 🎗️ (@dana_arad4) 's Twitter Profile Photo

Tried steering with SAEs and found that not all features behave as expected? Check out our new preprint - "SAEs Are Good for Steering - If You Select the Right Features" 🧵

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Zero-Shot Vision Encoder Grafting via LLM Surrogates "We construct small “surrogate models” that share the same embedding space and representation language as the large target LLM by directly inheriting its shallow layers. Vision encoders trained on the surrogate can then be

Shashwat Goel (@shashwatgoel7) 's Twitter Profile Photo

Confused about recent LLM RL results where models improve without any ground-truth signal? We were too. Until we looked at the reported numbers of the Pre-RL models and realized they were severely underreported across papers. We compiled discrepancies in a blog below🧵👇

Chau Minh Pham (@chautmpham) 's Twitter Profile Photo

🤔 What if you gave an LLM thousands of random human-written paragraphs and told it to write something new -- while copying 90% of its output from those texts? 🧟 You get what we call a Frankentext! 💡 Frankentexts are surprisingly coherent and tough for AI detectors to flag.

Daeun Lee (@danadaeun) 's Twitter Profile Photo

Excited to share Video-Skill-CoT🎬🛠️– a new framework for domain-adaptive video reasoning with skill-aware Chain-of-Thought (CoT) supervision! ⚡️Key Highlights: ➡️ Automatically extracts domain-specific reasoning skills from questions and organizes them into a unified taxonomy,

Gowthami Somepalli (@gowthami_s) 's Twitter Profile Photo

Most papers discuss the hallucination problem in visual language models. In this paper, we present a framework to quantify both hallucination and omission problems in modern video LLMs. Both dataset and benchmarking code out!

Benno Krojer (@benno_krojer) 's Twitter Profile Photo

Excited to share the results of my internship research with AI at Meta, as part of a larger world modeling release! What subtle shortcuts are VideoLLMs taking on spatio-temporal questions? And how can we instead curate shortcut-robust examples at a large-scale? Details 👇🔬
