Sree Harsha Tanneru (@sreetanneru) 's Twitter Profile
Sree Harsha Tanneru

@sreetanneru

ID: 1632970337003651072

linkhttp://harsha070.github.io calendar_today07-03-2023 05:04:18

76 Tweet

20 Takipçi

40 Takip Edilen

Sree Harsha Tanneru (@sreetanneru) 's Twitter Profile Photo

Writing is such an underrated skill in research. In times of short attention spans, writing helps maintain that train of thought.

Sree Harsha Tanneru (@sreetanneru) 's Twitter Profile Photo

Despite seeming intuitive, leave-one-out is often not a good way to measure feature importance scores (or) faithfulness, especially for small models. A partial input is often an O.O.D. point for small models, and therefore we shouldn't read too much into the delta in preds ?

Chirag Agarwal (@_cagarwal) 's Twitter Profile Photo

We show that Verbalized uncertainty estimates are unreliable and propose several probing techniques to quantify uncertainty in explanations. Check out the work at arxiv.org/abs/2311.03533

We show that Verbalized uncertainty estimates are unreliable and propose several probing techniques to quantify uncertainty in explanations. Check out the work at arxiv.org/abs/2311.03533
Sree Harsha Tanneru (@sreetanneru) 's Twitter Profile Photo

What if one day we'll have enough compute to model everything (language, vision, speech) as gaussian processes. Inherent interpretability is real ?

Gabriele Sarti (@gsarti_) 's Twitter Profile Photo

My Hugging Face collection of LM interpretability daily picks keeps growing! 🔍 Today's pick: Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models by Chirag Agarwal Sree Harsha Tanneru 𝙷𝚒𝚖𝚊 𝙻𝚊𝚔𝚔𝚊𝚛𝚊𝚓𝚞 huggingface.co/collections/gs…

Sree Harsha Tanneru (@sreetanneru) 's Twitter Profile Photo

every tech co has an over-experienced and under-employed swe, whose job is to 1. over-engineer systems. eg - adding monitoring+auditing+authorization+plugins+machine learning for a http request router. 2. drop system design gyaan and feel holier-than-thou. it if works, it works.