Quanshi Zhang (@quanshizhang) Twitter Tweets • TwiCopy

Quanshi Zhang

@quanshizhang

+ Follow

Associate Professor at Shanghai Jiao Tong University @sjtu1896. Interpretable machine learning.

ID: 1857070826014593024

linkhttp://qszhang.com/ calendar_today14-11-2024 14:39:40

4 Tweet

9 Followers

16 Following

Quanshi Zhang

@quanshizhang

a year ago

Can the inference logic of a DNN be faithfully explained as symbolic concepts? Our #ICLR2024 paper (arxiv.org/abs/2305.01939) makes an initial theoretical attempt to address this question. We prove that under three sufficient conditions, a DNN only encodes a small number of

thumb_up_off_alt0

chat_bubble_outline0

repeat1

shareShare

Quanshi Zhang

@quanshizhang

a year ago

Reasoning or memorization? Our recent work (arxiv.org/abs/2405.11880) proposes a new axiomatic framework to formulate and quantify the exact reasoning and memorization effects in an LLM. See our project page for a brief introduction: sjtu-xai-lab.github.io/ReasoningMemor…

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Quanshi Zhang

@quanshizhang

a year ago

What is the mathematical mechanism shared by different attribution methods? Our TPAMI paper (ieeexplore.ieee.org/document/10414…) unifies 14 post-hoc attribution methods from the perspective of interactions. We prove that all 14 attribution methods can be represented as a weighted

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Quanshi Zhang

@quanshizhang

a year ago

How to explain a DNN’s generalization ability and learning dynamics through the lens of interaction concepts? Our recent works (arxiv.org/abs/2405.10262, and arxiv.org/abs/2407.19198 in #NeurIPS2024) discover and theoretically prove a two-phase dynamics of interaction concepts

thumb_up_off_alt1

chat_bubble_outline0

repeat2

shareShare