@chrisgpotts: The Linear Representation Hypothesis is now widely adopted despite its highly restrictive nature. Here, @robert_csordas, Atticus Geiger, @chrmanning & I present a counterexample to the LRH and argue for more expressive theories of interpretability: arxiv.org/abs/2408.10920

Christopher Potts

@chrisgpotts


Stanford Professor of Linguistics and, by courtesy, of Computer Science, and member of @stanfordnlp and @StanfordAILab. He/Him/His.

ID: 408714449

Link: http://web.stanford.edu/~cgpotts/ · Joined: 09-11-2011 19:59:28

2.2K Tweets

11.11K Followers

633 Following

Christopher Potts

@chrisgpotts

3 months ago

The Linear Representation Hypothesis is now widely adopted despite its highly restrictive nature. Here, Csordás Róbert, Atticus Geiger, Christopher Manning & I present a counterexample to the LRH and argue for more expressive theories of interpretability: arxiv.org/abs/2408.10920

282 likes · 10 replies · 65 retweets
