Akarsh Kumar (@akarshkumar0101) 's Twitter Profile
Akarsh Kumar

@akarshkumar0101

PhD Student @MIT_CSAIL
RS Intern @SakanaAILabs
RL, Meta-Learning, Emergence, Open-Endedness, ALife

ID: 965065559619534853

Website: https://akarshkumar.com/ · Joined: 18-02-2018 03:28:54

332 Tweets

1.1K Followers

887 Following

Kenneth Stanley (@kenneth0stanley) 's Twitter Profile Photo

Could a major opportunity to improve representation in deep learning be hiding in plain sight? Check out our new position paper: Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis. The idea stems from a little-known

Jeff Clune (@jeffclune) 's Twitter Profile Photo

Is there a cancer at the heart of modern AI, lurking just beneath the surface of its dazzling performance? Our research suggests maybe, but also shows elegant solutions are possible (though how to get them at scale remains a mystery). Much more research needs to be done to

Ami (@amiasadiii) 's Twitter Profile Photo

Akarsh Kumar Great work 👏. "There are reasons to believe that current models are still suffering from FER, especially at the frontiers of knowledge where there is less data." Even if 10T parameters handle the issue, "where do we go from there?" sounds like a solid foundation to invest

Phillip Isola (@phillip_isola) 's Twitter Profile Photo

Really thought-provoking work! In determining what makes a good representation, it might be the journey that matters not the destination.

John Bohannon (@bohannon_bot) 's Twitter Profile Photo

"Why can't my trained model generalize?" Maybe b/c its internal representation of the task is whack. Beautiful new paper from Jeff Clune's team. ~thread~ arxiv.org/abs/2505.11581

Jeff Clune (@jeffclune) 's Twitter Profile Photo

John Bohannon Akarsh Kumar Are you calling my brain FERry? I resemble that remark! re: 1(trillion) dollar question: we're working on it! We don't know the answer, but I strongly believe it will involve lots of principles from open-endedness (some known, some yet to be discovered). 😊🌱🔬🧪

Bary Levy (@barylevy_) 's Twitter Profile Photo

attentionmech People underestimate evolutionary algorithms. Great things happen when evolution meta-optimizes us to be better at evolving. But the naive one-to-one gene-to-function mapping used in many implementations is not sufficient for this to arise.
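
The distinction this tweet gestures at can be made concrete with a toy sketch: a (1+1) evolutionary loop whose genome is decoded either one-to-one into phenotype parameters (the naive mapping) or indirectly through shared basis functions, so that a single mutation has coordinated effects. The decoders, objective, and dimensions below are illustrative assumptions, not anything from the tweet or a specific paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def decode_direct(genome):
    # Naive mapping: gene i controls phenotype parameter i and nothing else.
    return genome

def decode_indirect(genome, n_params=64):
    # Indirect mapping: a few genes weight shared basis functions, so one
    # mutation reshapes many phenotype parameters in a coordinated way.
    t = np.linspace(0.0, 1.0, n_params)
    basis = np.stack([np.sin(2 * np.pi * (i + 1) * t) for i in range(genome.size)])
    return genome @ basis

def hill_climb(decode, genome_len, fitness, steps=500, sigma=0.1):
    # Tiny (1+1) evolutionary loop: keep a mutation if it doesn't hurt.
    genome = rng.normal(size=genome_len)
    for _ in range(steps):
        child = genome + sigma * rng.normal(size=genome_len)
        if fitness(decode(child)) >= fitness(decode(genome)):
            genome = child
    return decode(genome)

# Toy objective: match a smooth wave. The indirect encoding can reach it with
# a 4-gene genome; the direct encoding must adjust all 64 parameters separately.
target = np.sin(2 * np.pi * np.linspace(0.0, 1.0, 64))
fitness = lambda p: -np.mean((p - target) ** 2)
indirect = hill_climb(decode_indirect, 4, fitness)
direct = hill_climb(decode_direct, 64, fitness)
```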

Hyojin Bahng (@hyojinbahng) 's Twitter Profile Photo

Image-text alignment is hard — especially as multimodal data gets more detailed. Most methods rely on human labels or proprietary feedback (e.g., GPT-4V).

We introduce:
1. CycleReward: a new alignment metric focused on detailed captions, trained without human supervision.
2.
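
For readers skimming the thread, here is a minimal sketch of the cycle-consistency idea behind an alignment score like this: judge a caption by how well it regenerates the original image, with no human labels in the loop. The function names (`text_to_image`, `embed_image`) are hypothetical stand-ins for any generator and image encoder, not CycleReward's actual API.

```python
import numpy as np

def cosine(a, b):
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def cycle_consistency_score(image, caption, text_to_image, embed_image):
    """Score `caption` by how well it regenerates `image`: run the text back
    through a generator and compare encoder features of the two images.
    Both models are hypothetical placeholders; no human supervision is used."""
    reconstruction = text_to_image(caption)  # the text -> image leg of the cycle
    return cosine(embed_image(image), embed_image(reconstruction))
```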
Laura Ruis (@lauraruis) 's Twitter Profile Photo

Revisiting Louis Kirsch et al.’s general-purpose ICL by meta-learning paper and forgot how great it is. It's rare to be taken along on the authors' journey to understand the phenomenon they document like this. More toy dataset papers should follow this structure.

Han Guo (@hanguo97) 's Twitter Profile Photo

We know Attention and its linear-time variants, such as linear attention and State Space Models. But what lies in between?

Introducing Log-Linear Attention with:

- Log-linear time training
- Log-time inference (in both time and memory)
- Hardware-efficient Triton kernels
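
One way to read "what lies in between": instead of a single recurrent state (linear attention) or a full KV cache (softmax attention), keep O(log T) summaries over power-of-two chunks of the past. The sketch below maintains such buckets with binary-counter merges, so each decoding step reads at most log2(T)+1 states. It illustrates the complexity class only, assuming plain linear-attention summaries, and omits the paper's learned per-scale weights and Triton kernels.

```python
import numpy as np

class LogLinearState:
    """Toy decode-time state: one linear-attention summary per power-of-two
    bucket of past tokens, merged like a binary counter (Fenwick-style)."""

    def __init__(self):
        self.buckets = []  # (size, S) pairs; sizes are distinct powers of two

    def update(self, k, v):
        # Start a singleton summary S = k v^T, then merge equal-size buckets,
        # exactly like carry propagation when incrementing a binary counter.
        size, S = 1, np.outer(k, v)
        while self.buckets and self.buckets[-1][0] == size:
            prev_size, prev_S = self.buckets.pop()
            size, S = size + prev_size, S + prev_S
        self.buckets.append((size, S))

    def read(self, q):
        # Attend over the whole past by touching only the O(log T) buckets.
        # A real model would weight each scale with learned coefficients.
        return sum(q @ S for _, S in self.buckets)

# Usage: after T updates there are at most log2(T)+1 buckets to read.
state = LogLinearState()
rng = np.random.default_rng(0)
for _ in range(1000):
    state.update(rng.normal(size=8), rng.normal(size=8))
out = state.read(rng.normal(size=8))
assert len(state.buckets) <= 10  # binary-representation bound for T = 1000
```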
Akarsh Kumar (@akarshkumar0101) 's Twitter Profile Photo

oimo.io/works/life/ Incredible website by saharan / さはら visualizing Conway's Game of Life inside of Game of Life inside Game of Life ... and so on ... forever... Reminds me of the hierarchy of emergent structures in our world from physics to chemistry to biology. How many levels
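
For reference, the dynamic being nested on that site is just Conway's two rules: a cell is alive next step if it has exactly three live neighbors, or is alive with exactly two. A minimal NumPy step on a wrap-around grid (an illustrative sketch, unrelated to the site's implementation):

```python
import numpy as np

def life_step(grid):
    """One Game of Life update on a toroidal grid: count the 8 neighbors,
    then apply the birth (==3) and survival (alive and ==2) rules."""
    neighbors = sum(
        np.roll(np.roll(grid, dy, axis=0), dx, axis=1)
        for dy in (-1, 0, 1) for dx in (-1, 0, 1)
        if (dy, dx) != (0, 0)
    )
    return ((neighbors == 3) | ((grid == 1) & (neighbors == 2))).astype(grid.dtype)

# Example: advance a glider a few steps on an 8x8 torus.
grid = np.zeros((8, 8), dtype=int)
grid[0, 1] = grid[1, 2] = grid[2, 0] = grid[2, 1] = grid[2, 2] = 1
for _ in range(4):
    grid = life_step(grid)
```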