Baraban (@bara_ban) Twitter Tweets • TwiCopy

Baraban

@bara_ban

+ Follow

Software engineer, drawn towards understanding the first principles of the universe and consciousness.

ID: 169047743

calendar_today21-07-2010 12:15:19

1,1K Tweet

508 Takipçi

327 Takip Edilen

Baraban

@bara_ban

a month ago

Omnipotence without forgetting is spoilers.

thumb_up_off_alt60

chat_bubble_outline3

repeat1

shareShare

Baraban

@bara_ban

a month ago

Trying Claude Code, installed CLI version on virtual Ubuntu.

thumb_up_off_alt41

chat_bubble_outline0

repeat0

shareShare

12 first experiments done github.com/KintaroAI/rese… Currently using Claude Opus 4.6 to run experiments - pretty happy. Thinking about using huggingface.co to store artifacts (models, training logs, figures) Created experiment and testing protocols - will improve them

thumb_up_off_alt23

chat_bubble_outline2

repeat3

shareShare

Baraban

@bara_ban

a month ago

Ran a bunch of experiments today with Claude Opus 4.6 as my research partner - comparing baseline vs blend (bigram embedding mixing) vs Hebbian pull (non-learnable co-occurrence force on embeddings) for GPT-2 training on TinyStories. Key findings: 1) Blend-G8 consistently beats

thumb_up_off_alt23

chat_bubble_outline1

repeat1

shareShare

Baraban

@bara_ban

a month ago

Been thinking for a while: "possible" and "impossible" are often just names we give to the presence or absence of will.

thumb_up_off_alt37

chat_bubble_outline3

repeat7

shareShare

Baraban

@bara_ban

a month ago

I don't know for sure, but it must be pretty tough being an AI safety researcher in the US these days. Every single time you bring up any concern... you just getting this x.com/i/status/19097…

thumb_up_off_alt29

chat_bubble_outline0

repeat3

shareShare

Baraban

@bara_ban

a month ago

TIL: large transformers need LR warmup at the start of training. Small models converge fine without it, large ones don't.

thumb_up_off_alt37

chat_bubble_outline0

repeat4

shareShare

Baraban

@bara_ban

24 days ago

Remember these? Foot-operated door openers that showed up during the pandemic so you didn’t have to touch handles. Did your workplace have it? Someone probably made a fortune.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Baraban

@bara_ban

24 days ago

A toy model of topographic map formation - how thalamus neurons self-organize spatially through local correlation-based rules. No pre-training, just greedy local attraction. Converges pretty good but can be better. Experiment: take an image, scramble all pixels randomly, then

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Baraban

@bara_ban

21 days ago

when autocompact kicks in, it almost feels like losing a friend

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Baraban

@bara_ban

20 days ago

TIL while searching for unsupervised competitive learning: - HTM - Hierarchical Temporal Memory and - SOM - Self-Organizing Map Looking for a small unsupervised competitive cell that maps a low-dimensional input vector to a low-dimensional probability output, usually with one

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Baraban

@bara_ban

18 days ago

looks like they can't hold it down much longer

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare