Peter Barnett (@csgosmorf)'s Twitter Profile
Peter Barnett

@csgosmorf

ID: 1381867369916588040

Joined: 13-04-2021 07:10:39

462 Tweets

35 Followers

223 Following

Peter Barnett (@csgosmorf):

Shouldn't [x,y] = xy - yx be called the "anti-commutator", since [x,y] = -[y,x], and since it "measures anti-commutativity"? Then {x,y} = xy + yx would be called the "commutator", since {x,y} = {y,x} and since it measures commutativity. Or am I missing something?
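The two symmetry claims in the tweet are easy to verify numerically. A minimal sketch in plain Python, using an arbitrary pair of non-commuting 2×2 matrices chosen for illustration, confirms that [x,y] is antisymmetric and {x,y} is symmetric under swapping arguments:

```python
# Check the bracket symmetries with two non-commuting 2x2 matrices
# (here the Pauli matrices sigma_x and sigma_z, chosen for illustration).
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def bracket(a, b):   # [a, b] = ab - ba
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]

def curly(a, b):     # {a, b} = ab + ba
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] + ba[i][j] for j in range(2)] for i in range(2)]

x = [[0, 1], [1, 0]]
y = [[1, 0], [0, -1]]

neg = lambda m: [[-e for e in row] for row in m]
assert bracket(x, y) == neg(bracket(y, x))   # [x,y] = -[y,x]: antisymmetric
assert curly(x, y) == curly(y, x)            # {x,y} = {y,x}: symmetric
```

Any pair of square matrices of the same size would do; non-commuting ones make the check non-trivial, since for commuting matrices both brackets collapse to 0 and 2xy.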

Peter Barnett (@csgosmorf):

Alexey Guzey **Explaining Induction** 1/? To grok it we need predicates. A predicate is a function that maps each input to a truth value. E.g., let n%2 denote the remainder after dividing n by 2 (so it is 0 for evens and 1 for odds). Let isEven(n) = (n%2 = 0) and let isOdd(n) = (n%2 = 1).
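The predicates in the thread translate directly into Python. A small sketch (the names isEven/isOdd follow the tweet) also checks the induction-flavored fact that parity holds at the base case and flips at every successor step:

```python
# Predicates from the thread, as Python functions (names follow the tweet).
def isEven(n):
    return n % 2 == 0   # maps n to True iff the remainder n%2 is 0

def isOdd(n):
    return n % 2 == 1   # maps n to True iff the remainder n%2 is 1

# Induction flavor: the base case holds, and parity flips at every
# successor step, so exactly one predicate holds for each natural n.
assert isEven(0)                         # base case
for n in range(1000):
    assert isEven(n + 1) == isOdd(n)     # step: n -> n+1 flips parity
    assert isEven(n) != isOdd(n)         # even/odd partition the naturals
```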

Peter Barnett (@csgosmorf):

If the entirety of Earth’s surface had a tiling like this and you stepped on 2 new tiles per second, 8 hours per day, every day, it would take you around 100 million years to step on every last tile. If you could observe 1M new tiles per second you still wouldn’t live to see it all.

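The tweet's figures can be sanity-checked with back-of-envelope arithmetic. The tile size is not stated (the referenced image is unavailable), so the sketch below infers the implied tile area from Earth's surface area (~510 million km²); the inferred tile (roughly half a meter on a side) and the ~67-year observation time are consistent with the tweet's claims:

```python
# Sanity check of the tweet's numbers (tile size is inferred, not given).
tiles_per_day = 2 * 8 * 3600            # 2 new tiles/second, 8 hours/day
years_walking = 100_000_000
total_tiles = tiles_per_day * 365 * years_walking   # ~2.1e15 tiles

earth_surface_m2 = 510e12               # Earth's surface ~510 million km^2
tile_area_m2 = earth_surface_m2 / total_tiles
assert 0.2 < tile_area_m2 < 0.3         # ~0.24 m^2 -> ~0.5 m square tiles

# Observing 1M new tiles per second, nonstop:
seconds_observing = total_tiles / 1_000_000
years_observing = seconds_observing / (365 * 24 * 3600)
assert 60 < years_observing < 70        # ~67 years: longer than most remaining lifespans
```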
Alex Albert (@alexalbert__):

Fun story from our internal testing on Claude 3 Opus. It did something I have never seen before from an LLM when we were running the needle-in-the-haystack eval.

For background, this tests a model’s recall ability by inserting a target sentence (the "needle") into a corpus of
Peter Barnett (@csgosmorf):

System 2 thinking was a step up. Maybe the next step up is introspection via meta-representations. It could not only know things, but also know what it knows

Peter Barnett (@csgosmorf):

“Error in message stream” almost every time I prompt o1, after it thinks for a couple of minutes and writes almost the entire answer. I can only imagine how much compute is being wasted by this.

Peter Barnett (@csgosmorf):

Is it just me or does o1-pro perform worse when source code files are uploaded than when they’re each copy-pasted into the chat window?

AK (@_akhaliq):

Microsoft presents rStar-Math

Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

On the MATH benchmark, it improves Qwen2.5-Math-7B from 58.8% to 90.0% and Phi3-mini-3.8B from 41.4% to 86.4%, surpassing o1-preview by +4.5% and +0.9%. On the USA Math Olympiad
Phrases (@phrases1439078):

Major update to my free app “Lexi: Vocabulary Crosswords” for expanding vocab via crosswords with spaced repetition! Now featuring:
—  fill-in-the-blank clues testing in-context application
— pronunciation
— improved definitions
50% of profits go to Qualia Research Institute. #Vocabulary #QRI