Johnathan (@johnathan0827) Twitter Tweets • TwiCopy

Johnathan

@johnathan0827

4 years ago

我完成了 Unordered Data Structures！查看我的证书 coursera.org/share/948f1f35… #Coursera

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Very simple, minimal implementations for LLM inference at the edge with a lot of momentum, and a number of developing extensions across GPU support, quantization++, training/finetuning, etc. 👏 looking forward! +"Inference at the edge" manifesto good read:

thumb_up_off_alt2,2K

chat_bubble_outline43

repeat309

shareShare

AK

@_akhaliq

3 years ago

Introducing Superalignment by Open AI blog: openai.com/blog/introduci… Our goal is to build a roughly human-level automated alignment researcher. We can then use vast amounts of compute to scale our efforts, and iteratively align superintelligence. To align the first automated

thumb_up_off_alt202

chat_bubble_outline4

repeat52

shareShare

Yann LeCun

@ylecun

3 years ago

There is a hierarchy of training paradigms: - Architectural: uses general properties of the data to direct the architecture of the learning system. - Self-Supervised: can use lots of (raw) data to pre-train a large system to represent the data in a task-independent way. -

thumb_up_off_alt265

chat_bubble_outline14

repeat49

shareShare

Andrej Karpathy

@karpathy

3 years ago

Code Interpreter Beta (rolling out to ChatGPT Plus) is quite powerful. It's your personal data analyst: can read uploaded files, execute code, generate diagrams, statistical analysis, much more. I expect it will take the community some time to fully chart its potential. To turn

thumb_up_off_alt3,3K

chat_bubble_outline95

repeat713

shareShare

Alberto Bietti

@albertobietti

2 years ago

Come to our #NeurIPS spotlight poster (#322) this afternoon, to hear about (i) the emergence of in-context learning in Transformers, (ii) the role of weight matrices as associative memories, and (iii) how to learn these via gradient steps!

thumb_up_off_alt175

chat_bubble_outline1

repeat23

shareShare

Johnathan

@johnathan0827

a year ago

哈哈哈哈

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

s1r1us

@s1r1u5_

10 months ago

Andrej Karpathy >I should clarify that the risk is highest if you're running local LLM agents (e.g. Cursor, Claude Code, etc.). for sure, one example being this bug. youtube.com/watch?v=23Mz7q…

thumb_up_off_alt82

chat_bubble_outline3

repeat3

shareShare

Yuandong Tian

@tydsh

10 months ago

📢We show that continuous latent reasoning has a theoretical advantage over discrete token reasoning (arxiv.org/abs/2505.12514): For a graph with n vertices and graph diameter D, a two-layer transformer with D steps of continuous CoTs can solve the directed graph reachability

thumb_up_off_alt1,1K

chat_bubble_outline25

repeat159

shareShare

Johnathan

@johnathan0827

10 months ago

how to make a such bot? instagram.com/reel/DKzO-7nzm…

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

GeleMaster

@gele_master

9 months ago

Andy Stewart anyrouter.top/register?aff=W… ，这是一个第三方claude code的平台，github注册后可以获取50美元 claude code,很好用,不需要代理

thumb_up_off_alt19

chat_bubble_outline1

repeat3

shareShare

Andy Stewart

@manateelazycat

9 months ago

anyrouter是今天大佬推荐最多的方案，受教了，大佬们牛逼旅游完就回去用claude code搬砖

thumb_up_off_alt36

chat_bubble_outline4

repeat2

shareShare

Decart

@decartai

9 months ago

Introducing MirageLSD: The First Live-Stream Diffusion (LSD) AI Model Input any video stream, from a camera or video chat to a computer screen or game, and transform it into any world you desire, in real-time (<40ms latency). Here’s how it works (w/ demo you can use!):

thumb_up_off_alt1,1K

chat_bubble_outline108

repeat333

shareShare

Donald J. Trump

@realdonaldtrump

9 years ago

..North Korea is a rogue nation which has become a great threat and embarrassment to China, which is trying to help but with little success.

thumb_up_off_alt58,58K

chat_bubble_outline9,9K

repeat14,14K

shareShare