Johnathan (@johnathan0827) 's Twitter Profile
Johnathan

@johnathan0827

ID: 897361072117620736

calendar_today15-08-2017 07:35:26

36 Tweet

6 Followers

144 Following

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Very simple, minimal implementations for LLM inference at the edge with a lot of momentum, and a number of developing extensions across GPU support, quantization++, training/finetuning, etc. 👏 looking forward! +"Inference at the edge" manifesto good read:

Very simple, minimal implementations for LLM inference at the edge with a lot of momentum, and a number of developing extensions across GPU support, quantization++, training/finetuning, etc. 
👏 looking forward!

+"Inference at the edge" manifesto good read:
AK (@_akhaliq) 's Twitter Profile Photo

Introducing Superalignment by Open AI blog: openai.com/blog/introduci… Our goal is to build a roughly human-level automated alignment researcher. We can then use vast amounts of compute to scale our efforts, and iteratively align superintelligence. To align the first automated

Introducing Superalignment by Open AI

blog: openai.com/blog/introduci…

Our goal is to build a roughly human-level automated alignment researcher. We can then use vast amounts of compute to scale our efforts, and iteratively align superintelligence.

To align the first automated
Yann LeCun (@ylecun) 's Twitter Profile Photo

There is a hierarchy of training paradigms: - Architectural: uses general properties of the data to direct the architecture of the learning system. - Self-Supervised: can use lots of (raw) data to pre-train a large system to represent the data in a task-independent way. -

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Code Interpreter Beta (rolling out to ChatGPT Plus) is quite powerful. It's your personal data analyst: can read uploaded files, execute code, generate diagrams, statistical analysis, much more. I expect it will take the community some time to fully chart its potential. To turn

Code Interpreter Beta (rolling out to ChatGPT Plus) is quite powerful. It's your personal data analyst: can read uploaded files, execute code, generate diagrams, statistical analysis, much more. I expect it will take the community some time to fully chart its potential. 
To turn
Alberto Bietti (@albertobietti) 's Twitter Profile Photo

Come to our #NeurIPS spotlight poster (#322) this afternoon, to hear about (i) the emergence of in-context learning in Transformers, (ii) the role of weight matrices as associative memories, and (iii) how to learn these via gradient steps!

Come to our #NeurIPS spotlight poster (#322) this afternoon, to hear about (i) the emergence of in-context learning in Transformers, (ii) the role of weight matrices as associative memories, and (iii) how to learn these via gradient steps!
s1r1us (@s1r1u5_) 's Twitter Profile Photo

Andrej Karpathy >I should clarify that the risk is highest if you're running local LLM agents (e.g. Cursor, Claude Code, etc.). for sure, one example being this bug. youtube.com/watch?v=23Mz7q…

Yuandong Tian (@tydsh) 's Twitter Profile Photo

📢We show that continuous latent reasoning has a theoretical advantage over discrete token reasoning (arxiv.org/abs/2505.12514): For a graph with n vertices and graph diameter D, a two-layer transformer with D steps of continuous CoTs can solve the directed graph reachability

GeleMaster (@gele_master) 's Twitter Profile Photo

Andy Stewart anyrouter.top/register?aff=W… ,这是一个第三方claude code的平台,github注册后可以获取50美元 claude code,很好用,不需要代理

Decart (@decartai) 's Twitter Profile Photo

Introducing MirageLSD: The First Live-Stream Diffusion (LSD) AI Model Input any video stream, from a camera or video chat to a computer screen or game, and transform it into any world you desire, in real-time (<40ms latency). Here’s how it works (w/ demo you can use!):

Donald J. Trump (@realdonaldtrump) 's Twitter Profile Photo

..North Korea is a rogue nation which has become a great threat and embarrassment to China, which is trying to help but with little success.