El_Sturm (@lowtour)'s Twitter Profile
El_Sturm

@lowtour

From Gödel to Lubitsch by way of Bacon, Don DeLillo, and Lacan. Not forgetting Rita Hayworth, AI, and social dialogue... Stir well

ID: 364588428

Joined: 30-08-2011 00:43:09

1.1K Tweets

48 Followers

530 Following

Lior⚡ (@lioronai):

A must-read on RL by Google DeepMind research scientist Kevin Murphy just dropped on arXiv.

It gives a clear, updated overview of deep RL and sequential decision-making, with examples.
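
For a flavor of the sequential decision-making loop such an overview covers, here is a minimal tabular Q-learning sketch on a toy chain MDP. It is not taken from Murphy's text; the environment, rewards, and hyperparameters are all illustrative assumptions.

```python
import random

# Toy chain MDP: states 0..4, actions 0 (left) / 1 (right), reward 1 at the goal.
# Purely illustrative -- not from the book.
N_STATES, GOAL = 5, 4
ALPHA, GAMMA, EPS = 0.1, 0.9, 0.1  # assumed hyperparameters

Q = [[0.0, 0.0] for _ in range(N_STATES)]  # Q[state][action]

def step(s, a):
    """Environment transition: move left/right, reward 1 on reaching the goal."""
    s2 = max(0, s - 1) if a == 0 else min(N_STATES - 1, s + 1)
    return s2, float(s2 == GOAL), s2 == GOAL  # next state, reward, done

for episode in range(500):
    s, done = 0, False
    while not done:
        # Epsilon-greedy action selection.
        a = random.randrange(2) if random.random() < EPS else max(range(2), key=lambda i: Q[s][i])
        s2, r, done = step(s, a)
        # Q-learning update: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a)).
        Q[s][a] += ALPHA * (r + GAMMA * max(Q[s2]) - Q[s][a])
        s = s2

print(Q)  # right-action values should dominate along the chain
```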
Rohan Paul (@rohanpaul_ai):

Google just mapped every neuron & synapse in a block of mouse brain! 🤯

Massively slashing the cost and equipment barriers for such mapping.

LICONN (Light Microscopy-based Connectomics)

The electron microscopes used for connectomics research can cost millions of dollars, and
Andrej Karpathy (@karpathy):

We're missing (at least one) major paradigm for LLM learning. Not sure what to call it, possibly it has a name - system prompt learning?

Pretraining is for knowledge. Finetuning (SL/RL) is for habitual behavior. Both of these involve a change in parameters but a lot of human
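
A hypothetical sketch of what that third paradigm could look like (every name here is a stand-in; `call_llm` is not a real API): the weights stay frozen, and lessons distilled from feedback are appended to an editable system prompt.

```python
# Hypothetical sketch of "system prompt learning": weights stay frozen, and
# lessons distilled from feedback are written into an editable system prompt.
# `call_llm` is a placeholder, not a real API; it echoes for illustration.

def call_llm(system_prompt: str, user_msg: str) -> str:
    return f"[model reply given {len(system_prompt)} chars of system prompt]"

system_prompt = "You are a careful assistant."

def solve_and_learn(task: str, feedback: str | None) -> str:
    """Answer a task; if feedback arrives, write the lesson into the prompt."""
    global system_prompt
    answer = call_llm(system_prompt, task)
    if feedback:
        # Instead of a gradient step, ask the model to distill the feedback
        # into a reusable one-line lesson and append it to its own prompt.
        lesson = call_llm(
            system_prompt,
            f"Task: {task}\nAnswer: {answer}\nFeedback: {feedback}\n"
            "State one general lesson to apply next time.",
        )
        system_prompt += "\n- " + lesson.strip()
    return answer

solve_and_learn("Refactor this function.", "You ignored the edge cases.")
print(system_prompt)  # now carries an appended lesson, no parameter change
```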

Andriy Burkov (@burkov):

LLMs haven't reached a level of autonomy at which they can be trusted with an entire profession, and it's already clear to everyone except the ignorant, deranged, or delusional that they won't reach that level. What remains is LLMs as a tool that humans use.

Lior⚡ (@lioronai):

The whole system prompt of Claude has been leaked on GitHub, 24,000 tokens long.

It defines model behavior, tool use, and citation format.
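
For context on where such a prompt sits, here is a minimal sketch of a chat API call carrying a long system prompt, written against the Anthropic Python SDK as commonly documented; the model id and file name are assumptions.

```python
# Minimal sketch: a long system prompt (like the leaked one) is passed once
# per request and steers behavior, tool use, and citation format.
# Assumptions: the file name and the model id below are illustrative.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

long_system_prompt = open("claude_system_prompt.txt").read()  # ~24k tokens

response = client.messages.create(
    model="claude-3-5-sonnet-latest",  # assumed model id, may be outdated
    max_tokens=1024,
    system=long_system_prompt,  # behavior, tool use, citation rules live here
    messages=[{"role": "user", "content": "Summarize today's AI news."}],
)
print(response.content[0].text)
```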
Rohan Paul (@rohanpaul_ai):

Intelligent Document Processing (IDP) Leaderboard – A Unified Benchmark for OCR, KIE, VQA, Table Extraction, and More.

It's a unified benchmark for Vision-Language Models across 6 document AI tasks using 16 datasets and 9,229 documents. Gemini 2.5 Flash leads overall but stumbles
Swapna Kumar Panda (@swapnakpanda):

Stanford's Machine Learning Courses:

❯ CS221 - Artificial Intelligence
❯ CS229 - Machine Learning
❯ CS230 - Deep Learning
❯ CS234 - Reinforcement Learning
❯ CS224U - NL Understanding
❯ CS224N - NLP with Deep Learning

All FREE courses. Links inside:

elvis (@omarsar0):

LLMs Get Lost in Multi-turn Conversation

The cat is out of the bag.

Pay attention, devs.

This is one of the most common issues when building with LLMs today.

Glad there is now a paper to share insights.

Here are my notes:
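
As a hedged sketch of the setup such experiments use (the `chat` helper and the example shards are assumptions, not the paper's code), the contrast is the same specification delivered fully up front versus revealed shard by shard:

```python
# Illustrative sketch of single-turn vs. multi-turn prompting; `chat` is a
# placeholder for any chat-completion API and just echoes for illustration.

def chat(messages: list[dict]) -> str:
    return f"[reply to {len(messages)} message(s)]"

spec_shards = [
    "Write a Python function that parses a date string.",
    "Actually, it should accept both ISO and US formats.",
    "Also return None instead of raising on bad input.",
]

# Single turn: the full specification up front (where LLMs do well).
full_spec = " ".join(spec_shards)
single_turn_answer = chat([{"role": "user", "content": full_spec}])

# Multi-turn: the same spec revealed shard by shard (where models "get lost":
# early premature assumptions persist and later constraints get dropped).
messages = []
for shard in spec_shards:
    messages.append({"role": "user", "content": shard})
    reply = chat(messages)
    messages.append({"role": "assistant", "content": reply})
multi_turn_answer = messages[-1]["content"]
```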
Rohan Paul (@rohanpaul_ai):

Why LLMs Fail in Back-and-Forth Chats

Beautiful paper from @microsoft and Salesforce AI Research

Large Language Models (LLMs) are incredibly good at tackling tasks when you give them all the information upfront in one go. Think of asking for a code snippet with all requirements clearly
Ziming Liu (@zimingliu11):

Interested in the science of language models but tired of neural scaling laws? Here's a new perspective: our new paper presents neural thermodynamic laws -- thermodynamic concepts and laws naturally emerge in language model training!

AI is naturAl, not Artificial, after all.
Hafizur Rahman (@i_amhafiz):

7. Introduction to Machine Learning

Draw insights and offer recommendations from data, all while weighing the ethical dimensions of machine learning.

Link: applieddigitalskills.withgoogle.com/c/middle-and-h…

Charly Wargnier (@datachaz):

Wow. This is one of the best interactive sites I’ve seen for learning how LLMs work! 🔥 It starts w/ a clear intro and guides you through every core component: from Embedding, Layer Norm, and Self-Attention to MLPs, Transformer blocks, Softmax, and Output layers. link in 🧵↓

Alec Helbling (@alec_helbling):

Flow matching produces smooth, deterministic trajectories. In contrast, the sampling process of a diffusion model is chaotic, resembling the random motion of gas particles.
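
A toy 1-D sketch of that contrast, under heavy assumptions (the drift field below is an illustrative stand-in, not a learned model): Euler integration of a deterministic ODE yields a smooth path, while an Euler-Maruyama step of an SDE with the same drift adds Brownian jitter at every step.

```python
import numpy as np

# Toy 1-D contrast: flow matching samples by integrating a deterministic ODE,
# diffusion samples by integrating an SDE (drift + noise). The drift below is
# an illustrative stand-in, not either method's learned vector field.

rng = np.random.default_rng(0)
T, steps = 1.0, 1000
dt = T / steps
target = 2.0  # pretend the data distribution is a point mass at 2.0

def drift(x, t):
    # Simple field pulling x toward the target as t -> 1 (assumption).
    return (target - x) / (T - t + 1e-3)

x_ode = x_sde = rng.standard_normal()  # both start from the same noise sample
ode_path, sde_path = [x_ode], [x_sde]

for i in range(steps):
    t = i * dt
    # Flow matching: plain Euler step of the ODE -- smooth, deterministic.
    x_ode = x_ode + drift(x_ode, t) * dt
    # Diffusion: Euler-Maruyama step of an SDE -- same drift plus Brownian noise.
    x_sde = x_sde + drift(x_sde, t) * dt + 0.5 * np.sqrt(dt) * rng.standard_normal()
    ode_path.append(x_ode)
    sde_path.append(x_sde)

print(ode_path[-1], sde_path[-1])  # both land near 2.0; the SDE path is jagged
```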