David Liu (@davidwnliu) Twitter Tweets • TwiCopy

Guillaume Bellec

2 years ago

Pre-print: machine learning for neuroscience. We build interpretable biological network reconstructions from electrode recordings with ML and optimal transport. Towards models of mechanisms driving behavior, we focus on single-trial neural activity and trial variability. 1/6

Laura Ruis

@lauraruis

a year ago

How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this: Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢 🧵⬇️

Wayne Soo

@soowmwayne

a year ago

Continuous-time RNNs are used in neuroscience to model neural dynamics. CNNs are used in vision neuroscience for image processing. So what's the right architecture to model the biological visual system? We propose a hybrid. (#NeurIPS2024 spotlight!) openreview.net/forum?id=ZZ94a…

Kevin Patrick Murphy

@sirbayes

a year ago

I am happy to announce that the first draft of my RL tutorial is now available. arxiv.org/abs/2412.05265

Scott Linderman

@scott_linderman

a year ago

I'm excited to share our #NeurIPS2024 paper, "Modeling Latent Neural Dynamics with Gaussian Process Switching Linear Dynamical Systems" 🧠✨ We introduce the gpSLDS, a new model for interpretable analysis of latent neural dynamics! 🧵 1/10

Darshan 🦖

@darshan

a year ago

The most misunderstood condition: Brain fog. It's not just fatigue. It's not just stress. Here's what's really happening inside your body:

James Campbell

@jam3scampbell

10 months ago

The Road to AGI: along with Emiliano (who's awesome, go follow), I built an interactive timeline of everything in AI the past few years. We're living through the most exciting time in history and this site hopes to document it! Go visit: ai-timeline dot org (link below)

David D. Baek

@dbaek__

10 months ago

1/9 🚨 New Paper Alert: Cross-Entropy Loss is NOT What You Need! 🚨 We introduce harmonic loss as alternative to the standard CE loss for training neural networks and LLMs! Harmonic loss achieves 🛠️significantly better interpretability, ⚡faster convergence, and ⏳less grokking!

Aleksander Madry

@aleks_madry

10 months ago

Do current LLMs perform simple tasks (e.g., grade school math) reliably? We know they don't (is 9.9 larger than 9.11?), but why? Turns out that, for one reason, benchmarks are too noisy to pinpoint such lingering failures. w/ Josh Vendrow Eddie Vendrow Sara Beery 1/5

Brian S. Kim

@itchdoctor

9 months ago

Cancer neuroimmunology is real. Nociceptive neurons promote gastric tumour progression via a CGRP–RAMP1 axis | Nature nature.com/articles/s4158…

Miles Cranmer

@milescranmer

9 months ago

Why 'I don’t know' is the true test for AGI—it’s a strictly harder problem than text generation! This magnificent 62-page paper (arxiv.org/abs/2408.02357) formally proves AGI hallucinations are inevitable, with 50 pages (!!) of supplementary proofs.

David Duvenaud

@davidduvenaud

9 months ago

LLMs have complex joint beliefs about all sorts of quantities. And my postdoc James Requeima visualized them! In this thread we show LLM predictive distributions conditioned on data and free-form text. LLMs pick up on all kinds of subtle and unusual structure: 🧵

Akira Yoshiyama ⁂

@yoshiyama_akira

9 months ago

Happy to announce we outperformed OpenAI o1 with a 7B model :) We released two self-improvement methods for verifiable domains in our preliminary paper -->

Bindu Reddy

@bindureddy

9 months ago

Mercury Is The First Diffusion LLM! AI simply groks the patterns of the universe. Diffusion LLMs literally manifest the LLM response and are so next generation This is Mercury! The world’s first diffusion LLM

AK

@_akhaliq

9 months ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

MatthewBerman

@matthewberman

8 months ago

We knew very little about how LLMs actually work...until now. Anthropic just dropped the most insane research paper, detailing some of the ways AI "thinks." And it's completely different than we thought. Here are their wild findings: 🧵

AI at Meta

@aiatmeta

8 months ago

Today is the start of a new era of natively multimodal AI innovation. Today, we’re introducing the first Llama 4 models: Llama 4 Scout and Llama 4 Maverick — our most advanced models yet and the best in their class for multimodality. Llama 4 Scout • 17B-active-parameter model

AI at Meta

@aiatmeta

4 months ago

Introducing DINOv3: a state-of-the-art computer vision model trained with self-supervised learning (SSL) that produces powerful, high-resolution image features. For the first time, a single frozen vision backbone outperforms specialized solutions on multiple long-standing dense

Andrej Karpathy

@karpathy

a month ago

I quite like the new DeepSeek-OCR paper. It's a good OCR model (maybe a bit worse than dots), and yes data collection etc., but anyway it doesn't matter. The more interesting part for me (esp as a computer vision at heart who is temporarily masquerading as a natural language

Spencer Baggins

@bigaiguy

a month ago

🚨 MIT just humiliated every major AI lab and nobody’s talking about it. They built a new benchmark called WorldTest to see if AI actually understands the world… and the results are brutal. Even the biggest models Claude, Gemini 2.5 Pro, OpenAI o3 got crushed by humans.
