Vignesh Padmanabhan (@vigg_1991)'s Twitter Profile
Vignesh Padmanabhan

@vigg_1991

Lead Data Scientist @Codvo2 | Working on internal AI projects and for client @SparkCognition | Masters @FollowStevens | Unleashing the power of LLMs | TS | CV

ID: 120717378

Joined: 07-03-2010 09:40:42

2.2K Tweets

448 Followers

2.2K Following

Unsloth AI (@unslothai)'s Twitter Profile Photo

A Complete Guide to Fine-tuning LLMs in 20 mins! Learn to:

• Choose the correct model & training method (LoRA, FFT, GRPO)
• Build Datasets & Chat templates
• Train with Unsloth notebooks
• Run & deploy your LLM in llama.cpp, Ollama & Open WebUI

Docs: docs.unsloth.ai
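
The LoRA method mentioned in the guide can be illustrated outside any training framework. This is a minimal NumPy sketch of the core idea (not Unsloth's actual API): the pretrained weight stays frozen, and only a low-rank update B @ A is trained, so far fewer parameters change than in full fine-tuning (FFT).

```python
import numpy as np

# Toy layer sizes; real LLM layers are much larger, which is where
# the parameter savings of rank-r adapters actually matter.
d_out, d_in, r = 8, 8, 2
rng = np.random.default_rng(0)

W = rng.normal(size=(d_out, d_in))     # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01  # trainable, rank r
B = np.zeros((d_out, r))               # trainable, initialized to zero

def lora_forward(x):
    # Adapted layer: base output plus the low-rank correction.
    # Because B starts at zero, the adapted model initially matches
    # the base model exactly.
    return W @ x + B @ (A @ x)

x = rng.normal(size=d_in)
assert np.allclose(lora_forward(x), W @ x)   # no-op until B is trained
assert A.size + B.size < W.size              # fewer trainable parameters
```

Here LoRA trains r * (d_in + d_out) = 32 values instead of the 64 in W; at realistic layer sizes that gap is several orders of magnitude.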

Google Research (@googleresearch)'s Twitter Profile Photo

Let your wearable data "speak" for itself! Introducing SensorLM, a family of sensor-language foundation models trained on ~60 million hours of data, enabling robust wearable data understanding with natural language. → goo.gle/4lSLwQi

Ritchie Vink (@ritchievink)'s Twitter Profile Photo

Polars 1.32 is out and it lands a lot!

Let's go through a few:

1/4
Selectors are now implemented in Rust and we can finally select arbitrary nested types:

Google Research (@googleresearch)'s Twitter Profile Photo

Introducing Nested Learning: A new ML paradigm for continual learning that views models as nested optimization problems to enhance long context processing. Our proof-of-concept model, Hope, shows improved performance in language modeling. Learn more: goo.gle/47LJrzI

Weaviate • vector database (@weaviate_io)'s Twitter Profile Photo

Vector search: fast, accurate, or affordable.

Pick... all three? ✨

Most engineering teams are trapped in an expensive cycle: as their AI applications scale, they're forced to choose between performance and budget. More data means bigger infrastructure bills, slower searches,

Cameron R. Wolfe, Ph.D. (@cwolferesearch)'s Twitter Profile Photo

The original PPO-based RLHF pipeline had 4 model copies:

1. Policy
2. Reference
3. Critic
4. Reward Model

Recent GRPO-based RLVR pipelines have eliminated all of these models except for the policy.

- The critic is no longer needed because values are estimated from group

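
The group-based value estimate the thread describes can be sketched in a few lines of plain Python (an illustrative sketch, not any specific library's GRPO implementation): sample several completions per prompt, then take each completion's advantage as its reward normalized against the group's mean and standard deviation, which is what removes the need for a learned critic.

```python
from statistics import mean, stdev

def group_advantages(rewards, eps=1e-8):
    """Advantage of each sampled completion relative to its group."""
    mu = mean(rewards)
    sigma = stdev(rewards) if len(rewards) > 1 else 0.0
    return [(r - mu) / (sigma + eps) for r in rewards]

# One prompt, four sampled completions scored by a verifiable reward
# (e.g. 1.0 if the answer checks out, 0.0 otherwise):
adv = group_advantages([1.0, 0.0, 1.0, 0.0])
assert adv[0] > 0 and adv[1] < 0          # above/below the group mean
assert abs(sum(adv)) < 1e-6               # advantages center on zero
```

Completions that beat their own group's average get a positive advantage and are reinforced; the group itself plays the role the critic's value estimate played in PPO.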
Sebastian Raschka (@rasbt)'s Twitter Profile Photo

I (finally) put together a new LLM Architecture Gallery that collects the architecture figures all in one place!
sebastianraschka.com/llm-architectu…

Shann³ (@shannholmberg)'s Twitter Profile Photo

how autoresearch works, simplified

it's a pattern that lets AI agents run experiments and improve anything you can measure

three files is all you need, everyone should be running it. ↓

> program.md is where you tell the agent what to do. your goal, the rules it has to

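
Based only on the description above, a program.md might look something like this. This is a purely hypothetical sketch: the goal, rules, and `make bench` command are invented examples, and the post is truncated before it names the other two files, so they are not sketched here.

```markdown
# Goal
Reduce p95 latency of the /search endpoint below 200 ms.

# Rules
- Only modify files under src/; never touch the eval harness.
- Run `make bench` after every change and record the measurement.
```
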
Andrej Karpathy (@karpathy)'s Twitter Profile Photo

LLM Knowledge Bases

Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating

Sydney Runkle (@sydneyrunkle)'s Twitter Profile Photo

we just shipped support for subagents with `deepagents deploy`!

add an agents/ dir to your project with an AGENTS.md per specialized subagent.

subagents are great for task delegation with isolated/optimized context

docs.langchain.com/oss/python/dee…
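
The per-subagent layout described above might look like this on disk. This is a hedged sketch: beyond the `agents/` directory and the `AGENTS.md` filename, the per-subagent subdirectories and their names ("researcher", "writer") are assumptions, so check the linked docs for the exact convention `deepagents deploy` expects.

```shell
# One subdirectory per specialized subagent, each with its own AGENTS.md
mkdir -p agents/researcher agents/writer

cat > agents/researcher/AGENTS.md <<'EOF'
You are a research subagent. Gather sources and summarize findings.
EOF

cat > agents/writer/AGENTS.md <<'EOF'
You are a writing subagent. Turn research notes into polished prose.
EOF
```

Keeping each subagent's instructions in its own file is what gives it the isolated, task-specific context the post mentions.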