Dr. Carlos Toxtli (@ctoxtli) 's Twitter Profile
Dr. Carlos Toxtli

@ctoxtli

📜 Assistant Professor @ClemsonUniv
🥼 Director Human-AI Empowerment Lab @ClemsonAI
🤖 Past: Google, United Nations, Snap, Microsoft Research

ID: 51934480

linkhttp://www.carlostoxtli.com/#bio calendar_today29-06-2009 02:56:47

13,13K Tweet

3,3K Followers

3,3K Following

el.cine (@ehuanglu) 's Twitter Profile Photo

omg.. this cant be real China’s 4DV AI just dropped 4D Gaussian Splatting, you can turn 2D video into 4D with sound.. imagine.. we will be able to change camera angle, zoom in/out while watching movies 5 examples:

Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

Brilliant Paper. We need to evaluate reasoning steps separately for knowledge correctness and reasoning quality LLMs give a single right answer but hide wrong facts or sloppy reasoning. This paper scores each reasoning step, exposing which sentences supply real knowledge

Brilliant Paper.  

We need to evaluate reasoning steps separately for knowledge correctness and reasoning quality

LLMs give a single right answer but hide wrong facts or sloppy reasoning. 

This paper scores each reasoning step, exposing which sentences supply real knowledge
Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

Training on wrong answers outpaces training on correct ones. 10 times more learning emerges from plausible errors than from truths. Large language models refine their accuracy slowly when they learn only from correct examples. This paper introduces Likra, which trains one

Training on wrong answers outpaces training on correct ones.

10 times more learning emerges from plausible errors than from truths.

Large language models refine their accuracy slowly when they learn only from correct examples.

This paper introduces Likra, which trains one
Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Alibaba's RL LLM training library: ROLL "We introduce ROLL, an efficient, scalable, and user-friendly library designed for Reinforcement Learning Optimization for Large-scale Learning. ROLL caters to three primary user groups: tech pioneers aiming for cost-effective,

Alibaba's RL LLM training library: ROLL

"We introduce ROLL, an efficient, scalable, and user-friendly library  designed for Reinforcement Learning Optimization for Large-scale  Learning. ROLL caters to three primary user groups: tech pioneers aiming  for cost-effective,
Sam Altman (@sama) 's Twitter Profile Photo

wrote a new post, the gentle singularity. realized it may be the last one like this i write with no AI help at all. (proud to have written "From a relativistic perspective, the singularity happens bit by bit, and the merge happens slowly" the old-fashioned way)

Angry Tom (@angrytomtweets) 's Twitter Profile Photo

AI is getting crazier. MeiGen’s new AI model, MultiTalk, creates some of the most realistic lip sync videos yet. Nothing is real anymore... 10 insane examples:

hardmaru (@hardmaru) 's Twitter Profile Photo

Reinforcement Learning Teachers of Test Time Scaling In this new paper, we introduce a new way to teach LLMs how to reason by learning to teach, not solve! The core idea: A teacher model is trained via RL to generate explanations from question-answer pairs, optimized to improve

Reinforcement Learning Teachers of Test Time Scaling

In this new paper, we introduce a new way to teach LLMs how to reason by learning to teach, not solve!

The core idea: A teacher model is trained via RL to generate explanations from question-answer pairs, optimized to improve
Brendan Jowett (@jowettbrendan) 's Twitter Profile Photo

BREAKING: Google just turned Gemini into a full-blown AI school system. Teachers can now assign AI experts to students. Students can auto-generate quizzes and visual explainers. And it's all free in Google Workspace for Education. Here’s what just dropped 👇

elvis (@omarsar0) 's Twitter Profile Photo

Small Language Models are the Future of Agentic AI Lots to gain from building agentic systems with small language models. Capabilities are increasing rapidly! AI devs should be exploring SLMs. Here are my notes:

Small Language Models are the Future of Agentic AI

Lots to gain from building agentic systems with small language models.

Capabilities are increasing rapidly!

AI devs should be exploring SLMs.

Here are my notes:
elvis (@omarsar0) 's Twitter Profile Photo

Threats in LLM-Powered AI Agents Workflows Neat survey of typical threats you encounter when building AI agents. Prompt injections and protocol exploits included. Bookmark this one!

Threats in LLM-Powered AI Agents Workflows

Neat survey of typical threats you encounter when building AI agents.

Prompt injections and protocol exploits included.

Bookmark this one!
Jerry Liu (@jerryjliu0) 's Twitter Profile Photo

Introducing NotebookLlama - an open-source version of NotebookLM! 📓🦙 NotebookLlama is a full implementation of NotebookLM that includes all the capabilities that makes it so great for researchers+business users: ✅ Create a knowledge repository of documents. Has likely higher

elvis (@omarsar0) 's Twitter Profile Photo

Context Engineering Guide I'm writing a detailed guide on context engineering for AI devs. v1 is out now! (bookmark it) I use a concrete deep research multi-agent example to show what context engineering involves.

Context Engineering Guide

I'm writing a detailed guide on context engineering for AI devs.

v1 is out now! (bookmark it)

I use a concrete deep research multi-agent example to show what context engineering involves.
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

How to build a thriving open source community by writing code like bacteria do 🦠. Bacterial code (genomes) are: - small (each line of code costs energy) - modular (organized into groups of swappable operons) - self-contained (easily "copy paste-able" via horizontal gene

How to build a thriving open source community by writing code like bacteria do 🦠. Bacterial code (genomes) are:

- small (each line of code costs energy)
- modular (organized into groups of swappable operons)
- self-contained (easily "copy paste-able" via horizontal gene
elvis (@omarsar0) 's Twitter Profile Photo

A Survey of Context Engineering 160+ pages covering the most important research around context engineering for LLMs. This is a must-read! Here are my notes:

A Survey of Context Engineering

160+ pages covering the most important research around context engineering for LLMs.

This is a must-read!

Here are my notes:
Ai2 (@allen_ai) 's Twitter Profile Photo

Great science starts with great questions. 🤔✨ Meet AutoDS—an AI that doesn’t just hunt for answers, it decides which questions are worth asking. 🧵

Great science starts with great questions. 🤔✨ Meet AutoDS—an AI that doesn’t just hunt for answers, it decides which questions are worth asking. 🧵
Alexander Wei (@alexwei_) 's Twitter Profile Photo

1/N I’m excited to share that our latest OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).

1/N I’m excited to share that our latest <a href="/OpenAI/">OpenAI</a> experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
hardmaru (@hardmaru) 's Twitter Profile Photo

Not Even Bronze: Evaluating LLMs on 2025 International Math Olympiad 🥉 matharena.ai/imo/ Nice blog post from the team behind MathArena: Evaluating LLMs on Uncontaminated Math Competitions (arxiv.org/abs/2505.23281) providing independent analysis of LLM performance on IMO.

Not Even Bronze: Evaluating LLMs on 2025 International Math Olympiad 🥉

matharena.ai/imo/

Nice blog post from the team behind MathArena: Evaluating LLMs on Uncontaminated Math Competitions (arxiv.org/abs/2505.23281) providing independent analysis of LLM performance on IMO.