zoe (@hx0dxs) 's Twitter Profile
zoe

@hx0dxs

e/acc

ID: 1823520621084446720

calendar_today14-08-2024 00:45:01

314 Tweet

15 Followers

187 Following

Unsloth AI (@unslothai) 's Twitter Profile Photo

You can now run FP8 reinforcement learning on consumer GPUs! Try DeepSeek-R1’s FP8 GRPO at home using only a 5GB GPU. Qwen3-1.7B fits in 5GB VRAM. We collabed with PyTorch to make FP8 RL inference 1.4× faster. Unsloth: 60% less VRAM, 12× longer context. docs.unsloth.ai/new/fp8-reinfo…

You can now run FP8 reinforcement learning on consumer GPUs!

Try DeepSeek-R1’s FP8 GRPO at home using only a 5GB GPU.

Qwen3-1.7B fits in 5GB VRAM.
We collabed with PyTorch to make FP8 RL inference 1.4× faster.
Unsloth: 60% less VRAM, 12× longer context.

docs.unsloth.ai/new/fp8-reinfo…
TuringPost (@theturingpost) 's Twitter Profile Photo

When @NVIDIA announced Nemotron 3 – it marked a symbolic turning point in a year that fundamentally reshaped open-source AI leadership. Is NVIDIA the new open-source king? What’s behind this strategy? Let's see. ▪️ It releases 3 trillion tokens of new pretraining, 18 million

When @NVIDIA announced Nemotron 3 – it marked a symbolic turning point in a year that fundamentally reshaped open-source AI leadership.

Is NVIDIA the new open-source king? What’s behind this strategy? Let's see.

▪️ It releases 3 trillion tokens of new pretraining, 18 million
Carlos E. Perez (@intuitmachine) 's Twitter Profile Photo

A 20B open-source model just OUTPERFORMED GPT-4o on long-term recall benchmarks. The secret? It stops treating memory like a database and starts treating it like a mind. Here is the breakdown of the HINDSIGHT architecture. 🧵 1/ Everyone "knows" how RAG (Retrieval Augmented

A 20B open-source model just OUTPERFORMED GPT-4o on long-term recall benchmarks.

The secret? It stops treating memory like a database and starts treating it like a mind.

Here is the breakdown of the HINDSIGHT architecture. 🧵
1/ Everyone "knows" how RAG (Retrieval Augmented
Omar Sanseviero (@osanseviero) 's Twitter Profile Photo

Introducing our latest open model: MedASR 🔬Speech to text model 🏥for healthcare-based voice applications 🤗available in Hugging Face ⚡️run with transformers Download right now huggingface.co/google/medasr

Hasan Toor ✪ (@hasantoxr) 's Twitter Profile Photo

Top engineers at OpenAI, Anthropic, and Google don't prompt like you do. They use 10 techniques that turn mediocre outputs into production-grade results. I spent 2 weeks reverse-engineering their methods. Here's what actually works (steal the prompts + techniques) 👇

Top engineers at OpenAI, Anthropic, and Google don't prompt like you do.

They use 10 techniques that turn mediocre outputs into production-grade results.

I spent 2 weeks reverse-engineering their methods.

Here's what actually works (steal the prompts + techniques) 👇
Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

Very interesting Github repo, with 13.3K+ stars ⭐️ DeepCode is an open-source multi-agent system that converts research papers and natural language descriptions into code. Uses Model Context Protocol (MCP) to orchestrate specialized agents handling document parsing, code

Very interesting Github repo, with 13.3K+ stars ⭐️

DeepCode is an open-source multi-agent system that converts research papers and natural language descriptions into code. 

Uses Model Context Protocol (MCP) to orchestrate specialized agents handling document parsing, code
Ben Smith (@bensmithlive) 's Twitter Profile Photo

New study from China found prolonged Bluetooth headset use strongly linked to thyroid nodules. OpenAI & Jony Ive are now building earbuds that sit directly in your ear canal 24/7. The closer EMF is to your thyroid, the worse. This is not going to end well.

New study from China found prolonged Bluetooth headset use strongly linked to thyroid nodules.

OpenAI & Jony Ive are now building earbuds that sit directly in your ear canal 24/7.

The closer EMF is to your thyroid, the worse. This is not going to end well.
Paul F. Austin (@paulaustin3w) 's Twitter Profile Photo

Psilocybin doesn’t work by “opening your mind.” It works by loosening the machinery that holds your sense of self together. A Washington University study followed healthy participants who received a single high dose of psilocybin, tracking brain activity before, during, and for

Psilocybin doesn’t work by “opening your mind.”

It works by loosening the machinery that holds your sense of self together.

A Washington University study followed healthy participants who received a single high dose of psilocybin, tracking brain activity before, during, and for
Ahmad (@theahmadosman) 's Twitter Profile Photo

There are maybe ~20-25 papers that matter. Implement those and you’ve captured ~90% of the alpha behind modern LLMs. Everything else is garnish. You want that list? Keep reading ;) The Top 26 Essential Papers (+5 Bonus Resources) for Mastering LLMs and Transformers This list