Joe Hoover (@joeehoover)'s Twitter Profile
Joe Hoover

@joeehoover

Automating AI eval @Apple

ID: 462542884

Joined: 13-01-2012 02:14:13

221 Tweets

563 Followers

1.1K Following

Horace He (@chhillee)'s Twitter Profile Photo

Happy to OSS gpt-fast, a fast and hackable implementation of transformer inference in <1000 lines of native PyTorch with support for quantization, speculative decoding, TP, Nvidia/AMD support, and more! Code: github.com/pytorch-labs/g… Blog: pytorch.org/blog/accelerat… (1/12)
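
One of gpt-fast's tricks is weight-only int8 quantization, which halves the memory traffic that dominates decoding. A minimal sketch of the idea in native PyTorch, assuming per-output-channel scales; the class and names are illustrative, not gpt-fast's actual code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class WeightOnlyInt8Linear(nn.Module):
    """Illustrative weight-only int8 linear, in the spirit of gpt-fast's
    int8 path (not the repo's actual code)."""

    def __init__(self, linear: nn.Linear):
        super().__init__()
        w = linear.weight.detach()                          # [out, in]
        scale = w.abs().amax(dim=1, keepdim=True) / 127.0   # per-channel scale
        self.register_buffer("weight_int8", torch.round(w / scale).to(torch.int8))
        self.register_buffer("scale", scale)
        self.bias = linear.bias

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Dequantize on the fly: decoding is memory-bound, so reading int8
        # weights is a win even with the extra multiply.
        w = self.weight_int8.to(x.dtype) * self.scale.to(x.dtype)
        return F.linear(x, w, self.bias)

layer = nn.Linear(4096, 4096)
qlayer = WeightOnlyInt8Linear(layer)
x = torch.randn(1, 4096)
print((layer(x) - qlayer(x)).abs().max())  # small quantization error
```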

Replicate (@replicate)'s Twitter Profile Photo

Businesses are building on open-source AI. But we’ve only reached a tiny fraction of them. That's why we raised a $40M Series B. Open-source is open for business 😎 replicate.com/blog/series-b

Hongyang Zhang (@hongyangzh)'s Twitter Profile Photo

Introducing EAGLE, a new method for fast LLM decoding based on compression:
- 3x faster 🚀 than vanilla decoding
- 2x faster 🚀 than Lookahead (on its benchmark)
- 1.6x faster 🚀 than Medusa (on its benchmark)
- provably maintains the text distribution
- trainable (in 1~2 days) and testable on RTX 3090s
Playground:
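
The "provably maintains the text distribution" property comes from the standard speculative-sampling accept/reject step that EAGLE shares with other draft-and-verify methods; EAGLE's novelty is in how the draft is produced. A sketch of that verification rule (my own naming, not EAGLE's code):

```python
import torch

def speculative_accept(p_target: torch.Tensor, q_draft: torch.Tensor, token: int) -> int:
    """Accept/reject rule that preserves the target model's distribution.
    p_target, q_draft: probability vectors over the vocab; `token` was
    sampled from q_draft by the draft model."""
    accept_prob = min(1.0, (p_target[token] / q_draft[token]).item())
    if torch.rand(()).item() < accept_prob:
        return token                                # keep the drafted token
    residual = torch.clamp(p_target - q_draft, min=0.0)
    residual = residual / residual.sum()            # resample from (p - q)+
    return int(torch.multinomial(residual, 1))
```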

Nate Raw (@_nateraw)'s Twitter Profile Photo

Feel the AGI!! 💪 Try out the new Mixtral model from Mistral AI, an 8x7B Mixture of Experts, now on Replicate! Big shout out to Dmytro Dzhulgakov for their minimal implementation that I used to get this shipped 🚀 The implementation is very slow for now, but it works 😅 replicate.com/nateraw/mixtra…
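
For context on "8x7B Mixture of Experts": each token is routed to the top-2 of 8 expert FFNs, so only a fraction of the weights run per token. A toy sketch of that routing (illustrative shapes, not the shipped implementation):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Top2MoE(nn.Module):
    """Toy Mixtral-style layer: route each token to 2 of 8 expert FFNs."""

    def __init__(self, dim: int = 32, hidden: int = 64, n_experts: int = 8):
        super().__init__()
        self.gate = nn.Linear(dim, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, hidden), nn.SiLU(), nn.Linear(hidden, dim))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: [tokens, dim]
        weights, idx = self.gate(x).topk(2, dim=-1)       # 2 experts per token
        weights = F.softmax(weights, dim=-1)              # renormalize the pair
        out = torch.zeros_like(x)
        for slot in range(2):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

print(Top2MoE()(torch.randn(5, 32)).shape)  # torch.Size([5, 32])
```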

Hamel Husain (@hamelhusain)'s Twitter Profile Photo

If you are using axolotl, you may have gotten confused by subtle differences in tokenization and seen artifacts like stray spaces. This can affect you at inference time. In this post, I explain exactly why it is happening and what you should do about it (link in second message 👇).

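A quick way to see the mismatch the post describes: encode a prompt and completion separately versus concatenated. Many BPE tokenizers fold leading spaces into tokens, so the two disagree. The checkpoint below is just an example:

```python
from transformers import AutoTokenizer

# Any Llama-style tokenizer shows the effect; this checkpoint is an example.
tok = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")

prompt, completion = "The answer is", " 42."
joint = tok.encode(prompt + completion, add_special_tokens=False)
pieces = (tok.encode(prompt, add_special_tokens=False)
          + tok.encode(completion, add_special_tokens=False))

print(joint == pieces)                    # often False: stray-space artifact
print(tok.convert_ids_to_tokens(joint))
print(tok.convert_ids_to_tokens(pieces))
```
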
Charlie Holtz (@charliebholtz)'s Twitter Profile Photo

I made a site to chat with Mistral AI's new 8x7B instruct model!
- free + open source
- streams at ~40 tokens/s
- beats GPT-3.5 on most benchmarks
Try it at mixtral.replicate.dev
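
If you'd rather stream tokens yourself than use the site, something like this should work with Replicate's Python client (pip install replicate, with REPLICATE_API_TOKEN set); the model slug below is my guess, not taken from the tweet:

```python
import replicate

# Stream tokens from a hosted Mixtral endpoint; the slug is illustrative.
for event in replicate.stream(
    "mistralai/mixtral-8x7b-instruct-v0.1",
    input={"prompt": "Explain speculative decoding in one paragraph."},
):
    print(str(event), end="", flush=True)
```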

AI at Meta (@aiatmeta)'s Twitter Profile Photo

Today we’re releasing Code Llama 70B: a new, more performant version of our LLM for code generation — available under the same license as previous Code Llama models.

Download the models ➡️ bit.ly/3Oil6bQ
• CodeLlama-70B
• CodeLlama-70B-Python
• CodeLlama-70B-Instruct

Replicate (@replicate)'s Twitter Profile Photo

Code Llama 70B is live on Replicate! It's the most powerful code generation model from AI at Meta, with instruct, Python, and base variants. Code Llama 70B Instruct is fine-tuned for understanding natural language instructions: replicate.com/meta/codellama…
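
Calling the instruct variant from Python might look like the following; the exact model slug is truncated in the tweet, so the one below is a guess:

```python
import replicate

# Hypothetical slug; check replicate.com for the real model page.
output = replicate.run(
    "meta/codellama-70b-instruct",
    input={"prompt": "Write a Python function that reverses a linked list."},
)
print("".join(output))
```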

Joe Hoover (@joeehoover)'s Twitter Profile Photo

We're building next-gen AI evaluation systems at Apple. Want to help? Looking for a contract ML Scientist/Engineer in NYC (or SEA/Bay Area). Need someone who wants to ship fast. Deep LLM experience required. Must be available now. DM resume + fit explanation. #MLjobs #Apple

Joe Hoover (@joeehoover)'s Twitter Profile Photo

We're building responsible AI evaluation systems at Apple. Want to help? Seeking a Senior Research Data Scientist (contract) in SEA/NYC/SD/Bay Area with data science, RAI, and human research expertise. Python + LLM experience required. Must be available now. DM resume + fit.

Joe Hoover (@joeehoover)'s Twitter Profile Photo

SOTA LLM agents trained on just 72 tasks, but not with GRPO 👀
- No value network
- No reward norms
- Beats o1
- Great discussion of why this worked in a small-data regime
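
For readers unfamiliar with the setup: with no value network and no reward normalization, the update reduces to a plain REINFORCE-style policy gradient on raw returns. A sketch of that objective (my formulation, not the paper's exact recipe):

```python
import torch

def reinforce_loss(logprobs: torch.Tensor, returns: torch.Tensor) -> torch.Tensor:
    """Policy-gradient objective with no learned baseline and no reward
    normalization (illustrative, not the paper's exact recipe).

    logprobs: [batch] summed token log-probs of each sampled rollout
    returns:  [batch] raw, unnormalized episode returns
    """
    return -(logprobs * returns.detach()).mean()

# Usage: sample rollouts from the policy, sum their token log-probs,
# score them, then backprop through reinforce_loss(logprobs, returns).
```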

Zichen Liu @ ICLR2025 (@zzlccc)'s Twitter Profile Photo

🚨There May Not be Aha Moment in R1-Zero-like Training: oatllm.notion.site/oat-zero A common belief about the recent R1-Zero-like training is that self-reflections *emerge* as a result of RL training. We carefully investigated and showed the opposite. 🧵
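
One cheap way to probe the claim yourself: look for reflective phrases in *base-model* samples, before any RL. A toy version of that probe (the cue list is illustrative, not the authors' exact set):

```python
# Count "aha"-style phrases in pre-RL samples; the cues are illustrative.
REFLECTION_CUES = ["wait,", "let me re-check", "on second thought", "i made a mistake"]

def has_self_reflection(sample: str) -> bool:
    text = sample.lower()
    return any(cue in text for cue in REFLECTION_CUES)

samples = [
    "The answer is 12. Wait, let me re-check the arithmetic...",
    "Compute 3 * 4 = 12, so the answer is 12.",
]
rate = sum(map(has_self_reflection, samples)) / len(samples)
print(f"self-reflection rate: {rate:.0%}")  # 50% on this toy pair
```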

Joe Hoover (@joeehoover)'s Twitter Profile Photo

"on the majority of the *original* benchmarks, over 50% of 'model errors' are actually caused by label noise!" 🥺 Doing the lord's work 🙏🙏