Lucas Grosjean (@xo_lucas_13)'s Twitter Profile
Lucas Grosjean

@xo_lucas_13

Physics/Photonics @EPFLEngineering alum. prev @CNRS, @Google/@TheTeamAtX

ID: 40636970

Link: https://github.com/lucasgrjn · Joined: 17-05-2009 09:58:52

325 Tweets

141 Followers

1.1K Following

Pedro Cuenca (@pcuenq)'s Twitter Profile Photo

This is huge. Apple just released a Core ML conversion tool and inference pipeline for Stable Diffusion. Their code is inspired by diffusers and a pleasure to read. We converted the weights for you in the Hub. Go play with them! huggingface.co/blog/diffusers…

Andrej Karpathy (@karpathy)'s Twitter Profile Photo

Great video on Helion fusion. Few thoughts:
- "no steam turbine" umm SOLD :)
- triggers my hard tech envy for natural sciences, sometimes feel deep learning is not that deep
- how can systems like chatgpt++ help accelerate this kind of work? how "intelligence constrained" is it?

elvis (@omarsar0)'s Twitter Profile Photo

How much can you get out of training a language model on a single consumer GPU in one day? Results attained in constrained setting: decent downstream performance on GLUE. Performance closely follows scaling laws observed in large-compute settings. arxiv.org/abs/2212.14034

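The "cramming" result above is at heart a scaling-law observation: loss under a fixed compute budget still tracks a power law in compute. As an illustrative sketch only (synthetic numbers, not the paper's data; function names are mine), fitting loss = a · C^(−b) reduces to linear regression in log-log space:

```python
import math

def fit_power_law(compute, loss):
    """Fit loss = a * compute**(-b) via least squares in log-log space."""
    xs = [math.log(c) for c in compute]
    ys = [math.log(v) for v in loss]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope  # (a, b)

# Synthetic points lying exactly on loss = 10 * C**(-0.05)
compute = [1e18, 1e19, 1e20, 1e21]
loss = [10 * c ** -0.05 for c in compute]
a, b = fit_power_law(compute, loss)
```

The fit recovers a ≈ 10 and b ≈ 0.05; with real training runs the points scatter around the line, but the paper's finding is that the one-GPU-one-day regime stays on it.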
Bojan Tunguz (@tunguz)'s Twitter Profile Photo

If you think that ChatGPT is awesome, just wait to see ChatZero - a chatbot trained from scratch by having thousands of concurrent instances BS each other for millions of GPU hours.

Tamay Besiroglu (@tamaybes)'s Twitter Profile Photo

Recent applications of deep learning in science and engineering, such as AlphaFold and Copilot, have been astonishing. What does standard economic growth theory say about the economic effects of its adoption in R&D? We sketch a simple picture: arxiv.org/abs/2212.08198
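The "simple picture" the tweet alludes to builds on standard semi-endogenous growth theory. A hedged sketch in generic Jones-style notation (these symbols are the textbook form, not necessarily the paper's own):

```latex
% Idea production function (Jones-style, symbols generic):
%   A       = stock of ideas / technology
%   S       = effective research input (scientists, and now AI tools)
%   \lambda = duplication ("stepping on toes") parameter
%   \phi    = intertemporal knowledge spillovers (\phi < 1 in the
%             semi-endogenous case)
\dot{A} = \alpha \, S^{\lambda} A^{\phi}
```

On this reading, tools like AlphaFold and Copilot either raise effective S or let compute substitute for researcher labor inside S; either channel accelerates the growth of A.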

Jean de La Rochebrochard (@2lr)'s Twitter Profile Photo

Our founders are currently receiving fraudulent emails impersonating the Kima team. 1) I only use the formal "vous" with my grandparents 2) My emails are far more laconic 3) People ask us for money, not the other way around

Michael Black (@michael_j_black)'s Twitter Profile Photo

PhD students, don't worry. Technologies, trends, and even whole fields come and go. A PhD makes you an expert in a field but, more importantly, teaches you how to become an expert. Once you know that you can learn anything, you can adapt to major disruptions in your field.

Jean de La Rochebrochard (@2lr)'s Twitter Profile Photo

Once upon a time, there was a founder. Their vision was grand, but their bank account was empty. Their background was solid, so much so that they could design amazing slides. Their ambition was stellar and reflected beautifully in their pitch. Their fundraising was

Jim Fan (@drjimfan)'s Twitter Profile Photo

What if we set GPT-4 free in Minecraft? ⛏️ I’m excited to announce Voyager, the first lifelong learning agent that plays Minecraft purely in-context. Voyager continuously improves itself by writing, refining, committing, and retrieving *code* from a skill library. GPT-4 unlocks
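The write/refine/commit/retrieve loop can be caricatured in a few lines. This is a toy sketch with invented names, not Voyager's actual API: the real system uses GPT-4 to write and refine skills and embedding similarity for retrieval, where this stand-in uses naive keyword overlap.

```python
class SkillLibrary:
    """Toy stand-in for Voyager's skill library: verified code snippets
    stored under a natural-language description, retrieved by relevance."""

    def __init__(self):
        self.skills = {}  # description -> source code

    def commit(self, description, code):
        """Store (or refine by overwriting) a skill once it is verified."""
        self.skills[description] = code

    def retrieve(self, task):
        """Return the stored skill whose description best matches the task
        (keyword overlap here; embeddings in the real system)."""
        task_words = set(task.lower().split())

        def overlap(desc):
            return len(task_words & set(desc.lower().split()))

        if not self.skills:
            return None
        best = max(self.skills, key=overlap)
        return self.skills[best] if overlap(best) > 0 else None

library = SkillLibrary()
library.commit("mine wood with an axe", "def mine_wood(bot): ...")
library.commit("craft a stone pickaxe", "def craft_stone_pickaxe(bot): ...")
skill = library.retrieve("collect wood near spawn")  # matches the wood skill
```

The key design point the tweet highlights is that skills accumulate as *code*, so later tasks compose earlier verified behaviors instead of re-learning them in-context.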

Jenia Jitsev 🏳️‍🌈 🇺🇦 🇮🇱 (@jjitsev)'s Twitter Profile Photo

(Yet) another tale of Rise and Fall: DeepSeek R1 is claimed to match o1/o1-preview on olympiad level math & coding problems. Can it handle versions of AIW problems that reveal generalization & basic reasoning deficits in SOTA LLMs? (arxiv.org/abs/2406.02061) 🧵1/n

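For context, the AIW ("Alice in Wonderland") problems from arxiv.org/abs/2406.02061 follow a template that is trivial for humans yet trips up SOTA LLMs. A sketch of the basic form (helper names are mine; the template mirrors the paper's canonical version):

```python
def aiw_prompt(n_brothers, n_sisters):
    """Instantiate the basic AIW template with the given family sizes."""
    return (f"Alice has {n_brothers} brothers and she also has "
            f"{n_sisters} sisters. How many sisters does Alice's "
            f"brother have?")

def aiw_answer(n_brothers, n_sisters):
    # Each brother has Alice's sisters as sisters, plus Alice herself.
    return n_sisters + 1

prompt = aiw_prompt(3, 6)
answer = aiw_answer(3, 6)  # 7
```

The paper's point is that varying only the numbers in such a template produces large, erratic swings in model accuracy, which is the generalization deficit the thread probes DeepSeek R1 for.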