Patrick Devaney (@patrickbdevaney) Twitter Tweets • TwiCopy

good girl

@goodgirlxsz

5 hours ago

🔥Telegram İfşa

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Saurabh Kumar

@drummatick

a year ago

Literally a beast of a book. Emphasizes heavily on code and modern deep learning architectures Important concepts are highlighted so it’s easier to understand and focus.

thumb_up_off_alt661

chat_bubble_outline7

repeat63

shareShare

v0

@v0

a year ago

v0 can now: • Create and run full-stack Next.js and React applications • Create multiple files in one generation • Link and deploy to Vercel projects • Use Vercel project environment variables

thumb_up_off_alt3,3K

chat_bubble_outline157

repeat304

shareShare

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8

a year ago

REDUCIO! Generating 1024×1024 Video within 16 Seconds using Extremely Compressed Motion Latents code: github.com/microsoft/Redu… paper: arxiv.org/abs/2411.13552

thumb_up_off_alt144

chat_bubble_outline0

repeat34

shareShare

The Algorithm Design Manual - Practical approach - Real-world examples - Problem-solving strategies - Good book for someone trying to understand algorithms - It will require some understanding of any language. - Resources: github.com/mohitmishra786…

thumb_up_off_alt1,1K

chat_bubble_outline1

repeat166

shareShare

Maxime Labonne

@maximelabonne

a year ago

📈 The State of Generative AI in the Enterprise Interesting report from Menlo Ventures that shows the evolution of Gen AI in companies from 2023 to 2024: • Uses cases: Code generation, chatbots, search, data, and meeting summarization are the top generative AI use cases in

thumb_up_off_alt171

chat_bubble_outline9

repeat40

shareShare

Min Choi

@minchoi

a year ago

Less than 48 hours ago, DeepSeek AI from China just dropped their AI reasoning model. And it's on par with OpenAI o1-preview. Major shift. 10 examples (and how to try):

thumb_up_off_alt2,2K

chat_bubble_outline79

repeat273

shareShare

Alex Xu

@alexxubyte

a year ago

Linux Boot Process Explained.

thumb_up_off_alt1,1K

chat_bubble_outline11

repeat224

shareShare

Andrew Ng

@andrewyng

a year ago

A small number of people are posting text online that’s intended for direct consumption not by humans, but by LLMs (large language models). I find this a fascinating trend, particularly when writers are incentivized to help LLM providers better serve their users! People who post

thumb_up_off_alt764

chat_bubble_outline53

repeat144

shareShare

Eric Ciarla (hiring)

@ericciarla

a year ago

Introducing llms.txt Generator ✨ You can now concatenate any website into a single text file that can be fed into any LLM. We crawl the whole website with Firecrawl and extract data with gpt-4o-mini. Create your own llms.txt at llmstxt.firecrawl.dev!

thumb_up_off_alt2,2K

chat_bubble_outline67

repeat267

shareShare

Unsloth AI

@unslothai

a year ago

You can finetune Llama-3.2-Vision-11B for free on Colab now! Unsloth finetunes VLMs 2x faster, with 50% less VRAM, 6x longer context - with no accuracy loss. Documentation: docs.unsloth.ai GitHub: github.com/unslothai/unsl… Finetuning Colab: colab.research.google.com/drive/1j0N4XTY…

thumb_up_off_alt1,1K

chat_bubble_outline12

repeat276

shareShare

Daniel Han

@danielhanchen

a year ago

Vision finetuning is finally in🦥Unsloth AI! It took a while, but Llama 3.2 Vision, Pixtral, Qwen2 VL & all Llava variants now work! 1. QLoRA / LoRA is 1.3x to 2x faster for each 2. 30-70% less VRAM usage 3. 3 examples - Radiography, LaTeX, Q&A Extra stuff: 1. Pixtral chat

Vision finetuning is finally in🦥<a href="/UnslothAI/">Unsloth AI</a>! It took a while, but Llama 3.2 Vision, Pixtral, Qwen2 VL & all Llava variants now work!

1. QLoRA / LoRA is 1.3x to 2x faster for each
2. 30-70% less VRAM usage
3. 3 examples - Radiography, LaTeX, Q&A

Extra stuff:
1. Pixtral chat

thumb_up_off_alt577

chat_bubble_outline11

repeat106

shareShare

Yaroslav Bulatov

@yaroslavvb

a year ago

Anyone who thinks you need 100k GPUs to make progress should watch Hannaneh Hajishirzi COLM keynote. Molmo appeared to beat Llama 3.2 in quality with same release day, all open-science on a 1k GPU cluster youtube.com/watch?v=qMTzor…

thumb_up_off_alt382

chat_bubble_outline13

repeat42

shareShare

vanito

@vaniipillai

a year ago

the best part of the book fair hehe

thumb_up_off_alt1,1K

chat_bubble_outline58

repeat74

shareShare

swarms

@swarms_corp

a year ago

Introducing an all-new suite of tools built on swarms - the production-grade framework for autonomous agent swarms ⎆ Documentation Intelligence ⎆ Cross-language Compilation ⎆ Multi-agent Architecture ⎆ Financial Enterprise Solutions Here's what our lead developer

thumb_up_off_alt60

chat_bubble_outline3

repeat20

shareShare

DailyPapers

@huggingpapers

6 months ago

Distilling LLM Agents! 🧪 New work shows how to transfer the reasoning & task-solving power of large language model agents into smaller, more efficient models by cloning their tool-using behavior with retrieval and code!

thumb_up_off_alt566

chat_bubble_outline5

repeat81

shareShare

Naksh Jain

@nakshsonigara

6 months ago

Fractal, an Indian AI company, dropped Fathom-R1-14B open-source reasoning model that achieves performance comparable to o4-mini on math benchmarks within a 16K context window, trained for just $499. Built on top of DeepSeek-R1-Distill-Qwen-14B, It beats o3-mini-low.

thumb_up_off_alt722

chat_bubble_outline21

repeat77

shareShare

Miami AI Hub

@miamiaihub

6 months ago

🚨 Speaker Alert! 🚨 We’re kicking off Le Robot Hackathon Miami (June 14-15) with an amazing panel featuring clem 🤗, Co-Founder & CEO of Hugging Face. Clem turned open-source AI into a global movement—now he’s jetting to the 305 to talk robotics, community, and why the

🚨 Speaker Alert! 🚨
We’re kicking off Le Robot Hackathon Miami (June 14-15) with an amazing panel featuring <a href="/ClementDelangue/">clem 🤗</a>, Co-Founder & CEO of <a href="/huggingface/">Hugging Face</a>. Clem turned open-source AI into a global movement—now he’s jetting to the 305 to talk robotics, community, and why the

thumb_up_off_alt33

chat_bubble_outline5

repeat12

shareShare

Mustafa Shukor

@mustafashukor1

6 months ago

The Worldwide LeRobot hackathon is in 2 weeks, and we have been cooking something for you… Introducing SmolVLA, a Vision-Language-Action model with light-weight architecture, pretrained on community datasets, with an asynchronous inference stack, to control robots🧵

The Worldwide <a href="/LeRobotHF/">LeRobot</a> hackathon is in 2 weeks, and we have been cooking something for you…
Introducing SmolVLA, a Vision-Language-Action model with light-weight architecture, pretrained on community datasets, with an asynchronous inference stack, to control robots🧵

thumb_up_off_alt429

chat_bubble_outline6

repeat79

shareShare

merve

@mervenoyann

6 months ago

H Company released Holo-1: 3B and 7B GUI Action Vision Language Models for various web and computer agent tasks 🤗 Holo-1 has Apache 2.0 license and Hugging Face transformers support 🔥 more details in their blog post (next ⤵️)

H Company released Holo-1: 3B and 7B GUI Action Vision Language Models for various web and computer agent tasks 🤗

Holo-1 has Apache 2.0 license and <a href="/huggingface/">Hugging Face</a> transformers support 🔥
more details in their blog post (next ⤵️)

thumb_up_off_alt241

chat_bubble_outline10

repeat38

shareShare

Sam Rodriques

@sgrodriques

6 months ago

Today we are releasing ether0, our first scientific reasoning model. We trained Mistral 24B with RL on several molecular design tasks in chemistry. Remarkably, we found that LLMs can learn some scientific tasks more much data-efficiently than specialized models trained from

thumb_up_off_alt363

chat_bubble_outline9

repeat69

shareShare

Patrick Devaney

good girl

Saurabh Kumar

v0

𝚐𝔪𝟾𝚡𝚡𝟾

Mohit Mishra

Maxime Labonne

Min Choi

Alex Xu

Andrew Ng

Eric Ciarla (hiring)

Unsloth AI

Daniel Han

Yaroslav Bulatov

vanito

swarms

DailyPapers

Naksh Jain

Miami AI Hub

Mustafa Shukor

merve

Sam Rodriques