Omar Sanseviero (@osanseviero) Twitter Tweets • TwiCopy

Omar Sanseviero

@osanseviero

+ Follow

Chief Llama Officer @huggingface 🦙

Founder @AI_Learners.
Xoogler (SWE @Google Assistant, 20% PM TF Graphics).
100% Hacker Llama🇵🇪🇲🇽

ID: 207744565

linkhttps://osanseviero.github.io/hackerllama/ calendar_today25-10-2010 23:29:03

9,9K Tweet

35,35K Followers

2,2K Following

vLLM

@vllm_project

16 days ago

We are excited to see vLLM as an option for local apps in the Hugging Face hub! It comes with easy snippets to quickly test out the model.

We are excited to see <a href="/vllm_project/">vLLM</a> as an option for local apps in the <a href="/huggingface/">Hugging Face</a> hub! It comes with easy snippets to quickly test out the model.

thumb_up_off_alt28

chat_bubble_outline2

repeat5

shareShare

apolinario 🌐

@multimodalart

15 days ago

It's now so easy add images to the gallery of your LoRA on Hugging Face 🤯 🪄 ① Generate an image with the Widget 🖼️ ② Press "Add to model card gallery" 🔥

We are announcing Llama-3.1-SuperNova, a Llama-3.1-70B-Instruct model offline distilled from Llama-3.1-405B-Instruct. It's ridiculously strong, particularly in instruction following and math. It's available to play with at supernova.arcee.ai. Read more about the model and

Omar Sanseviero

@osanseviero

14 days ago

Here we go again 🤗 Here is a step-by-step guide on how to upload 12B+ models to Hugging Face Step 1. pip install huggingface_hub Step 2. huggingface-cli upload-folder <repo-id> <local-path> --repo-type=model That's it. Enjoy fast download speeds!

thumb_up_off_alt194

chat_bubble_outline13

repeat7

shareShare

Vaibhav (VB) Srivastav

@reach_vb

14 days ago

Mistral released Pixtral 12B Vision Language Model 🔥 Some notes on the release: 1. Text backbone: Mistral Nemo 12B 2. Vision Adapter: 400M 3. Uses GeLU (for vision adapter) & 2D RoPE (for vision encoder) 4. Larger vocabulary - 131,072 5. Three new special tokens - `img`,

thumb_up_off_alt779

chat_bubble_outline13

repeat162

shareShare

Omar Sanseviero

@osanseviero

14 days ago

LLaMA-Omni, a new model for speech interaction 🦙Based on Llama 3.1 8B Instruct ⚡️Low-latency speech 🚀Simultaneous text and speech generation 🤏Trained with 4 GPUs in less than 3 days Model: hf.co/ICTNLP/Llama-3… Paper: hf.co/papers/2409.06…

thumb_up_off_alt867

chat_bubble_outline14

repeat146

shareShare

Omar Sanseviero

@osanseviero

14 days ago

Are you tired of having to download datasets to explore them? You can now run SQL directly in the Hugging Face Hub dataset viewer 🤗As an example, check out positive samples in the IMDB dataset huggingface.co/datasets/stanf…

Are you tired of having to download datasets to explore them?

You can now run SQL directly in the <a href="/huggingface/">Hugging Face</a> Hub dataset viewer 🤗As an example, check out positive samples in the IMDB dataset

huggingface.co/datasets/stanf…

Omar Sanseviero

@osanseviero

13 days ago

Google releases DataGemma 👀Challenge hallucinations by using real data from Data Commons 🧠Gemma 2 fine-tuned for RAG and Retrieval Interleaved Generation (RIG) Models: huggingface.co/collections/go… Blog: blog.google/technology/ai/…

Omar Sanseviero

@osanseviero

12 days ago

Can LLMs Generate Novel Research Ideas? 🤔 🏫Recruit 100 NLP researchers to write ideas 🫣Blind reviews of both LLMs and human ideas Result: LLM ideas are rated more novel but less feasible Nice in-depth write-up in their paper! huggingface.co/papers/2409.04…

thumb_up_off_alt13

chat_bubble_outline1

repeat0

shareShare

Omar Sanseviero

@osanseviero

12 days ago

This is how the Hugging Face team is preparing for the PyTorch Conference next week🤗 See you soon and come to our party for some nice swag!

thumb_up_off_alt179

chat_bubble_outline12

repeat14

shareShare

Pedro Cuenca

@pcuenq

12 days ago

Announcing SAM 2 Studio and Core ML Segment Anything 2! I'm super excited about on-device ML, and firmly believe that it will be a big part of the future of AI. We converted Segment Anything 2 to Core ML and wrote a demo app that you can use on your Mac 🔥

Omar Sanseviero

@osanseviero

12 days ago

Introducing SAM 2 Studio ⚡️Fully local image segmentation app 👀Privacy-first: nothing sent to a server ⭐️Using Segment Anything 2 + CoreML Models: huggingface.co/collections/ap… Download app: huggingface.co/coreml-project… GH repo: github.com/huggingface/sa…

Ying Shan

@yshan2u

11 days ago

A quick update that we have release the inference code and model for DepthCrafter. Code: github.com/Tencent/DepthC… Model: huggingface.co/tencent/DepthC… Project page: depthcrafter.github.io

Martin Görner

@martin_gorner

9 days ago

Personal update: I'h joining Hugging Face today!

thumb_up_off_alt895

chat_bubble_outline75

repeat21

shareShare

Daniel Vila Suero

@dvilasuero

9 days ago

🧶 Introducing DataCraft: build synthetic datasets using natural language! Creating good quality synthetic data is difficult. It’s a trial and error process and requires a lot of tricks. DataCraft puts Argilla's dataset generation best practices in your hands within a no

thumb_up_off_alt27

chat_bubble_outline2

repeat9

shareShare

Wauplin

@wauplin

9 days ago

I'm thrilled to unveil our revamped Inference API docs! We've tackled your feedback head-on: clearer rate limits, dedicated PRO section, better code examples, and detailed parameter lists for each task. Deploying AI made simple. Dive in: huggingface.co/docs/api-infer…

Vaibhav (VB) Srivastav

@reach_vb

9 days ago

Segment Anything 2 (SAM 2) by AI at Meta running 100% on-device powered by Apple CoreML! ⚡ Takes fraction of a second to run inference on Mac or iPhone! > Apache licensed optimised model checkpoints - tiny, small, base ad large! > Open source application to annote any image in

Omar Sanseviero

@osanseviero

9 days ago

How much memory do you need to use a model? 🤗 Powered by EleutherAI Cookbook, you can now try this demo to - Estimate memory to infer or train a LLM - Compute the parameters count - Theoretical FLOPs for training huggingface.co/spaces/derek-t…

thumb_up_off_alt27

chat_bubble_outline3

repeat5

shareShare