Omar Sanseviero (@osanseviero) 's Twitter Profile
Omar Sanseviero

@osanseviero

Making ML go brr at Google

ex-Chief Llama Officer @huggingface 🦙
Founder @AI_Learners.
100% Hacker Llama🇵🇪🇲🇽

ID: 207744565

linkhttps://osanseviero.github.io/hackerllama/ calendar_today25-10-2010 23:29:03

10,10K Tweet

44,44K Followers

2,2K Following

Mark McD ☠ (@m4rkmc) 's Twitter Profile Photo

🎬 Generate videos with the Gemini CLI Add: 🧑‍💻 GenMedia MCP servers for Imagen, Veo & Chirp 📝 A GEMINI܂md file explaining your ✨ creative process And you too can take 🙀 Rusty the Cat on an adventure ⬇️ Full tutorial in the vid ⬇️

Prince Canuma (@prince_canuma) 's Twitter Profile Photo

MLX-VLM v0.3.0 is here! And it brings a lot of significant improvements 🔥📷 What’s new: - KV Cache quantization - Mixed quantization (i.e, 4bit + 6bit) - Add support for audio modality in server - Gemma3n: Fixed embeddings, Vision, pixel casting, multi-task (audio + vision)

MLX-VLM v0.3.0 is here!

And it brings a lot of significant improvements 🔥📷

What’s new: 
- KV Cache quantization
- Mixed quantization (i.e, 4bit + 6bit)
- Add support for audio modality in server
- Gemma3n: Fixed embeddings, Vision, pixel casting, multi-task (audio + vision)
Philipp Schmid (@_philschmid) 's Twitter Profile Photo

Gemini API now supports Batch Mode with 50% cost savings! Submit large jobs and retrieve your results within 24 hours at a 50% discount. 🚀 - Process large batches at 50% of the standard API cost. - Receive results within a 24-hour window. - Supports built-in tools like Google

Gemini API now supports Batch Mode with 50% cost savings! Submit large jobs and retrieve your results within 24 hours at a 50% discount. 🚀

- Process large batches at 50% of the standard API cost.
- Receive results within a 24-hour window.
- Supports built-in tools like Google
Omar Sanseviero (@osanseviero) 's Twitter Profile Photo

Introducing MatFormer Lab for Gemma 3n 🧑‍🔬 Use Mix-n-Match to slice the E4B and create a model with a custom size between 2B and 4B effective parameters Explore the quality-size trade-off and share your models with the community Try it out: goo.gle/gemma3n-matfor…

Introducing MatFormer Lab for Gemma 3n 🧑‍🔬

Use Mix-n-Match to slice the E4B and create a model with a custom size between 2B and 4B effective parameters

Explore the quality-size trade-off and share your models with the community

Try it out: goo.gle/gemma3n-matfor…
Omar Sanseviero (@osanseviero) 's Twitter Profile Photo

Introducing T5Gemma: the next generation of encoder-decoder/T5 models! 🔧Decoder models adapted to be encoder-decoder 🔥32 models with different combinations 🤗Available in Hugging Face and Kaggle developers.googleblog.com/en/t5gemma

Introducing T5Gemma: the next generation of encoder-decoder/T5 models!

🔧Decoder models adapted to be encoder-decoder
🔥32 models with different combinations
🤗Available in Hugging Face and Kaggle

developers.googleblog.com/en/t5gemma
Philipp Schmid (@_philschmid) 's Twitter Profile Photo

New Agent Example! Turn any research question into a data visualization, automatically using Gemini 2.5 Pro and CAMEL-AI.org's OWL framework. Exciting collaboration! 🚀 🔍 Performs live web research using search engines and browser use. 🐍 Autonomously writes and executes Python

Patrick Loeber (@patloeber) 's Twitter Profile Photo

Excited to introduce GenAI Processors! An Open-Source Python library from Google DeepMind that allows you to build asynchronous and composable AI Pipelines for Generative AI

Excited to introduce GenAI Processors!

An Open-Source Python library from <a href="/GoogleDeepMind/">Google DeepMind</a> that allows you to build asynchronous and composable AI Pipelines for Generative AI
Omar Sanseviero (@osanseviero) 's Twitter Profile Photo

Introducing GenAI Processors ✨ An open source library to build real-time projects easily, with cool features such as stream-based I/O and chaining, modularity, composability, and more GitHub: github.com/google-gemini/… Blog: developers.googleblog.com/en/genai-proce…

Introducing GenAI Processors ✨

An open source library to build real-time projects easily, with cool features such as stream-based I/O and chaining, modularity, composability, and more

GitHub: github.com/google-gemini/…
Blog: developers.googleblog.com/en/genai-proce…
Omar Sanseviero (@osanseviero) 's Twitter Profile Photo

MedSigLIP: create embeddings for medical images and text - 400M text + 400M vision encoder - Useful for classification, semantic image retrieval, and more -Trained with chest X-rays, CT slices, MRI slices, dermatology images, and more. huggingface.co/google/medsigl…

Google AI Developers (@googleaidevs) 's Twitter Profile Photo

Walk the fashion runway with AI in this project from Nitin Tiwari and Margaret M.. Sketch2Runway uses Gemini 2.0 Flash and Veo 3 to enable all levels of fashion designers to transform fashion sketches into runway videos ↓ margaretmz.medium.com/fashion-sketch…

Omar Sanseviero (@osanseviero) 's Twitter Profile Photo

Next week we're doing an Open Models Meetup in Bangalore and we're looking for speakers! Speakers include team members from Google DeepMind, so you'll hear about Gemma, synthetic data, and architectural evolutions. See you there! Call for Speakers👉 forms.gle/zrWN95Cspmtby4…

Logan Kilpatrick (@officiallogank) 's Twitter Profile Photo

Today we are rolling out our first Gemini Embedding model, which ranks #1 on the MTEB leaderboard, as a generally available stable model. It is priced at $0.15 per million tokens and ready for at scale production use!

Today we are rolling out our first Gemini Embedding model, which ranks #1 on the MTEB leaderboard, as a generally available stable model. It is priced at $0.15 per million tokens and ready for at scale production use!
Tuana (@tuanacelik) 's Twitter Profile Photo

A new walkthrough for a research agent using LlamaIndex 🦙 and Google Gemini, fresh out the oven 🥨 Given a topic: 🌎 Use Gemini 2.5 pro with its server side google search tool 📝 Create an agent that takes notes as it gets results from its websearch 👀 Create other agents that

A new walkthrough for a research agent using <a href="/llama_index/">LlamaIndex 🦙</a> and <a href="/Google/">Google</a> Gemini, fresh out the oven 🥨 Given a topic:

🌎 Use Gemini 2.5 pro with its server side google search tool
📝 Create an agent that takes notes as it gets results from its websearch
👀 Create other agents that
Mark McD ☠ (@m4rkmc) 's Twitter Profile Photo

📣 Gemini CLI roadmap It has been so cool seeing everyone building with the Gemini CLI (60k ⭐s!), and sharing feedback (1k open issues 😬) To make things even more transparent the team has made the 🗺️ roadmap public. Take a look and tell us what you think:

merve (@mervenoyann) 's Twitter Profile Photo

Fine-tune Gemma3n on videos with audios inside with Colab A100 🔥 Just dropped the notebook where you can learn how to fine-tune Gemma3n on images+audio+text at the same time!

Fine-tune Gemma3n on videos with audios inside with Colab A100 🔥

Just dropped the notebook where you can learn how to fine-tune Gemma3n on images+audio+text at the same time!