Christy Bergman (@cbergman) 's Twitter Profile
Christy Bergman

@cbergman

AI DevAdvocate. I blog about AI, Machine Learning and wine. Tweets are my own

ID: 14204688

linkhttps://github.com/christy calendar_today24-03-2008 01:18:38

349 Tweet

374 Followers

245 Following

Towards Data Science (@tdatascience) 's Twitter Profile Photo

Learn how to adopt RAG best practices by incorporating evaluations into your pipeline: Christy Bergman covers the ins and outs of optimizing chunkings, embeddings, and more. buff.ly/3LjysTk

Christy Bergman (@cbergman) 's Twitter Profile Photo

Had a blast creating this #MultiModal tutorial/demo! Used fun mix of tools for awesome results! 💡✨ 🖼️Milvus for the #VectorDatabase 🧠 a tiny Clip #EmbeddingModel by Ash Vardanian 🤖 @chatgpt4o as the #LLM. Check it out! ➡️ github.com/christy/Zilliz… #AI #MachineLearning

Christy Bergman (@cbergman) 's Twitter Profile Photo

Thanks Towards Data Science for the reshare! Iterate to find the best RAG combinations by: Changing the Chunking Strategy 📦 Changing the Embedding Model 📷 Changing the LLM Model 📷 I made a video youtube.com/watch?v=BzZLyP…… Thanks to Greg Kamradt for the original chunking article!

Zilliz (@zilliz_universe) 's Twitter Profile Photo

Monday Meetup is right around the corner! 🗣 Join us in SF on August 5 for exciting talks: 🔢 Using Ray Data for Multimodal Embedding Inference with Christy Bergman 📐 A Different Angle: Retrieval Optimized Embedding Models Marqo 🛠 Building the Future of Neural Search: How to

Monday Meetup is right around the corner! 🗣 Join us in SF on August 5 for exciting talks: 
🔢 Using Ray Data for Multimodal Embedding Inference with <a href="/cbergman/">Christy Bergman</a> 
📐 A Different Angle: Retrieval Optimized Embedding Models <a href="/marqo_ai/">Marqo</a> 
🛠 Building the Future of Neural Search: How to
The AI Conference (@aiconference) 's Twitter Profile Photo

🌟Join our expert panel at The AI Conference 2024 to explore advanced RAG (Retrieval-Augmented Generation) techniques. Learn how integrating information retrieval with generative models is revolutionizing AI, making it more contextually rich and useful in real-world

🌟Join our expert panel at The AI Conference 2024 to explore advanced RAG (Retrieval-Augmented Generation) techniques.

Learn how integrating information retrieval with generative models is revolutionizing AI, making it more contextually rich and useful in real-world
Christy Bergman (@cbergman) 's Twitter Profile Photo

Interesting take-down how to do LoRA properly, quickly, with less memory, on all layers Daniel Han's tweet and blog unsloth.ai/blog/contpretr… ! > For continued pretraining, I advise people to train on all layers (inc gate) + lm_head, embed_tokens, use RS LoRA, use rank>=256

swyx (@swyx) 's Twitter Profile Photo

CUDA MODE hackathon today! Here's Andrej Karpathy on the 🏖️ origin story of llm.c, and what it hints at for the fast, simple, llm-compiled future of custom software.

Christy Bergman (@cbergman) 's Twitter Profile Photo

Interesting! The most common inference quantization int8/fp8 is not necessarily the best. bf16 #quantization is a way better accuracy/latency tradeoff.

Sam Altman (@sama) 's Twitter Profile Photo

GPT-4.5 is ready! good news: it is the first model that feels like talking to a thoughtful person to me. i have had several moments where i've sat back in my chair and been astonished at getting actually good advice from an AI. bad news: it is a giant, expensive model. we

DeepSeek (@deepseek_ai) 's Twitter Profile Photo

🚀 Day 5 of #OpenSourceWeek: 3FS, Thruster for All DeepSeek Data Access Fire-Flyer File System (3FS) - a parallel file system that utilizes the full bandwidth of modern SSDs and RDMA networks. ⚡ 6.6 TiB/s aggregate read throughput in a 180-node cluster ⚡ 3.66 TiB/min

Towards Data Science (@tdatascience) 's Twitter Profile Photo

Thankfully Christy Bergman's article can help you identify key convos with an AI hack to perform semantic clustering simply by prompting LLMs! towardsdatascience.com/tutorial-seman…

Christy Bergman (@cbergman) 's Twitter Profile Photo

Don't🍷about #OOM running out of memory! Hugging Face is making it easier to run huge #TransformerandDiffuser models on consumer GPUs w quantization, tensor parallelism, offloading. Hear from Steven Liu how to fit these models on your setup. lu.ma/taf3lmvj #HuggingFace

Don't🍷about #OOM running out of memory!
<a href="/huggingface/">Hugging Face</a> is making it easier to run huge #TransformerandDiffuser models on consumer GPUs w quantization, tensor parallelism, offloading. Hear from <a href="/stevhliu/">Steven Liu</a> how to fit these models on your setup. lu.ma/taf3lmvj #HuggingFace
Christy Bergman (@cbergman) 's Twitter Profile Photo

💓Andrew Ng Note to self: look here before next CFP submission or helping others. Ask the model to summarize best advice per conference CFP rules and topic submitter wants to talk about...