Goku Mohandas
@gokumohandas
ml, bio, art, tennis, travel
ID: 3259586191
29-06-2015 04:21:48
954 Tweet
14,14K Followers
117 Following
The definitive guide to RAG in production! 🙏 Goku Mohandas walks us through implementing RAG from scratch, building a scalable app It now has updated discussion on embedding fine-tuning, re-ranking and effectively routing requests I think this is easily the most complete
I’ve read dozens of articles on building RAG-based LLM Applications, and this one by Goku Mohandas and Philipp Moritz from Anyscale is the best by far. If you’re curious about RAG, do yourself a favor by studying this. It will bring you up to speed 🔥 anyscale.com/blog/a-compreh…
An #OpenSource Stack for #AI Compute: Kubernetes + ray + PyTorch + vLLM ➡️ This Anyscale blog post by Robert Nishihara describes a snapshot of that emerging stack based on experience working with Ray users + case studies from Pinterest, Uber, Roblox, and