Ankur Gupta (@getpy) 's Twitter Profile
Ankur Gupta

@getpy

Tweets on Python, Technology, Software Development, Programming.

ID: 457886651

calendar_today07-01-2012 23:05:23

3,3K Tweet

35,35K Followers

256 Following

Dan Mac (@daniel_mac8) 's Twitter Profile Photo

babe, wake up. a novel benchmark just dropped. Prophet Arena is a leaderboard that ranks AIs ability to predict the actual future.

babe, wake up. a novel benchmark just dropped.

Prophet Arena is a leaderboard that ranks AIs ability to predict the actual future.
Ara (@arafatkatze) 's Twitter Profile Photo

In building AI agents Cline , we've identified three mind viruses Mind Viruses are seductive ideas that sound smart, but don’t work in practice. 1. Multi-Agent Orchestration 2. RAG (Retrieval Augmented Generation) 3. More Instructions = Better Results Let's explore why!

In building AI agents <a href="/cline/">Cline</a> , we've identified three mind viruses Mind Viruses are seductive ideas that sound smart, but don’t work in practice. 
1. Multi-Agent Orchestration
2. RAG (Retrieval Augmented Generation)
3. More Instructions = Better Results
Let's explore why!
Romain Huet (@romainhuet) 's Twitter Profile Photo

The AGENTS.md microsite is live! Think of it as a README for agents: a simple, open format for guiding coding agents. It’s been awesome to work alongside peers from Amp — Research Preview, Cursor, Jules, Factory, and Roo Code to bring this to life!

Asankhaya Sharma (@asankhaya) 's Twitter Profile Photo

1/10🧵 @nousresearch recently published an interesting study showing open models use 2-3x more tokens than closed models for reasoning tasks. We tested whether dynamic token allocation could help address this efficiency gap. Here's what we learned šŸ‘‡

Omar Khattab (@lateinteraction) 's Twitter Profile Photo

Matt Pocock Storm DSPy Hey Matt! DSPy is a declarative programming model. It's sort of like relational databases / SQL. You write what you actually want to say (and want others to read) in English, knowing that it may or may not be the best "prompt" for your LLM. DSPy then gives you the tools to

MLflow (@mlflow) 's Twitter Profile Photo

The new GenAI Evaluation capability available in MLflow 3.3 helps teams measure, improve, and maintain the quality of your GenAI applications throughout their lifecycle from development through production. šŸ™Œ āœ… Run automatic and custom evaluations (scorers & judges) on your

Mike Taylor (@hammer_mt) 's Twitter Profile Photo

Matt Pocock DSPy Bear in mind that I'm a good prompter, I literally wrote a whole book on it for O'Reilly. So most people should just let DSPy handle it. Even if you can beat it why waste a day unless it's a really important prompt.

Ankur Gupta (@getpy) 's Twitter Profile Photo

Frontier model coders be like how can be get coding agents to spend more tokens … find the most cited class in codebase and in reasoning recommend changing it’s name.

Frontier model coders be like how can be get coding agents to spend more tokens … find the most cited class in codebase and in reasoning recommend changing it’s name.
presstab (@presstab_crypto) 's Twitter Profile Photo

Quick demo of JrDev - my coding agent built entirely in Python (backend + Textual frontend) šŸ Such a fun project to build! Free & Open Source Link below šŸ‘‡