
Federico Bianchi
@federicobianchy
Senior ML Scientist at TogetherAI. Prev. @EvidenceOpen and @StanfordNLP. Capybaras. (he/him).
ID: 2332157006
https://federicobianchi.io 07-02-2014 17:12:24
790 Tweet
1,1K Followers
756 Following












Can LLMs predict the future? In FutureBench, friends from Together AI create new questions from evolving news & markets: As time passes, we'll see which agents are the best at predicting events that have yet to happen! 🔮 Also cool: by design, dynamic & uncontaminated eval


Most AI benchmarks test the past. But real intelligence is about predicting the future. Introducing FutureBench — a new benchmark for evaluating agents on real forecasting tasks that we developed with Hugging Face 🔍 Reasoning > memorization 📊 Real-world events 🧠 Dynamic,


🔮Exciting new benchmark testing how well AI predicts the future! Each week, we curate news + prediction markets for questions about next week. Then we have agents make forecasts. Requires advanced research + reasoning Together AI Hugging Face 📜together.ai/blog/futureben… 🌐


