Federico Bianchi (@federicobianchy) Twitter Tweets • TwiCopy

Together AI

7 months ago

Big news for developers! We’re launching two new products to bring code execution & dev environments to AI apps:
📦 Together Code Sandbox
⚡ Together Code Interpreter
Now you can run LLM-generated code in secure, scalable, fully managed environments. Details below 👇

55 likes · 3 replies · 12 reposts

Together AI

@togethercompute

7 months ago

Announcing our latest example app to show off how to do structured OCR on Together AI! An app that lets you easily split your restaurant bill, powered by Llama 4 vision w/ JSON mode on Together. Try it today -> usebillsplit.com

15 likes · 5 replies · 4 reposts

Together AI

@togethercompute

7 months ago

Introducing the Open Deep Research CLI! Generate research reports directly from your terminal with `npx open-deep-research`.

111 likes · 11 replies · 13 reposts

Hassan

@nutlope

7 months ago

Introducing the Open Deep Research CLI! Generate research reports directly from your terminal with "npx open-deep-research".

49 likes · 3 replies · 1 repost

Together AI

@togethercompute

6 months ago

Data Science Agent – implemented from scratch in code! In this notebook, we share our Data Science Agent implementation & go over how it works step-by-step. We use an iterative ReAct workflow capable of reasoning, code execution, and self-correction. Link below!🔽

51 likes · 3 replies · 7 reposts
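
The iterative ReAct workflow mentioned in the tweet above (reason, execute code, self-correct) can be illustrated with a minimal Python sketch. This is not the notebook's implementation: `scripted_model`, `run_code`, and `react_loop` are hypothetical names, the "model" is a hard-coded stand-in for an LLM call, and in-process `exec` is used only for illustration and is not a secure sandbox.

```python
import contextlib
import io
import traceback


def run_code(code: str, namespace: dict) -> str:
    """Execute code and return captured stdout, or an error line, as the observation."""
    buffer = io.StringIO()
    try:
        with contextlib.redirect_stdout(buffer):
            exec(code, namespace)  # illustration only; NOT a secure sandbox
    except Exception:
        # Keep only the final traceback line (exception type and message).
        return "ERROR: " + traceback.format_exc().strip().splitlines()[-1]
    return buffer.getvalue().strip()


def scripted_model(history: list) -> dict:
    """Hypothetical stand-in for an LLM: chooses the next step from the history."""
    last_observation = history[-1]["observation"] if history else None
    if last_observation is None:
        # First attempt contains a deliberate bug: `values` is undefined.
        return {"thought": "Compute the mean.",
                "code": "print(sum(values) / len(values))"}
    if last_observation.startswith("ERROR"):
        # Self-correction: define the missing data, then retry.
        return {"thought": "Define the data first.",
                "code": "values = [2, 4, 6]\nprint(sum(values) / len(values))"}
    return {"thought": "Done.", "answer": last_observation}


def react_loop(model, max_steps: int = 5):
    """Alternate reasoning and code execution until the model returns an answer."""
    history = []
    namespace = {}  # state shared across executed snippets
    for _ in range(max_steps):
        step = model(history)
        if "answer" in step:
            return step["answer"], history
        observation = run_code(step["code"], namespace)
        history.append({**step, "observation": observation})
    return None, history


answer, history = react_loop(scripted_model)
```

In this sketch the first step fails, the error text is fed back as the observation, and the second step repairs it. A real agent would replace `scripted_model` with an LLM call and `run_code` with an isolated execution environment.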

James Zou

@james_y_zou

6 months ago

Fully open source and capable Data Science Agent that you can run in Colab! Great job by Federico Bianchi Zain Shang Zhu 👏 github.com/togethercomput…

16 likes · 0 replies · 4 reposts

Together AI

@togethercompute

6 months ago

1/ We built an open-source AI agent that can reason like a data scientist. It loads data, writes Python code, retrains when models fail, and solves real Kaggle + DABStep tasks. Here’s how we did it (and how you can too): 👇

189 likes · 8 replies · 28 reposts

James Zou

@james_y_zou

6 months ago

Excited to introduce Open Data Scientist:
✅ outperforms Gemini data science agent
✅ solves real Kaggle tasks
✅ fully open source, easy to adapt
✅ sandbox for safe exec
Step-by-step tutorial on building our agent: together.ai/blog/building-…
Great job Federico Bianchi, Shang Zhu

83 likes · 1 reply · 9 reposts

Together AI

@togethercompute

6 months ago

Introducing the Open Deep Research app! Generate detailed reports on any topic with open source LLMs. Free & fully open source. We’re releasing everything: evaluation dataset, code, app, and blog.🔥

280 likes · 9 replies · 43 reposts

Together AI

@togethercompute

5 months ago

Our Open Data Scientist agent is now ranked on the DABStep data analysis leaderboard! We released everything, so you can try it yourself! 🔥
• Full codebase
• Detailed workflow recipe
• Benchmarks
See how we created it. 🧵

56 likes · 2 replies · 6 reposts

James Zou

@james_y_zou

5 months ago

📢New conference where AI is the primary author and reviewer! agents4science.stanford.edu Current venues don't allow AI-written papers, so it's hard to assess the +/- of such works🤔 #Agents4Science solicits papers where AI is the main author w/ human advisors. 💡Initial reviews by

425 likes · 16 replies · 103 reposts

Clémentine Fourrier 🍊

@clefourrier

5 months ago

Can LLMs predict the future? In FutureBench, friends from Together AI create new questions from evolving news & markets: As time passes, we'll see which agents are the best at predicting events that have yet to happen! 🔮 Also cool: by design, dynamic & uncontaminated eval

35 likes · 2 replies · 8 reposts

Together AI

@togethercompute

5 months ago

Most AI benchmarks test the past. But real intelligence is about predicting the future. Introducing FutureBench — a new benchmark for evaluating agents on real forecasting tasks that we developed with Hugging Face 🔍 Reasoning > memorization 📊 Real-world events 🧠 Dynamic,

89 likes · 5 replies · 17 reposts

James Zou

@james_y_zou

5 months ago

🔮Exciting new benchmark testing how well AI predicts the future! Each week, we curate news + prediction markets for questions about next week. Then we have agents make forecasts. Requires advanced research + reasoning Together AI Hugging Face 📜together.ai/blog/futureben… 🌐

33 likes · 0 replies · 6 reposts

Stanford HAI

@stanfordhai

4 months ago

📢New policy brief: Does labeling AI content affect its persuasiveness? Scholars evaluate the impact of authorship labels on how people perceive AI-written policy messages. hai.stanford.edu/policy/labelin…

51 likes · 2 replies · 18 reposts

Together AI

@togethercompute

4 months ago

🚨 Together AI is proud to sponsor the Best Paper Awards at #Agents4Science, the world's first conference where AI agents author AND review papers! 🤖
💰 3 winners get $10K in compute credits
📅 Submission deadline: Sept 5
⏰ Only 15 days left!

18 likes · 6 replies · 5 reposts

Together AI

@togethercompute

4 months ago

Building AI agents for complex engineering tasks ≠ building chatbots 🧵 Most AI agents today excel at short, simple tasks. But automating multi-day engineering workflows? That’s a whole different game. At Together AI, we learned this the hard way while optimizing LLM

63 likes · 3 replies · 15 reposts