Shiro Takagi (@46takagi) 's Twitter Profile
Shiro Takagi

@46takagi

Unktok CEO
artificial scientists & metascience
JP(@takagi_shiro)

ID: 1375396544422191104

https://t46.github.io/
Joined: 26-03-2021 10:37:58

429 Tweets

189 Followers

811 Following

Clément (@clmt) 's Twitter Profile Photo

Gemma 3 is out! 

We are focused on bringing you open models with best capabilities while being fast and easy to deploy:

- 27B lands an ELO of 1338, all the while still fitting on 1 single H100!
- vision support to process mixed image/video/text content
- extended context window
Robert Dadashi (@robdadashi) 's Twitter Profile Photo

Today we are releasing the best open-weights model you can run on a single device, reaching 1339 Elo on LMsys for Gemma 3 27B (aka zizou-10)! Very strong capabilities on math, multilingual, coding, instruction following, function calling!

Pieter Abbeel (@pabbeel) 's Twitter Profile Photo

Founders who were PhD or post-doc in my lab at Berkeley, **largely funded by NSF / DoD grants**, start-up, market cap (collected by OpenAI Deep Research)

arXiv.org (@arxiv) 's Twitter Profile Photo

Match ALREADY completed?! 🤯

Thank you to Hugging Face for this generous match & THANK YOU to everyone who has donated to arXiv on #GivingDay so far!🥳👏🏻🤗

3 MORE HOURS left to give & support #openscience!

givingday.cornell.edu/campaigns/arxiv

(also stay tuned for a super secret mission)
Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

LLM-based review systems lack domain expertise and structured evaluation.

This results in superficial feedback and inaccurate assessments.

This paper introduces DeepReview, a multi-stage framework mimicking expert reviewers.

It uses structured analysis and evidence-based
Samuel Schmidgall (@srschmidgall) 's Twitter Profile Photo

🚀🌐Introducing AgentRxiv: a framework where autonomous research agents can upload, retrieve, and build on each other’s research.

AgentRxiv takes your research direction and progressively outputs research, building on its previous work with each new paper!
🧵
Samuel Schmidgall (@srschmidgall) 's Twitter Profile Photo

🎉Read the preprint: agentrxiv.github.io
Try out AgentRxiv: github.com/SamuelSchmidga…
Let’s explore how agents can accelerate research—together.
🧵8/8
Carta Thomas (@cartathomas2) 's Twitter Profile Photo

🚀 Introducing 🧭MAGELLAN—our new metacognitive framework for LLM agents! It predicts its own learning progress (LP) in vast natural language goal spaces, enabling efficient exploration of complex domains.🌍✨Learn more: 🔗 arxiv.org/abs/2502.07709 #OpenEndedLearning #LLM #RL

Ai2 (@allen_ai) 's Twitter Profile Photo

Imagine AI doing science: reading papers, generating ideas, designing and running experiments, analyzing results… How many more discoveries can we reveal? 🧐

Meet CodeScientist, a promising next step toward autonomous scientific discovery. 🧵
All Hands AI (@allhands_ai) 's Twitter Profile Photo

Today, we're excited to make two big announcements!

- OpenHands LM: The strongest 32B coding agent model, resolving 37.4% of issues on SWE-bench Verified 📈
- OpenHands Cloud: SOTA open-source coding agents from your computer, phone, github, with $50 in free credits 🙌☁️
Pieter Abbeel (@pabbeel) 's Twitter Profile Photo

I'm thrilled to share our first release as the AGI SF Lab. Meet Nova Act -- the most effortless way to build agents that can reliably use browsers, giving agents access to much of our digital world. It brings us closer to building universal agents in both digital and physical

Sakana AI (@sakanaailabs) 's Twitter Profile Photo

Introducing The AI Scientist-v2, which produced the 1st fully AI-generated paper to pass peer review at a workshop level (at ICLR 2025)!

Tech Report: pub.sakana.ai/ai-scientist-v…
GitHub: github.com/SakanaAI/AI-Sc…

V2 adds:
🧠 Agentic Tree Search (open-ended idea exploration)
💡
Cong Lu (@cong_ml) 's Twitter Profile Photo

Delighted to share the full pre-print and 100% open source code for The AI Scientist-v2 🚀🚀 Tons of improvements enabling modern foundation models to come up with better ideas, conduct deeper and more general experiments, and write better papers! 👩‍🔬