Ceaglex (@ceaglex_) 's Twitter Profile
Ceaglex

@ceaglex_

Dream to become a PhD in audio generation field

ID: 1493205242246164481

calendar_today14-02-2022 12:47:46

44 Tweet

16 Takipçi

78 Takip Edilen

Jiayi Zhang @ICLR2025 (@didiforx) 's Twitter Profile Photo

No labels? ØvO help you! Excited to share our new paper: Self-Supervised Prompt Optimization (arxiv.org/abs/2502.06855) 🔥 Key features: ØvO: Output vs Output - no labels/human feedback needed! 99% cost reduction ($0.15) SOTA performance with just 3 examples 1/5

No labels? ØvO help you! 
Excited to share our new paper: 
Self-Supervised Prompt Optimization (arxiv.org/abs/2502.06855)  
🔥 Key features: 
ØvO: Output vs Output - no labels/human feedback needed! 
99% cost reduction ($0.15) 
SOTA performance with just 3 examples  

1/5
Jiayi Zhang @ICLR2025 (@didiforx) 's Twitter Profile Photo

Reasoning models lack atomic thought ⚛️ Unlike humans using independent units, they store full histories🤔 Introducing Atom of Thoughts (AOT): lifts gpt-4o-mini to 80.6% F1 on HotpotQA, surpassing o3-mini and DeepSeek-R1 ! The best part? It's plugs in for ANY framework 🔌 1/5

Reasoning models lack atomic thought ⚛️

Unlike humans using independent units, they store full histories🤔

Introducing Atom of Thoughts (AOT): lifts gpt-4o-mini to 80.6% F1 on HotpotQA, surpassing o3-mini and DeepSeek-R1 !

The best part? It's plugs in for ANY framework 🔌
1/5
MetaGPT (@metagpt_) 's Twitter Profile Photo

20 Months: 0 → 7 papers (2 ICLR orals) & 40+ institution collabs. With a clear vision, we're building the open-source foundation for tomorrow's agents. We also release MGX (mgx.dev) and commit to open-source its core soon. Check threads for what we've built! 1/8

20 Months: 0 → 7 papers (2 ICLR orals) & 40+ institution collabs.
With a clear vision, we're building the open-source foundation for tomorrow's agents.
We also release MGX (mgx.dev) and commit to open-source its core soon.
Check threads for what we've built!
1/8
Jiayi Zhang @ICLR2025 (@didiforx) 's Twitter Profile Photo

No fortress, purely open ground. Manus 👋. We open-sourced its core feature in 2 hours after dinner. Check it out 👇: github.com/mannaandpoem/O… 1/4

Jiayi Zhang @ICLR2025 (@didiforx) 's Twitter Profile Photo

Text-to-SQL woes? Reasoning models stumble in zero-shot tasks 😓 Enter Alpha-SQL — our breakthrough boosts 7B LLMs by 15-20%, topping GPT-4o SOTA and even reasoning models on BIRD! 🎉 Test Time Scaling still shines. How we nailed it 👇: 1/5

Text-to-SQL woes? Reasoning models stumble in zero-shot tasks 😓  

Enter Alpha-SQL — our breakthrough boosts 7B LLMs by 15-20%, topping GPT-4o SOTA and even reasoning models on BIRD! 🎉 

Test Time Scaling still shines.  

How we nailed it 👇: 

1/5
BangLiu (@bangl93) 's Twitter Profile Photo

🧠264 pages and 1416 references chart the future of Foundation Agents. Our latest survey dives deep into agents—covering brain-inspired cognition, self-evolution, multi-agents, and AI safety. Discover the #1 Paper of the Day on Hugging Face👇: huggingface.co/papers/2504.01… 1/3

🧠264 pages and 1416 references chart the future of Foundation Agents.

Our latest survey dives deep into agents—covering brain-inspired cognition, self-evolution, multi-agents, and AI safety.

Discover the #1 Paper of the Day on Hugging Face👇:

huggingface.co/papers/2504.01…

1/3
Jiayi Zhang @ICLR2025 (@didiforx) 's Twitter Profile Photo

It's actually a pity that we got no enough time to maintain OpenManus during the past 3 months. But the better news is that we will build a formal open-source community for OpenManus at the end of this month.

It's actually a pity that we got no enough time to maintain OpenManus during the past 3 months.

But the better news is that we will build a formal open-source community for OpenManus at the end of this month.
Jiayi Zhang @ICLR2025 (@didiforx) 's Twitter Profile Photo

Why don't products claiming to be "general agents" like Manus and GenSpark compare their coding capabilities and deep search prowess against OpenAI's agents? I mean, can Manus even handle something as basic as maintaining a survey repo? It just needs simple search and automated

BangLiu (@bangl93) 's Twitter Profile Photo

🤖Check The Hitchhiker’s Guide to Agents HERE🤖 Our Foundation Agents Survey V2 level up to 396 pages – every chapter is a full-on survey itself! 🧠 Agent Framework & Components 🌍 World Model & Memory 🔄 Self-Evolution 👥 Multi Agents 🛡️ Safety 1/4

🤖Check The Hitchhiker’s Guide to Agents HERE🤖

Our Foundation Agents Survey V2 level up to 396 pages – every chapter is a full-on survey itself!
🧠 Agent Framework & Components
🌍 World Model & Memory
🔄 Self-Evolution
👥 Multi Agents
🛡️ Safety

1/4