Dongmin Park @ iclr25 (@dongmin_park11) 's Twitter Profile
Dongmin Park @ iclr25

@dongmin_park11

AI Researcher @Krafton_AI (@PUBG) | Prev-Intern @Meta, NAVER | PhD @ KAIST | Data-centric AI, Diffusion, LLM Agents

ID: 1503362786638065666

Link: https://scholar.google.com/citations?view_op=list_works&hl=en&hl=en&user=4xXYQl0AAAAJ | Joined: 14-03-2022 13:30:46

40 Tweets

76 Followers

114 Following

Kevin Wang (@kevinwang_111) 's Twitter Profile Photo

🎉 Thrilled to announce our MindGames challenge is accepted at #NeurIPS2025! 🧠🤖 Ready to deploy your AI agents to compete and collaborate in Hanabi, Werewolf, Stag Hunt, and Colonel Blotto? 🎮 Stay tuned for details!

Grace Luo (@graceluo_) 's Twitter Profile Photo

✨New preprint: Dual-Process Image Generation! We distill *feedback from a VLM* into *feed-forward image generation*, at inference time. The result is flexible control: parameterize tasks as multimodal inputs, visually inspect the images with the VLM, and update the generator.🧵

Dongmin Park @ iclr25 (@dongmin_park11) 's Twitter Profile Photo

Thanks Yoshi Suhara, it was a real pleasure working with you and the NVIDIA AI team! Hope we get to collaborate again in the future, especially on building gaming SLMs jointly!

George (@georgejrjrjr) 's Twitter Profile Photo

Have you read the Deep Research Bench paper yet?

Very cool project. And you can tell building out the eval and its infrastructure was a BUNCH of work.

I would love to see this expanded to include open deep research *scaffolds*, so this hill can get climbed pronto in the open.
Alfonso Amayuelas (@alfonamayuelas) 's Twitter Profile Photo

New paper 🚨📜🚀
Introducing “Agents of Change: Self-Evolving LLM Agents for Strategic Planning”!
In this work, we show how LLM-powered agents can rewrite their own prompts & code to climb the learning curve in the board game Settlers of Catan 🎲
🧵👇
inZOI (@playinzoi) 's Twitter Profile Photo

Your next story begins on Mac this August.

Pre-order today on the Mac App Store and let your imagination lead the way. ➡️ inzoi.me/macpreorder

#Apple #Mac #WWDC #inZOI
#KRAFTON #LifeSimulation
Dongmin Park @ iclr25 (@dongmin_park11) 's Twitter Profile Photo

The Orak 🎮 benchmark leaderboard has just launched! Submit your LLMs and agentic strategies to compete in diverse real-world video games! krafton-ai.github.io/orak-leaderboa… *Orak comes from 오락, a native Korean word meaning “game”

inZOI (@playinzoi) 's Twitter Profile Photo

🔎 Get a glimpse of what’s new in the June Update (v.0.2.0)! Accessories like glasses, headpieces, and earrings can now be freely resized and repositioned. This allows for more precise and flexible styling than ever before. Updates open up more ways to create stories — with new

Kevin Ellis (@ellisk_kellis) 's Twitter Profile Photo

New paper: World models + Program synthesis by Wasu Top Piriyakulkij
1. World modeling on-the-fly by synthesizing programs w/ 4000+ lines of code
2. Learns new environments from minutes of experience
3. Positive score on Montezuma's Revenge
4. Compositional generalization to new environments

Yunzhi Zhang (@zhang_yunzhi) 's Twitter Profile Photo

(1/n) Time to unify your favorite visual generative models, VLMs, and simulators for controllable visual generation—Introducing a Product of Experts (PoE) framework for inference-time knowledge composition from heterogeneous models.

Essential AI (@essential_ai) 's Twitter Profile Photo

[1/5]

🚀 Meet Essential-Web v1.0, a 24-trillion-token pre-training dataset with rich metadata built to effortlessly curate high-performing datasets across domains and use cases!
Dongmin Park @ iclr25 (@dongmin_park11) 's Twitter Profile Photo

🚀 The research teaser video for Orak (오락) is out! 🔗 YouTube: youtube.com/watch?v=2_tUJR… Explore the code, benchmark, and leaderboard, and join us in pushing the boundaries of game agents!

Adina Yakup (@adinayakup) 's Twitter Profile Photo

Stream-Omni 🔥 a new Any-to-Any model by the Chinese Academy of Sciences.
Model: huggingface.co/ICTNLP/stream-…
Paper: huggingface.co/papers/2506.13…
✨ Unified multimodal input: text, vision, and speech
✨ Real-time "see-while-hear" experience
✨ Efficient training with minimal omni-modal data

Dawn Song (@dawnsongtweets) 's Twitter Profile Photo

My group & collaborators have developed many popular benchmarks over the years, e.g., MMLU, MATH, APPS. Really excited about our latest benchmark OMEGA Ω: 🔍 Can LLMs really think outside the box in math? A new benchmark probing 3 axes of generalization:
1️⃣ Exploratory
2️⃣

Kevin Wang (@kevinwang_111) 's Twitter Profile Photo

Excited to announce the Mindgame @NeurIPS Competition is officially LIVE!
🤖 Pit your agents against others in Mafia, Codename, Prisoner’s Dilemma, Stag Hunt, and Colonel Blotto.
Sign up now for $500 in compute credits on your initial run!
🔗 Register: mindgamesarena.com
elvis (@omarsar0) 's Twitter Profile Photo

Agent Leaderboard v2 is here!

> GPT-4.1 leads
> Gemini-2.5-flash excels at tool selection
> Kimi K2 is the top open-source model
> Grok 4 falls short
> Reasoning models lag behind
> No single model dominates all domains 

More below:
Scott Condron (@_scottcondron) 's Twitter Profile Photo

I don't have any special inside knowledge about how Kimi.ai trained Kimi K2. I just read the paper and this part is what I've been telling anyone who will listen about.

Their data generation steps to get lots of high quality, multi-turn agent traces to train on is so much
Kangwook Lee (@kangwook_lee) 's Twitter Profile Photo

🧵 When training reasoning models, what's the best approach? SFT, Online RL, or perhaps Offline RL? At KRAFTON AI and SK telecom, we've explored this critical question, uncovering interesting insights! Let’s dive deeper, starting with the basics first.
1) SFT
SFT (aka hard

Jason Weston (@jaseweston) 's Twitter Profile Photo

🌿Introducing MetaCLIP 2 🌿
📝: arxiv.org/abs/2507.22062
code, model: github.com/facebookresear…

After four years of advancements in English-centric CLIP development, MetaCLIP 2 is now taking the next step: scaling CLIP to worldwide data. The effort addresses long-standing