Dimitris Papailiopoulos (@dimitrispapail) 's Twitter Profile
Dimitris Papailiopoulos

@dimitrispapail

Researcher @MSFTResearch, AI Frontiers Lab; Prof @UWMadison (on leave); learning in context; thinking about reasoning; babas of Inez Lily.

ID: 573817445

linkhttp://papail.io calendar_today07-05-2012 17:26:48

7,7K Tweet

16,16K Followers

1,1K Following

Dongmin Park @ iclr25 (@dongmin_park11) 's Twitter Profile Photo

🚨New Paper Alert As a game company, KRAFTON AI is actively exploring how to apply LLM agents to video games. We present Orak—a foundational video gaming benchmark for LLM agents! Includes Pokémon, StarCraft II, Slay the Spire, Darkest Dungeon, Ace Attorney, and more in🧵

🚨New Paper Alert

As a game company, <a href="/Krafton_AI/">KRAFTON AI</a> is actively exploring how to apply LLM agents to video games.

We present Orak—a foundational video gaming benchmark for LLM agents!

Includes Pokémon, StarCraft II, Slay the Spire, Darkest Dungeon, Ace Attorney, and more in🧵
Dimitris Papailiopoulos (@dimitrispapail) 's Twitter Profile Photo

Some of the most impactful work you can do in academia isn’t cool new algos or novel architectures. It’s data research. Data research isn’t just dumping tokens into a json. It requires a ton of rigorous experimentation, algorithmic thinking, and actually talking to your models.

Dimitris Papailiopoulos (@dimitrispapail) 's Twitter Profile Photo

I find it surprising when I meet LLM folks that don't talk to the models they're training/evaluating. That is an incredibly fun aspect of the research AND is super informative. You're no longer training a CIFAR10 classifier you'd never use. This thing can literally talk to you!

Dimitris Papailiopoulos (@dimitrispapail) 's Twitter Profile Photo

Best way to get incredibly good peer review on your papers is to have them go viral on twitter. It is however advisable to be near certain they are correct before doing so.

Dimitris Papailiopoulos (@dimitrispapail) 's Twitter Profile Photo

Gemini 2.5 is extremely good at explaining data, and coming up with theoretical hypotheses that are sound. It's also great at "idea" debugging when forming speculations about phenomena. I'm actually impressed. Better than o3.

Jeff Dean (@jeffdean) 's Twitter Profile Photo

If you're ever in Athens, I highly recommend the Museum of Ancient Greek Technology! maps.app.goo.gl/sngwM98ueKkroo… Early humanoid robots to pour water+wine, demonstrations of heat+steam+water weight to "magically" open 500 kg temple doors, armor, ... (cont)

If you're ever in Athens, I highly recommend the Museum of Ancient Greek Technology! 

maps.app.goo.gl/sngwM98ueKkroo…

Early humanoid robots to pour water+wine, demonstrations of heat+steam+water weight to "magically" open 500 kg temple doors, armor, ...
(cont)
Andy Konwinski (@andykonwinski) 's Twitter Profile Photo

Today, I’m launching a deeply personal project. I’m betting $100M that we can help computer scientists create more upside impact for humanity. Built for and by researchers, including Jeff Dean & Joelle Pineau on the board, Laude Institute catalyzes research with real-world impact.

Today, I’m launching a deeply personal project. I’m betting $100M that we can help computer scientists create more upside impact for humanity.
Built for and by researchers, including <a href="/JeffDean/">Jeff Dean</a> &amp; <a href="/jpineau1/">Joelle Pineau</a> on the board, <a href="/LaudeInstitute/">Laude Institute</a> catalyzes research with real-world impact.
Kangwook Lee (@kangwook_lee) 's Twitter Profile Photo

🚨New Paper Alert🚨 Could generative agents powered by LLMs transform social science by accurately simulating human social behaviors at scale? We tested this possibility with virtual humans facing disease threats in "Infected Smallville." 🧵