Dani Balcells (@danibalcells) 's Twitter Profile
Dani Balcells

@danibalcells

founding ML research engineer @plastic_labs

ID: 18690610

linkhttp://danibalcells.com calendar_today06-01-2009 19:27:11

97 Tweet

288 Takipçi

393 Takip Edilen

vintro (@vintrotweets) 's Twitter Profile Photo

last week i was playing around with pre-filling reasoning traces in deepseek to see if i could steer its thinking towards better responses anecdotally this seemed to work so i got with Dani Balcells to figure out how we could turn this into a more rigorous experiment...

León (@leonguertler) 's Twitter Profile Photo

TextArena is live on arXiv! We present a benchmark of 57+ competitive text-based games to evaluate and train LLMs on agentic behavior — including negotiation, deception, theory of mind and many more. Real-time TrueSkill. Multiplayer support. Human-vs-models. Model-vs-model.

TextArena is live on arXiv! We present a benchmark of 57+ competitive text-based games to evaluate and train LLMs on agentic behavior — including negotiation, deception, theory of mind and many more.  Real-time TrueSkill. Multiplayer support. Human-vs-models. Model-vs-model.