José Maria Pombal (@zmprcp) 's Twitter Profile
José Maria Pombal

@zmprcp

Research Scientist @unbabel, PhD student @istecnico.

ID: 1633224223454797826

linkhttp://zeppombal.github.io calendar_today07-03-2023 21:53:17

53 Tweet

81 Takipçi

103 Takip Edilen

José Maria Pombal (@zmprcp) 's Twitter Profile Photo

New paper out 🚀 Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models: arxiv.org/abs/2504.01001. We present a framework and release a repository for creating reliable benchmarks for (V)LM tasks quickly and fully automatically.

New paper out 🚀 Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models: arxiv.org/abs/2504.01001.

We present a framework and release a repository for creating reliable benchmarks for (V)LM tasks quickly and fully automatically.