LayerLens (@layerlens_ai) 's Twitter Profile
LayerLens

@layerlens_ai

Pioneering Trust in the Age of Generative AI.

Book a demo: cal.com/archie-chaudhu…

ID: 1847432077639114752

linkhttp://layerlens.com calendar_today19-10-2024 00:18:56

443 Tweet

175 Takipçi

54 Takip Edilen

LayerLens (@layerlens_ai) 's Twitter Profile Photo

Another cool benchmarking paper published yesterday. In "JuStRank: Benchmarking LLM Judges for System Ranking", researchers from IBM Research introduced JuStRank, the first large-scale benchmark for evaluating LLM judges for ranking target systems: arxiv.org/abs/2412.09569