SBMARUF
Home
Experience
Projects
Talk
Publications
CV
News
Blog
Awards
framework
ZeroSumEval: An Extensible Framework for Scaling LLM Evaluation with Inter-Model Competition
An extensible framework for scaling LLM evaluation through inter-model competition.
Cite
×