framework

ZeroSumEval: An Extensible Framework for Scaling LLM Evaluation with Inter-Model Competition

An extensible framework for scaling LLM evaluation through inter-model competition.