survey

A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations

A systematic survey and critical review on evaluating large language models with recommendations for more rigorous evaluation.