Return to Article Details Evaluating Large Language Models for Educational Measurement Insights from Automated and Human Scoring of Language Exams Download Download PDF