(1)
Basaran, B. Evaluating Large Language Models for Educational Measurement Insights from Automated and Human Scoring of Language Exams. JAIT 2026.