Basaran, B. “Evaluating Large Language Models for Educational Measurement: Insights from Automated and Human Scoring of Language Exams”. Journal of Artificial Intelligence and Technology, Feb. 2026, doi:10.37965/jait.2026.0949.