Basaran, B. “Evaluating Large Language Models for Educational Measurement: Insights from Automated and Human Scoring of Language Exams”. Journal of Artificial Intelligence and Technology, vol. 6, Feb. 2026, pp. 349-55, doi:10.37965/jait.2026.0949.