[1] B. Basaran, “Evaluating Large Language Models for Educational Measurement Insights from Automated and Human Scoring of Language Exams”, JAIT, vol. 6, pp. 349–355, Feb. 2026.