[1]
B. Basaran, “Evaluating Large Language Models for Educational Measurement Insights from Automated and Human Scoring of Language Exams”, JAIT, Feb. 2026.