Building a Large Language Model for the Human Resource Services in Malaysia
DOI: https://doi.org/10.37965/jait.2025.0752

Keywords: base model, employee engagement, GPT-2, human resources, large language model

Abstract
This study presents the development of a domain-specific base large language model (LLM) tailored for application in human resources (HR) management, particularly employee engagement. The model addresses inefficiencies in traditional HR systems, such as inconsistent query resolution in compliance management and policy dissemination. By leveraging advanced LLM techniques such as Rotary Positional Embeddings (RoPE), grouped key-value attention, and Transformer block enhancements, this research designs and develops an HR-specific base LLM to serve as the first line of HR service for employees in small and medium-sized enterprises. Data preparation involves cleaning and tokenizing HR-specific datasets for training, enabling the model to handle nuanced queries with contextual relevance. Through iterative training and evaluation, the Enhanced GPT-2 model demonstrates significant improvements in learning capability over GPT-2 Small, as reflected in the attention weights of the embedding layers, in terms of relevance, consistency, and scalability. Future work focuses on expanding datasets, improving fine-tuning techniques, and integrating retrieval-augmented generation for real-time adaptability.
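The abstract names RoPE and grouped key-value attention only at a high level. The sketch below is an illustrative, minimal PyTorch rendering of how these two mechanisms are typically combined in an attention layer; it is not the authors' implementation, and all function names, shapes, and hyperparameters are assumptions for demonstration.

```python
import torch


def rope(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    """Apply Rotary Positional Embeddings to x of shape (batch, heads, seq, head_dim).

    Illustrative sketch only; real implementations usually cache the cos/sin tables.
    """
    b, h, t, d = x.shape
    half = d // 2
    # One rotation frequency per pair of dimensions.
    freqs = base ** (-torch.arange(half, dtype=torch.float32) / half)
    angles = torch.arange(t, dtype=torch.float32)[:, None] * freqs[None, :]  # (seq, half)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., :half], x[..., half:]
    # Rotate each (x1, x2) pair by its position-dependent angle.
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)


def grouped_kv_attention(q: torch.Tensor, k: torch.Tensor, v: torch.Tensor,
                         n_kv_heads: int) -> torch.Tensor:
    """Grouped key-value attention: fewer KV heads than query heads.

    q: (batch, n_q_heads, seq, head_dim); k, v: (batch, n_kv_heads, seq, head_dim).
    Each KV head is shared across a group of query heads.
    """
    b, n_q, t, d = q.shape
    group = n_q // n_kv_heads
    # Broadcast each KV head to every query head in its group.
    k = k.repeat_interleave(group, dim=1)
    v = v.repeat_interleave(group, dim=1)
    att = (q @ k.transpose(-2, -1)) / d ** 0.5
    # Causal mask so each position attends only to earlier tokens.
    mask = torch.triu(torch.ones(t, t, dtype=torch.bool), diagonal=1)
    att = att.masked_fill(mask, float("-inf")).softmax(dim=-1)
    return att @ v


# Example usage with assumed toy dimensions.
b, t, d, n_q, n_kv = 2, 16, 64, 8, 2
q = rope(torch.randn(b, n_q, t, d))
k = rope(torch.randn(b, n_kv, t, d))
v = torch.randn(b, n_kv, t, d)
out = grouped_kv_attention(q, k, v, n_kv_heads=n_kv)  # shape: (2, 8, 16, 64)
```

Sharing key-value heads across groups of query heads reduces the KV-cache memory footprint at inference time, which is one reason such variants are attractive for resource-constrained deployments like the SME HR setting described above.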
License
Copyright (c) 2025 Authors

This work is licensed under a Creative Commons Attribution 4.0 International License.