Korea Institute of Science and Technology Information
Publication Year
2022-06-20
Abstract
Online recruitment websites discuss job demands in various fields, and job postings contain detailed job specifications. Analyzing this text can elucidate the features that determine job salaries. Text embedding models can learn the contextual information in a text, and explainable artificial intelligence frameworks can be used to examine in detail how text features contribute to the models' outputs. We collected 733,625 job postings using the WORKNET API and classified them into low, mid, and high-range salary groups. A text embedding model that predicts job salaries based on the text in job postings was trained with the collected data. Then, we applied the SHapley Additive exPlanations (SHAP) framework to the trained model and discovered the significant words that determine each salary class. Several limitations and remaining words are also discussed.
Keyword
job posting; text embedding; explainable artificial intelligence; SHapley Additive exPlanations
Journal Title
Journal of Information Science Theory and Practice;Volume 10 Special Issue