KISTI Institutional Repository: A Network Packet Analysis Method to Discover Malicious Activities

Open Access KISTI

KISTI repository

BROWSE

KISTI Institutional Repository8. KISTI 간행물 JISTaP Vol. 10 - Special Issue

download437 view1,611

This item is licensed Korea Open Government License

Title: A Network Packet Analysis Method to Discover Malicious Activities

Author(s): Kwon, Taewoong; Myung, Joonwoo; Lee, Jun; Kim, Kyu-il; Song, Jungsuk

Publisher: Korea Institute of Science and Technology Information

Publication Year: 2022-06-20

Abstract: With the development of networks and the increase in the number of network devices, the number of cyber attacks targeting them is also increasing. Since these cyber-attacks aim to steal important information and destroy systems, it is necessary to minimize social and economic damage through early detection and rapid response. Many studies using machine learning (ML) and artificial intelligence (AI) have been conducted, among which payload learning is one of the most intuitive and effective methods to detect malicious behavior. In this study, we propose a preprocessing method to maximize the performance of the model when learning the payload in term units. The proposed method constructs a high-quality learning data set by eliminating unnecessary noise (stopwords) and preserving important features in consideration of the machine language and natural language characteristics of the packet payload. Our method consists of three steps: Preserving significant special characters, Generating a stopword list, and Class label refinement. By processing packets of various and complex structures based on these three processes, it is possible to make high-quality training data that can be helpful to build high-performance ML/AI models for security monitoring. We prove the effectiveness of the proposed method by comparing the performance of the AI model to which the proposed method is applied and not. Forthermore, by evaluating the performance of the AI model applied proposed method in the real-world Security Operating Center (SOC) environment with live network traffic, we demonstrate the applicability of the our method to the real environment.