KISTI Institutional Repository: Runtime prediction of parallel applications with workload-aware clustering

KISTI repository

download0 view1,310

This item is licensed Korea Open Government License

Title: Runtime prediction of parallel applications with workload-aware clustering

Abstract: Several fields of science have demanded large-scale workflow support, which requires thousands of CPU cores or more. In order to support such large-scale scientific workflows, high capacity parallel systems such as supercomputers are widely used. In order to increase the utilization of these systems, most schedulers use backfilling policy: Small jobs are moved ahead to fill in holes in the schedule when large jobs do not delay. Since an estimate of the runtime is necessary for backfilling, most parallel systems use user's estimated runtime. However, it is found to be extremely inaccurate because users overestimate their jobs. Therefore, in this paper, we propose a novel system for the runtime prediction based on workload-aware clustering with the goal of improving prediction performance. The proposed scheme develops support vector regression model by the clusters resulted from a self-organizing map and hierarchical clustering analysis with the feature space reduced by factor analysis to reinforce prediction accuracy. In the experiments, we use workload logs on parallel systems (i.e., iPSC, LANL-CM5, SDSC-Par95, SDSC-Par96, and CTC-SP2) to evaluate the effectiveness of our approach. Comparing with other techniques, experimental results show that the proposed method improves the accuracy up to 69.08%.

Keyword: Runtime prediction; workload-aware clustering; support vector regression; machine learning approach

URI: https://repository.kisti.re.kr/handle/10580/14612
http://www.ndsl.kr/ndsl/search/detail/article/articleSearchResultDetail.do?cn=NART78984346

KISTI 국가과학기술데이터본부 디지털큐레이션센터 데이터표준화팀
우)34141 대전광역시 유성구 대학로 245 한국과학기술정보연구원
Tel 042) 869-1004,1234 FAX 042) 869-1091

KISTI Institutional Repository는 국립중앙도서관 OAK 보급사업으로 구축되었습니다.