KISTI Institutional Repository: On the role of message broker middleware for many-task computing

KISTI repository

download0 view1,303

This item is licensed Korea Open Government License

Abstract: We have designed and implemented a new data processing framework called ‘‘Many-task computing On HAdoop’’
(MOHA) which aims to effectively support fine-grained many-task applications that can show another type of dataintensive
workloads in the YARN-based Hadoop 2.0 platform. MOHA is developed as one of Hadoop YARN applications
so that it can transparently co-host existing many-task computing (MTC) applications with other data processing workflows
such as MapReduce in a single Hadoop cluster. In this paper, we investigate main characteristics of two well-known opensource
message broker middleware systems (Apache ActiveMQ and Kafka) and their implications on a many-task management
scheme in our MOHA framework. Through our extensive experiments with a real MTC application, we
demonstrate and discuss trade-offs between parallelism and load balancing of data access patterns in message broker
middleware systems for Many-Task Computing on Hadoop.

Keyword: Many-task computing; Message broker middleware; Hadoop; YARN; ActiveMQ; Kafka; MOHA

KISTI 국가과학기술데이터본부 디지털큐레이션센터 데이터표준화팀
우)34141 대전광역시 유성구 대학로 245 한국과학기술정보연구원
Tel 042) 869-1004,1234 FAX 042) 869-1091

KISTI Institutional Repository는 국립중앙도서관 OAK 보급사업으로 구축되었습니다.