This item is licensed Korea Open Government License
dc.contributor.author
NGUYEN NGOC CAO
dc.contributor.author
김직수
dc.contributor.author
이재환
dc.contributor.author
황순욱
dc.date.accessioned
2019-08-28T07:42:18Z
dc.date.available
2019-08-28T07:42:18Z
dc.date.issued
2018-03-31
dc.identifier.issn
1386-7857
dc.identifier.uri
https://repository.kisti.re.kr/handle/10580/14774
dc.description.abstract
We have designed and implemented a new data processing framework called ‘‘Many-task computing On HAdoop’’
(MOHA) which aims to effectively support fine-grained many-task applications that can show another type of dataintensive
workloads in the YARN-based Hadoop 2.0 platform. MOHA is developed as one of Hadoop YARN applications
so that it can transparently co-host existing many-task computing (MTC) applications with other data processing workflows
such as MapReduce in a single Hadoop cluster. In this paper, we investigate main characteristics of two well-known opensource
message broker middleware systems (Apache ActiveMQ and Kafka) and their implications on a many-task management
scheme in our MOHA framework. Through our extensive experiments with a real MTC application, we
demonstrate and discuss trade-offs between parallelism and load balancing of data access patterns in message broker
middleware systems for Many-Task Computing on Hadoop.
dc.language
eng
dc.relation.ispartofseries
Cluster Computing
dc.title
On the role of message broker middleware for many-task computing