KISTI Institutional Repository: High performance parallelization of Boyer–Moore algorithm on many-core accelerators

Open Access KISTI

KISTI repository

BROWSE

KISTI Institutional Repository7. KISTI 연구성과 학술지 발표논문

download0 view1,115

This item is licensed Korea Open Government License

Title: High performance parallelization of Boyer–Moore algorithm on many-core accelerators

Author(s): 정요상; 이명호; 김직수; 남덕윤; 황순욱

Publication Year: 2015-06-18

Abstract: Boyer–Moore (BM) algorithm is a single pattern string matching algorithm. It is considered as the most efficient string matching algorithm and used in many applications The algorithm first calculates two string shift rules based on the given pattern string in the preprocessing phase. Using the two shift rules, pattern matching operations are performed against the target input string in the second phase. The string shift rules calculated in the first phase let parts of the target input string be skipped where there are no matches to be found in the second phase. The second phase is a time consuming process and needs to be parallelized in order to realize
the high performance string matching. In this paper,we parallelize the BM algorithm on the latest many-core accelerators such as the Intel Xeon Phi and the Nvidia Tesla K20 GPU along with the general-purpose multi-core microprocessors. For the parallel string matching, the target input data is partitioned amongst multiple threads. Data lying on the threads’ boundaries is searched redundantly so that the pattern string lying on the boundary between two neighboring threads cannot be missed. The redundant data search overheads increases significantly for a large number of threads. For a fixed target input length, the number of possible matches decreases as the pattern length increases. Furthermore, the positions
of the pattern string are spread all over the target data randomly.
This leads to the unbalanced workload distribution among threads. We employ the dynamic scheduling and the multithreading techniques to deal with the load balancing issue. We also use the algorithmic cascading technique to maximize the benefit of the multithreading and to reduce the overheads associated with the redundant data search between neighboring threads. Our parallel implementation leads to ∼17-times speedup on the Xeon Phi and ∼47-times speedup
on the Nvidia Tesla K20 GPU compared with a serial implementation on the host Intel Xeon processor.

Keyword: Boyer–Moore algorithm; Many-core accelerator; Parallelization; Dynamic scheduling; Multithreading; Algorithmic cascading

Journal Title: Cluster Computing

ISSN: 1386-7857

Files in This Item:: There are no files associated with this item.

Appears in Collections:: 7. KISTI 연구성과 > 학술지 발표논문

URI: https://repository.kisti.re.kr/handle/10580/14445

Export: RIS (EndNote); XLS (Excel); XML

Show full item record

KISTI 국가과학기술데이터본부 디지털큐레이션센터 데이터표준화팀
우)34141 대전광역시 유성구 대학로 245 한국과학기술정보연구원
Tel 042) 869-1004,1234 FAX 042) 869-1091

KISTI Institutional Repository는 국립중앙도서관 OAK 보급사업으로 구축되었습니다.

개인정보처리방침

저작권 정책

BROWSE

Browse