download380 view2,263
twitter facebook

공공누리This item is licensed Korea Open Government License

dc.contributor.author
Bsoul, Qusay
dc.contributor.author
Abdul Salam, Rosalina
dc.contributor.author
Atwan, Jaffar
dc.contributor.author
Jawarneh, Malik
dc.date.accessioned
2022-03-15T07:17:16Z
dc.date.available
2022-03-15T07:17:16Z
dc.date.issued
2021-12-30
dc.identifier.issn
2287-4577
dc.identifier.uri
https://repository.kisti.re.kr/handle/10580/16428
dc.description.abstract
Text clustering is one of the most commonly used methods for detecting themes or types of documents. Text clustering is used in many fields, but its effectiveness is still not sufficient to be used for the understanding of Arabic text, especially with respect to terms extraction, unsupervised feature selection, and clustering algorithms. In most cases, terms extraction focuses on nouns. Clustering simplifies the understanding of an Arabic text like the text of the Quran; it is important not only for Muslims but for all people who want to know more about Islam. This paper discusses the complexity and limitations of Arabic text clustering in the Quran based on their themes. Unsupervised feature selection does not consider the relationships between the selected features. One weakness of clustering algorithms is that the selection of the optimal initial centroid still depends on chances and manual settings. Consequently, this paper reviews literature about the three major stages of Arabic clustering: terms extraction, unsupervised feature selection, and clustering. Six experiments were conducted to demonstrate previously un-discussed problems related to the metrics used for feature selection and clustering. Suggestions to improve clustering of the Quran based on themes are presented and discussed.
dc.format
application/pdf
dc.language.iso
eng
dc.publisher
Korea Institute of Science and Technology Information
dc.relation.ispartofseries
Journal of Information Science Theory and Practice;Volume 9 Issue 4
dc.title
Arabic Text Clustering Methods and Suggested Solutions for Theme-Based Quran Clustering: Analysis of Literature
dc.type
Serial
dc.identifier.doi
https://doi.org/10.1633/JISTaP.2021.9.4.2
dc.contributor.approver
KOAR, ADMIN
dc.date.dateaccepted
2022-03-15T07:17:16Z
dc.date.datesubmitted
2022-03-15T07:17:16Z
dc.subject.keyword
text mining
dc.subject.keyword
Arabic text clustering algorithms
dc.subject.keyword
terms extraction
dc.subject.keyword
un-supervised feature selection
dc.subject.keyword
optimal initial centroid
Appears in Collections:
8. KISTI 간행물 > JISTaP > Vol. 9 - No. 4
Files in This Item:
Thumbnail Arabic Text Clustering Methods and Suggested Solutions for Theme-Based Quran Clustering - Analysis of Literature.pdf1.24 MBDownload

Browse