download407 view1,353
twitter facebook

공공누리This item is licensed Korea Open Government License

Title
Word Embeddings-Based Pseudo Relevance Feedback Using Deep Averaging Networks for Arabic Document Retrieval
Author(s)
Farhan, Yasir HadiNoah, Shahrul Azman MohdMohd, MasnizahAtwan, Jaffar
Publisher
Korea Institute of Science and Technology Information
Publication Year
2021-06-30
Abstract
Pseudo relevance feedback (PRF) is a powerful query expansion (QE) technique that prepares queries using the top k pseudorelevant documents and choosing expansion elements. Traditional PRF frameworks have robustly handled vocabulary mismatch corresponding to user queries and pertinent documents; nevertheless, expansion elements are chosen, disregarding similarity to the original query's elements. Word embedding (WE) schemes comprise techniques of significant interest concerning QE, that falls within the information retrieval domain. Deep averaging networks (DANs) defines a framework relying on average word presence passed through multiple linear layers. The complete query is understandably represented using the average vector comprising the query terms. The vector may be employed for determining expansion elements pertinent to the entire query. In this study, we suggest a DANs-based technique that augments PRF frameworks by integrating WE similarities to facilitate Arabic information retrieval. The technique is based on the fundamental that the top pseudo-relevant document set is assessed to determine candidate element distribution and select expansion terms appropriately, considering their similarity to the average vector representing the initial query elements. The Word2Vec model is selected for executing the experiments on a standard Arabic TREC 2001/2002 set. The majority of the evaluations indicate that the PRF implementation in the present study offers a significant performance improvement compared to that of the baseline PRF frameworks.
Keyword
automatic query expansion; information retrievalword embedding; deep averaging networks; pseudo relevance feedback; Arabic document retrieval on TREC collection
Journal Title
Journal of Information Science Theory and Practice;Volume 9 Issue 2
ISSN
2287-4577
DOI
https://doi.org/10.1633/JISTaP.2021.9.2.1
Files in This Item:
Thumbnail Word Embeddings-Based Pseudo Relevance Feedback Using Deep Averaging Networks for Arabic Document Retrieval.pdf522.28 kBDownload
Appears in Collections:
8. KISTI 간행물 > JISTaP > Vol. 9 - No. 2
Type
Serial
URI
https://repository.kisti.re.kr/handle/10580/16272
Export
RIS (EndNote)
XLS (Excel)
XML

Browse