download0 view858
twitter facebook

공공누리This item is licensed Korea Open Government License

Title
An Evaluation of Passage-based Text Categorization
Author(s)
김진숙김진숙김명호
Publication Year
2004-07-01
Abstract
Researches in text categorization have been confined to whole-document-level classification, probably due to lack of full-text test collections. However, full-length documents available today in large quantities pose renewed interests in text classification. A document is usually written in an organized structure to present its main topic(s). This structure can be expressed as a sequence of subtopic text blocks, or passages. In order to reflect the subtopic structure of a document, we propose a new passage-level or passage-based text categorization model, which segments a test document into several passages, assigns categories to each passage, and merges
the passage categories to the document categories. Compared with traditional document-level categorization, two additional steps, passage splitting and category merging, are required in this model. Using four subsets of the Reuters text categorization test collection and a full-text test collection of which documents are ...
Keyword
text categorization; passage; non-overlapping window; overlapping window; paragraph; bounded paragraph; page; TextTile; passage weight function
Journal Title
Journal of intelligent information systems
Citation Volume
23
ISSN
0925-9902
Files in This Item:
There are no files associated with this item.
Appears in Collections:
7. KISTI 연구성과 > 학술지 발표논문
URI
https://repository.kisti.re.kr/handle/10580/13565
http://www.ndsl.kr/ndsl/search/detail/article/articleSearchResultDetail.do?cn=NART21399019
Export
RIS (EndNote)
XLS (Excel)
XML

Browse