download429 view948
twitter facebook

CC_BYThis item is licensed Creative Commons License

dc.contributor.author
Ning Yu
dc.date.accessioned
2018-10-12T04:51:06Z
dc.date.available
2018-10-12T04:51:06Z
dc.date.issued
2013-02-28
dc.identifier.issn
2287-4577
dc.identifier.uri
https://repository.kisti.re.kr/handle/10580/8624
dc.description.abstract
Domain transfer is a widely recognized problem for machine learning algorithms because models built upon one data domain generally do not perform well in another data domain. This is especially a challenge for tasks such as opinion classification, which often has to deal with insufficient quantities of labeled data. This study investigates the feasibility of self-training in dealing with the domain transfer problem in opinion classification via leveraging labeled data in non-target data domain(s) and unlabeled data in the target-domain. Specifically, self-training is evaluated for effectiveness in sparse data situations and feasibility for domain adaptation in opinion classification. Three types of Web content are tested: edited news articles, semi-structured movie reviews, and the informal and unstructured content of the blogosphere. Findings of this study suggest that, when there are limited labeled data, self-training is a promising approach for opinion classification, although the contributions vary across data domains. Significant improvement was demonstrated for the most challenging data domain-the blogosphere-when a domain transfer-based self-training strategy was implemented.
dc.format
image/jpeg
dc.language.iso
eng
dc.relation.ispartofseries
Journal of Information Science Theory and Practice
dc.title
Domain Adaptation for Opinion Classification: A Self-Training Approach
dc.type
Article
dc.rights.license
CC_BY
dc.identifier.doi
10.1633/JISTaP.2013.1.1.1
dc.citation.endPage
26
dc.citation.number
1
dc.citation.startPage
10
dc.citation.volume
1
dc.identifier.bibliographicCitation
vol. 1, no. 1, page. 10 - 26
dc.subject.keyword
Domain adaptation
dc.subject.keyword
Opinion classification
dc.subject.keyword
Self-training
dc.subject.keyword
Semi-supervised learning
dc.subject.keyword
Sentiment analysis
dc.subject.keyword
Machine learning
dc.rights.holder
KISTI
Appears in Collections:
8. KISTI 간행물 > JISTaP > Vol. 1 - No. 1
Files in This Item:
Thumbnail E1JSCH_2013_v1n1_10.pdf332.33 kBDownload

Browse