A two-stage text mining model for information filtering
Paper
Paper/Presentation Title | A two-stage text mining model for information filtering |
---|---|
Presentation Type | Paper |
Authors | Li, Yuefeng (Author), Zhou, Xujuan (Author), Bruza, Peter (Author), Xu, Yue (Author) and Lau, Raymond Y.K. (Author) |
Journal or Proceedings Title | Proceedings of the 17th ACM conference on Information and Knowledge Management |
Number of Pages | 10 |
Year | 2008 |
Place of Publication | New York, United States |
ISBN | 9781595939913 |
Digital Object Identifier (DOI) | https://doi.org/10.1145/1458082.1458218 |
Web Address (URL) of Paper | https://dl.acm.org/citation.cfm?id=1458082 |
Conference/Event | CIKM 2008: ACM 17th Conference on Information and Knowledge Management |
Event Details | CIKM 2008: ACM 17th Conference on Information and Knowledge Management Event Date 26 to end of 30 Oct 2008 Event Location Napa Valley, United States |
Abstract | Mismatch and overload are the two fundamental issues regarding the effectiveness of information filtering. Both term-based and pattern (phrase) based approaches have been employed to address these issues. However, they all suffer from some limitations with regard to effectiveness. This paper proposes a novel solution that includes two stages: an initial topic filtering stage followed by a stage involving pattern taxonomy mining. The objective of the first stage is to address mismatch by quickly filtering out probable irrelevant documents. The threshold used in the first stage is motivated theoretically. The objective of the second stage is to address overload by apply pattern mining techniques to rationalize the data relevance of the reduced document set after the first stage. Substantial experiments on RCV1 show that the proposed solution achieves encouraging performance. |
Keywords | nformation Filtering, Text Mining, Decision Rules, Thresh- olds, Weighting Schema |
ANZSRC Field of Research 2020 | 469999. Other information and computing sciences not elsewhere classified |
Public Notes | Files associated with this item cannot be displayed due to copyright restrictions. |
Byline Affiliations | Queensland University of Technology |
City University of Hong Kong, China | |
Institution of Origin | University of Southern Queensland |
https://research.usq.edu.au/item/q395v/a-two-stage-text-mining-model-for-information-filtering
844
total views10
total downloads2
views this month0
downloads this month