Exploiting highly qualified pattern with frequency and weight occupancy
Article
Article Title | Exploiting highly qualified pattern with frequency and weight occupancy |
---|---|
ERA Journal ID | 18060 |
Article Category | Article |
Authors | Gan, Wensheng (Author), Lin, Jerry Chun-Wei (Author), Fournier-Viger, Philippe (Author), Chao, Han-Chieh (Author), Zhan, Justin (Author) and Zhang, Ji (Author) |
Journal Title | Knowledge and Information Systems |
Journal Citation | 56 (1), pp. 165-196 |
Number of Pages | 32 |
Year | 2018 |
Place of Publication | London, United Kingdom |
ISSN | 0219-1377; 0219-3116 |
Digital Object Identifier (DOI) | https://doi.org/10.1007/s10115-017-1103-8 |
Web Address (URL) | https://link.springer.com/article/10.1007%2Fs10115-017-1103-8 |
Abstract | By identifying useful knowledge embedded in the behavior of search engines, users can provide valuable information for web searching and data mining. Numerous algorithms have been proposed to find the desired interesting patterns, i.e., frequent patterns, in real-world applications. Most of those studies use frequency to measure the interestingness of patterns. However, each object may have a different importance in these real-world applications, and the frequent patterns do not usually contain a large portion of the desired patterns. In this paper, we present a novel method, called exploiting highly qualified patterns with frequency and weight occupancy (QFWO), to suggest the possible highly qualified patterns that utilize the idea of co-occurrence and weight occupancy. By considering item weight, weight occupancy, and pattern frequency, we define a new type of highly qualified pattern. A novel set-enumeration tree called the frequency-weight (FW)-tree and two compact data structures named weight-list and FW-table are designed to hold the global downward closure property and partial downward closure property of quality and weight occupancy to further prune the search space. The proposed method can exploit highly qualified patterns in a recursive manner without candidate generation. Extensive experiments were conducted on both real-world and synthetic datasets to evaluate the effectiveness and efficiency of the proposed algorithm. Results demonstrate that the obtained patterns are reasonable and acceptable. Moreover, the designed QFWO with several pruning strategies is quite efficient in terms of runtime and search space. |
Keywords | data mining; association rules; interestingness measures |
ANZSRC Field of Research 2020 | 469999. Other information and computing sciences not elsewhere classified |
Public Notes | Files associated with this item cannot be displayed due to copyright restrictions. |
Byline Affiliations | Peking University, China; Harbin Institute of Technology, China; University of Nevada, United States; School of Agricultural, Computational and Environmental Sciences |
Institution of Origin | University of Southern Queensland |
https://research.usq.edu.au/item/q52vx/exploiting-highly-qualified-pattern-with-frequency-and-weight-occupancy