Efficient discovery of risk patterns in medical data
Article
Article Title | Efficient discovery of risk patterns in medical data |
---|---|
ERA Journal ID | 5031 |
Article Category | Article |
Authors | Li, Jiuyong (Author), Fu, Ada Wai-chee (Author) and Fahey, Paul (Author) |
Journal Title | Artificial Intelligence in Medicine |
Journal Citation | 45 (1), pp. 77-89 |
Number of Pages | 13 |
Year | 2009 |
Publisher | Elsevier |
Place of Publication | Amsterdam, Netherlands |
ISSN | 0933-3657; 1873-2860 |
Digital Object Identifier (DOI) | https://doi.org/10.1016/j.artmed.2008.07.008 |
Web Address (URL) | http://www.sciencedirect.com/science/article/pii/S0933365708000900 |
Abstract | Objective: This paper studies the problem of efficiently discovering risk patterns in medical data. Risk patterns are defined by a statistical metric, relative risk, which is widely used in epidemiological research. Methods: To avoid fruitless search in the complete exploration of risk patterns, we define an optimal risk pattern set that excludes superfluous patterns, i.e. complex patterns with lower relative risk than their corresponding simpler patterns. We prove that mining optimal risk pattern sets conforms to an anti-monotone property that supports an efficient mining algorithm. We propose an efficient algorithm for mining optimal risk pattern sets based on this property. We also propose a hierarchical structure to present discovered patterns for easy perusal by domain experts. Results: The proposed approach is compared with two well-known rule discovery methods, decision tree and association rule mining, on benchmark data sets and is applied to a real-world application. The proposed method discovers more, and better-quality, risk patterns than a decision tree approach. The decision tree method is not designed for such applications and is inadequate for pattern exploration. Unlike an association rule mining approach, the proposed method does not discover a large number of uninteresting superfluous patterns, and it is also more efficient. A real-world case study shows that the method reveals some interesting risk patterns to medical practitioners. Conclusion: The proposed method is an efficient approach to exploring risk patterns. It quickly identifies cohorts of patients that are vulnerable to a risk outcome from a large data set. The proposed method is useful for exploratory studies of large medical data to generate and refine hypotheses. The method is also useful for designing medical surveillance systems. |
Keywords | association rule; data mining; decision tree; epidemiology; relative risk; risk pattern |
ANZSRC Field of Research 2020 | 490501. Applied statistics; 490502. Biostatistics; 420399. Health services and systems not elsewhere classified |
Public Notes | Files associated with this item cannot be displayed due to copyright restrictions. |
Byline Affiliations | University of South Australia; Chinese University of Hong Kong, China; Department of Mathematics and Computing |
Institution of Origin | University of Southern Queensland |
https://research.usq.edu.au/item/q1153/efficient-discovery-of-risk-patterns-in-medical-data
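
The two ideas named in the abstract, relative risk and the exclusion of superfluous patterns, can be illustrated with a minimal sketch. This is not the authors' algorithm: it does not use the anti-monotone pruning the paper proves, and it relies on the standard epidemiological definition of relative risk (risk among records matching a pattern divided by risk among the rest). The record layout, the function names `relative_risk` and `drop_superfluous`, and the toy data are all assumptions made for illustration.

```python
from itertools import combinations


def matches(record, pattern):
    """True if the record has every (attribute, value) pair in the pattern."""
    return all(record.get(attr) == value for attr, value in pattern)


def relative_risk(records, pattern, outcome="outcome"):
    """Standard relative risk: P(outcome | pattern) / P(outcome | not pattern).

    `records` is a list of dicts with a 0/1 outcome field (assumed layout).
    Returns None when the ratio is undefined.
    """
    exposed = [r for r in records if matches(r, pattern)]
    unexposed = [r for r in records if not matches(r, pattern)]
    if not exposed or not unexposed:
        return None
    p_exposed = sum(r[outcome] for r in exposed) / len(exposed)
    p_unexposed = sum(r[outcome] for r in unexposed) / len(unexposed)
    if p_unexposed == 0:
        return float("inf") if p_exposed > 0 else None
    return p_exposed / p_unexposed


def drop_superfluous(rr_by_pattern):
    """Drop any pattern whose relative risk is lower than that of a simpler
    pattern (a proper, non-empty subset) present in the same dictionary.

    This is a brute-force reading of the abstract's description, not the
    paper's efficient mining procedure.
    """
    kept = {}
    for pattern, rr in rr_by_pattern.items():
        simpler = (
            frozenset(combo)
            for size in range(1, len(pattern))
            for combo in combinations(pattern, size)
        )
        if any(s in rr_by_pattern and rr_by_pattern[s] > rr for s in simpler):
            continue
        kept[pattern] = rr
    return kept


# Hypothetical toy data: eight patients, two attributes, binary outcome.
toy_records = [
    {"smoker": "yes", "age": "old", "outcome": 1},
    {"smoker": "yes", "age": "old", "outcome": 1},
    {"smoker": "yes", "age": "young", "outcome": 1},
    {"smoker": "yes", "age": "young", "outcome": 0},
    {"smoker": "no", "age": "old", "outcome": 1},
    {"smoker": "no", "age": "old", "outcome": 0},
    {"smoker": "no", "age": "young", "outcome": 0},
    {"smoker": "no", "age": "young", "outcome": 0},
]

smoker = frozenset({("smoker", "yes")})
print(relative_risk(toy_records, smoker))
```

Patterns are represented as frozensets of (attribute, value) pairs so that a simpler pattern is just a proper subset, which keeps the superfluity check a plain dictionary lookup.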