Abstract
In this paper, we propose mining maximal frequent patterns from univariate uncertain data. Univariate uncertain data refers to cases where each attribute in a transaction is associated with a quantitative interval and a probability density function that assigns a probability to each value in the interval. The number of frequent U2 patterns (i.e. frequent patterns of univariate uncertain data) is usually very large. To return a concise and informative mining result to users, we propose mining maximal frequent U2 patterns (MFU2Ps). A maximal frequent U2 pattern is a frequent U2 pattern without any frequent superset. The three proposed algorithms, MU2P-Miner, U2GenMax, and U2MAFIA, are different in terms of both the data formats used to store transactions and the structures used to store the MFU2Ps which are found during the mining process. The experiment results show that different algorithms excel when applied to different datasets and settings. We have applied the proposed algorithms to univariate uncertain data comprising measurements of the air quality and weather conditions in Taiwan; the derived MFU2Ps show that the air quality in Taiwan is usually good (unless a sand storm affects the island) and the weather is often wet.
Get full access to this article
View all access options for this article.
