This paper presents Max-Miner, an efficient algorithm for mining maximal frequent itemsets from large databases. Unlike previous algorithms such as Apriori, which scale exponentially with the length of the longest pattern, Max-Miner scales roughly linearly in the number of maximal patterns, regardless of the length of the longest pattern. This makes it significantly more efficient for long patterns. Max-Miner focuses on extracting only maximal frequent itemsets, which are itemsets that have no superset that is frequent. This approach implicitly and concisely represents all frequent itemsets.
Max-Miner improves performance by abandoning a strict bottom-up traversal of the search space and instead attempting to "look ahead" to quickly identify long frequent itemsets. This allows it to prune subsets of these itemsets early, reducing the search space. It also uses a heuristic to tune its search and a technique to determine the support of candidate itemsets without accessing the database. These techniques allow Max-Miner to run in time that is roughly linear in the number of maximal frequent itemsets and the size of the database.
The paper also discusses the integration of additional pattern constraints into the search, culminating in the Max-Miner-LO algorithm, which identifies only the longest maximal frequent itemsets. This algorithm efficiently identifies all of the longest maximal frequent itemsets even when the space of all maximal frequent itemsets is intractably large.
The paper compares Max-Miner with other algorithms such as Apriori and Pincer-Search, showing that Max-Miner performs significantly better, especially for long patterns. It also discusses the use of support lower-bounding to improve performance and the effects of various optimizations on the algorithm's performance. The paper concludes that Max-Miner is a highly efficient algorithm for mining maximal frequent itemsets from large databases, and that further research is needed to explore additional constraints and optimizations.This paper presents Max-Miner, an efficient algorithm for mining maximal frequent itemsets from large databases. Unlike previous algorithms such as Apriori, which scale exponentially with the length of the longest pattern, Max-Miner scales roughly linearly in the number of maximal patterns, regardless of the length of the longest pattern. This makes it significantly more efficient for long patterns. Max-Miner focuses on extracting only maximal frequent itemsets, which are itemsets that have no superset that is frequent. This approach implicitly and concisely represents all frequent itemsets.
Max-Miner improves performance by abandoning a strict bottom-up traversal of the search space and instead attempting to "look ahead" to quickly identify long frequent itemsets. This allows it to prune subsets of these itemsets early, reducing the search space. It also uses a heuristic to tune its search and a technique to determine the support of candidate itemsets without accessing the database. These techniques allow Max-Miner to run in time that is roughly linear in the number of maximal frequent itemsets and the size of the database.
The paper also discusses the integration of additional pattern constraints into the search, culminating in the Max-Miner-LO algorithm, which identifies only the longest maximal frequent itemsets. This algorithm efficiently identifies all of the longest maximal frequent itemsets even when the space of all maximal frequent itemsets is intractably large.
The paper compares Max-Miner with other algorithms such as Apriori and Pincer-Search, showing that Max-Miner performs significantly better, especially for long patterns. It also discusses the use of support lower-bounding to improve performance and the effects of various optimizations on the algorithm's performance. The paper concludes that Max-Miner is a highly efficient algorithm for mining maximal frequent itemsets from large databases, and that further research is needed to explore additional constraints and optimizations.