TY - GEN
T1 - A List based Redundancy Removal Approach by Mining Closed and Non-Derivable Frequent Itemsets
AU - Jashma Suresh, P. P.
AU - Dinesh Acharya, U.
AU - Subba Reddy, N. V.
PY - 2020/5
Y1 - 2020/5
N2 - Mining Frequent Itemsets has become an integral component of research lately, mainly because it can be applied to a variety of areas, including clustering of documents for information retrieval, inventory management, association rule mining, and outlier detection. Existing techniques that extract "frequent items" do not take care to address redundancy. An itemset is said to be redundant if their support can be derived from other existing itemsets using standard deduction techniques. The presence of such itemsets in the collection was found to increase the size of the collection of the mined frequent items. This in turn affects the memory and run time of the mining process. To overcome these drawbacks, this paper employs a list-based approach that generates only "Closed and Non-Derivable Itemsets." This is a collection of items 'I' such that 'I' has no superset with the same support as that of itself, and I belongs to the collection of those items whose lowest upper bound and highest lower bound values are not the same. With the help of experiments, it has been proved that the proposed approach addresses redundancy better by generating a compressed set of non-redundant frequent items.
AB - Mining Frequent Itemsets has become an integral component of research lately, mainly because it can be applied to a variety of areas, including clustering of documents for information retrieval, inventory management, association rule mining, and outlier detection. Existing techniques that extract "frequent items" do not take care to address redundancy. An itemset is said to be redundant if their support can be derived from other existing itemsets using standard deduction techniques. The presence of such itemsets in the collection was found to increase the size of the collection of the mined frequent items. This in turn affects the memory and run time of the mining process. To overcome these drawbacks, this paper employs a list-based approach that generates only "Closed and Non-Derivable Itemsets." This is a collection of items 'I' such that 'I' has no superset with the same support as that of itself, and I belongs to the collection of those items whose lowest upper bound and highest lower bound values are not the same. With the help of experiments, it has been proved that the proposed approach addresses redundancy better by generating a compressed set of non-redundant frequent items.
UR - http://www.scopus.com/inward/record.url?scp=85087438371&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85087438371&partnerID=8YFLogxK
U2 - 10.1109/ICICCS48265.2020.9121071
DO - 10.1109/ICICCS48265.2020.9121071
M3 - Conference contribution
AN - SCOPUS:85087438371
T3 - Proceedings of the International Conference on Intelligent Computing and Control Systems, ICICCS 2020
SP - 52
EP - 58
BT - Proceedings of the International Conference on Intelligent Computing and Control Systems, ICICCS 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 International Conference on Intelligent Computing and Control Systems, ICICCS 2020
Y2 - 13 May 2020 through 15 May 2020
ER -