TY - GEN
T1 - A Dynamic Itemset Counting Based Two-Phase Algorithm for Mining High Utility Itemsets
AU - Anup Bhat, B.
AU - Harish, S. V.
AU - Geetha, M.
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/12
Y1 - 2018/12
N2 - High Utility Itemset Mining (HUIM) aids in the discovery of itemsets based on quantity and unit price of the items from a transactional database. Since its inception, HUIM has evolved as a generalized form of Frequent Itemset Mining (FIM). Unlike the support of an itemset which is antimonotone and is exploited in the algorithms for mining frequent itemsets, the utility measure is neither antimonotone nor monotone. This makes the problem of mining High Utility Itemsets (HUIs) interesting. In the current study, a novel method based on Dynamic Itemset Counting (DIC) has been proposed to optimize the Apriori-like Two-Phase (TP) algorithm for mining HUIs. Although, the TP algorithm uses antimonotonicity of Transaction Weighted Utility (TWU) of itemsets to prune the search space, the candidates are generated in a level-wise manner. This requires multiple database scans to test the candidates. The proposed method tests and generates higher order candidates at different stops during the database scan and segregates the itemsets for further evaluation. Experiments performed on real-time datasets show significant improvement in execution time of the DIC method when compared to the TP algorithm.
AB - High Utility Itemset Mining (HUIM) aids in the discovery of itemsets based on quantity and unit price of the items from a transactional database. Since its inception, HUIM has evolved as a generalized form of Frequent Itemset Mining (FIM). Unlike the support of an itemset which is antimonotone and is exploited in the algorithms for mining frequent itemsets, the utility measure is neither antimonotone nor monotone. This makes the problem of mining High Utility Itemsets (HUIs) interesting. In the current study, a novel method based on Dynamic Itemset Counting (DIC) has been proposed to optimize the Apriori-like Two-Phase (TP) algorithm for mining HUIs. Although, the TP algorithm uses antimonotonicity of Transaction Weighted Utility (TWU) of itemsets to prune the search space, the candidates are generated in a level-wise manner. This requires multiple database scans to test the candidates. The proposed method tests and generates higher order candidates at different stops during the database scan and segregates the itemsets for further evaluation. Experiments performed on real-time datasets show significant improvement in execution time of the DIC method when compared to the TP algorithm.
UR - http://www.scopus.com/inward/record.url?scp=85082586407&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85082586407&partnerID=8YFLogxK
U2 - 10.1109/INDICON45594.2018.8987024
DO - 10.1109/INDICON45594.2018.8987024
M3 - Conference contribution
AN - SCOPUS:85082586407
T3 - INDICON 2018 - 15th IEEE India Council International Conference
BT - INDICON 2018 - 15th IEEE India Council International Conference
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th IEEE India Council International Conference, INDICON 2018
Y2 - 16 December 2018 through 18 December 2018
ER -