TY - GEN
T1 - Data Regeneration from Poisoned Datasets
AU - Bannur, Charvi
AU - Bhat, Chaitra
AU - Bharadwaj, Ishita
AU - Singh, Kushagra
AU - Kulkarni, Shrirang Ambaji
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - The administration of data in soft copies is growing increasingly popular as organizations switch from keeping documents in hard copies to more contemporary information handling systems that enable faster sharing and exchanging of data. Unfortunately, as the data has gotten easier to manage, modify, and store, it has also become easier to lose or corrupt them. It is well known that many approaches, including Autoencoders, K means, and generative models, can identify noise and regenerate data in response. Using the unique modified Adamic Adar technique, which was previously used for link prediction in the context of graphs, we tackled the aforementioned issue in this study. On the same medium-sized dataset, our suggested solution with an accuracy of 78% outperformed more established techniques like K means, which had an efficiency of only 50%. We further regenerated them using several regression and deep learning models. The research also discusses the extension of GANs, which are well renowned for their effectiveness in image generation, to lost data regeneration.
AB - The administration of data in soft copies is growing increasingly popular as organizations switch from keeping documents in hard copies to more contemporary information handling systems that enable faster sharing and exchanging of data. Unfortunately, as the data has gotten easier to manage, modify, and store, it has also become easier to lose or corrupt them. It is well known that many approaches, including Autoencoders, K means, and generative models, can identify noise and regenerate data in response. Using the unique modified Adamic Adar technique, which was previously used for link prediction in the context of graphs, we tackled the aforementioned issue in this study. On the same medium-sized dataset, our suggested solution with an accuracy of 78% outperformed more established techniques like K means, which had an efficiency of only 50%. We further regenerated them using several regression and deep learning models. The research also discusses the extension of GANs, which are well renowned for their effectiveness in image generation, to lost data regeneration.
UR - https://www.scopus.com/pages/publications/85150048561
UR - https://www.scopus.com/pages/publications/85150048561#tab=citedBy
U2 - 10.1109/ICRAIE56454.2022.10054302
DO - 10.1109/ICRAIE56454.2022.10054302
M3 - Conference contribution
AN - SCOPUS:85150048561
T3 - 7th IEEE International Conference on Recent Advances and Innovations in Engineering, ICRAIE 2022 - Proceedings
SP - 47
EP - 52
BT - 7th IEEE International Conference on Recent Advances and Innovations in Engineering, ICRAIE 2022 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th IEEE International Conference on Recent Advances and Innovations in Engineering, ICRAIE 2022
Y2 - 1 December 2022 through 3 December 2022
ER -