Data Regeneration from Poisoned Datasets

  • Charvi Bannur*
  • , Chaitra Bhat
  • , Ishita Bharadwaj
  • , Kushagra Singh
  • , Shrirang Ambaji Kulkarni
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The administration of data in soft copies is growing increasingly popular as organizations switch from keeping documents in hard copies to more contemporary information handling systems that enable faster sharing and exchanging of data. Unfortunately, as the data has gotten easier to manage, modify, and store, it has also become easier to lose or corrupt them. It is well known that many approaches, including Autoencoders, K means, and generative models, can identify noise and regenerate data in response. Using the unique modified Adamic Adar technique, which was previously used for link prediction in the context of graphs, we tackled the aforementioned issue in this study. On the same medium-sized dataset, our suggested solution with an accuracy of 78% outperformed more established techniques like K means, which had an efficiency of only 50%. We further regenerated them using several regression and deep learning models. The research also discusses the extension of GANs, which are well renowned for their effectiveness in image generation, to lost data regeneration.

Original languageEnglish
Title of host publication7th IEEE International Conference on Recent Advances and Innovations in Engineering, ICRAIE 2022 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages47-52
Number of pages6
ISBN (Electronic)9781665489102
DOIs
Publication statusPublished - 2022
Event7th IEEE International Conference on Recent Advances and Innovations in Engineering, ICRAIE 2022 - Mangalore, India
Duration: 01-12-202203-12-2022

Publication series

Name7th IEEE International Conference on Recent Advances and Innovations in Engineering, ICRAIE 2022 - Proceedings

Conference

Conference7th IEEE International Conference on Recent Advances and Innovations in Engineering, ICRAIE 2022
Country/TerritoryIndia
CityMangalore
Period01-12-2203-12-22

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Information Systems and Management
  • Energy Engineering and Power Technology
  • Engineering (miscellaneous)
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'Data Regeneration from Poisoned Datasets'. Together they form a unique fingerprint.

Cite this