TY - GEN
T1 - Preprocessing of Datasets Using Sequential and Parallel Approach
T2 - International Conference on Expert Clouds and Applications, ICOECA 2021
AU - Rai, Shwetha
AU - Geetha, M.
AU - Kumar, Preetham
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2022
Y1 - 2022
N2 - Data preprocessing is a technique in data mining to make the data read for further processing according to the requirement. Preprocessing is required because the data might be incomplete, redundant, come from different sources which may require aggregation, etc., and data can be processed either sequentially or in parallel. There are several parallel frameworks such as Hadoop, MPI, and CUDA to process the data. A survey has been done to understand these parallel frameworks, and a comparison between sequential and parallel approach is carried out to compare the efficiency of the two approaches.
AB - Data preprocessing is a technique in data mining to make the data read for further processing according to the requirement. Preprocessing is required because the data might be incomplete, redundant, come from different sources which may require aggregation, etc., and data can be processed either sequentially or in parallel. There are several parallel frameworks such as Hadoop, MPI, and CUDA to process the data. A survey has been done to understand these parallel frameworks, and a comparison between sequential and parallel approach is carried out to compare the efficiency of the two approaches.
UR - http://www.scopus.com/inward/record.url?scp=85113356430&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85113356430&partnerID=8YFLogxK
U2 - 10.1007/978-981-16-2126-0_27
DO - 10.1007/978-981-16-2126-0_27
M3 - Conference contribution
AN - SCOPUS:85113356430
SN - 9789811621253
T3 - Lecture Notes in Networks and Systems
SP - 311
EP - 320
BT - Expert Clouds and Applications - Proceedings of ICOECA 2021
A2 - Jeena Jacob, I.
A2 - Gonzalez-Longatt, Francisco M.
A2 - Kolandapalayam Shanmugam, Selvanayaki
A2 - Izonin, Ivan
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 18 February 2021 through 19 February 2021
ER -