Skip to main navigation Skip to search Skip to main content

Parallelized Hybrid Sorting Using Quick and Insertion Sort for Big Data

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In the context of big data, efficient sorting of massive datasets is essential for optimal performance in data-intensive applications such as database management, data analytics and scientific computing. This paper proposes a parallelized hybrid sorting algorithm for optimizing the efficiency of sorting large-scale data by integrating Quick Sort and Insertion Sort. The hybrid approach utilizes the speed of Quick Sort with larger data partitions and applies Insertion Sort for efficiency on smaller, nearly sorted subarrays. To further improve the performance, two parallelization implementations using MPI and CUDA are carried out. The approaches that use MPI make use of distributed memory across multiple processes, which makes use of k-Way merge using Min-Heap at the root for efficient consolidation. In contrast, CUDA-based implementations utilize GPU parallelism, in which threads are independently handling data segments and the final merge is done by using k-Way merge using Min-Heap. Time of computation and algorithm efficiency are measured for each method on large datasets. Comparison between sequential, MPI and CUDA executions show substantial performance improvements. For smaller datasets, such as 1000 elements, MPI results in an improvement of up to 141 times compared to sequential execution, while a speedup of up to 428 times is observed for larger datasets of 4 million elements with CUDA. The drastic improvement in performance noticed with the use of CUDA highlights the benefits of employing modern parallel and GPU-based methods to reduce computation time and enhance resource utilization.

Original languageEnglish
Title of host publicationRecent Trends in Artificial Intelligence and Data Sciences - Select Proceedings of the 15th International Conference, CONFLUENCE 2025
EditorsSumit Kumar, Garima Aggarwal, Bhuvan Unhelkar, Raju Pal
PublisherSpringer Science and Business Media Deutschland GmbH
Pages503-513
Number of pages11
ISBN (Print)9789819692026
DOIs
Publication statusPublished - 2025
Event15th International Conference on Recent Trends in Artificial Intelligence and Data Sciences, CONFLUENCE 2025 - New Delhi, India
Duration: 16-01-202517-01-2025

Publication series

NameLecture Notes in Electrical Engineering
Volume1447 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference15th International Conference on Recent Trends in Artificial Intelligence and Data Sciences, CONFLUENCE 2025
Country/TerritoryIndia
CityNew Delhi
Period16-01-2517-01-25

All Science Journal Classification (ASJC) codes

  • Industrial and Manufacturing Engineering

Fingerprint

Dive into the research topics of 'Parallelized Hybrid Sorting Using Quick and Insertion Sort for Big Data'. Together they form a unique fingerprint.

Cite this