Skip to main navigation Skip to search Skip to main content

Optimization of the Bitap Algorithm for High-Performance Pattern Matching in DNA Sequences

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Deoxyribonucleic acid (DNA) is the molecule that holds genetic information needed for the growth, development and functioning of an organism. The four types of nitrogenous bases found in nucleotides that make up each strand of DNA are: adenine (A), cytosine (C), guanine (G), or thymine (T). Efficient analysis of DNA sequencing is required since mutations in a DNA sequence can lead to genetic disorders or diseases. Bitap is an approximate string-matching algorithm that provides an effective solution for identifying such mutations in large genomic datasets. However, with an increase in data size, the performance of the sequential Bitap algorithm also declines. This paper presents a parallelized approach to improve the efficiency of the Bitap algorithm by using the Message Passing Interface (MPI) for distributed memory systems and Compute Unified Device Architecture (CUDA) for parallel computation on Graphics Processing Units (GPUs). This study focuses on comparison of execution times of three implementations of the Bitap algorithm for mutation detection: the sequential implementation, the MPI-based parallelization, and the CUDA-based GPU implementation. Experimental results indicate that the parallel approaches achieve significant performance improvements as compared to the sequential implementation, thereby enabling faster processing and better scalability for large DNA datasets.

Original languageEnglish
Title of host publicationRecent Trends in Artificial Intelligence and Data Sciences - Select Proceedings of the 15th International Conference, CONFLUENCE 2025
EditorsSumit Kumar, Garima Aggarwal, Bhuvan Unhelkar, Raju Pal
PublisherSpringer Science and Business Media Deutschland GmbH
Pages515-526
Number of pages12
ISBN (Print)9789819692026
DOIs
Publication statusPublished - 2025
Event15th International Conference on Recent Trends in Artificial Intelligence and Data Sciences, CONFLUENCE 2025 - New Delhi, India
Duration: 16-01-202517-01-2025

Publication series

NameLecture Notes in Electrical Engineering
Volume1447 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference15th International Conference on Recent Trends in Artificial Intelligence and Data Sciences, CONFLUENCE 2025
Country/TerritoryIndia
CityNew Delhi
Period16-01-2517-01-25

All Science Journal Classification (ASJC) codes

  • Industrial and Manufacturing Engineering

Fingerprint

Dive into the research topics of 'Optimization of the Bitap Algorithm for High-Performance Pattern Matching in DNA Sequences'. Together they form a unique fingerprint.

Cite this