
Elastic MapReduce for Scalable Image Processing in the Cloud

  • Rakesh S. Raj
  • M. P. Pavan Kumar
  • K. N. Manjunath
  • B. E. Rangaswamy
  • N. V. Shamna
  • N. B. Pradeep

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Raster image formats such as JPEG, PNG, and TIFF are frequently used in digital imaging for medical diagnostics, especially for chest X-rays. For effective preprocessing, annotation, and machine learning training, large-scale image collections must be organized by format and resolution. This study proposes a scalable method for sorting and storing raster medical images using Elastic MapReduce (EMR) from Amazon Web Services (AWS). The pipeline uses AWS S3 storage and the Hadoop MapReduce architecture to distribute the identification of image characteristics and to arrange the images into structured S3 paths. The results show fault tolerance, cost-effectiveness, and high throughput on datasets of hundreds of thousands of images. Elastic MapReduce has gained popularity as a framework for handling massive amounts of data because of its fault-tolerant, scalable, and economical infrastructure. This study examines optimization strategies, assesses performance under various workloads, and investigates the integration of image processing pipelines into EMR clusters. The findings demonstrate that EMR can significantly increase throughput and scalability for large-scale image processing tasks such as classification, feature extraction, and filtering.
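
The sorting step described in the abstract can be illustrated with a minimal local sketch. It is not the authors' implementation: the record gives no code, so the function names (`detect`, `map_image`, `plan_layout`) and the `organized/<format>/<resolution>` path layout are assumptions made for illustration. The map step reads the format and resolution from the leading bytes of each image, and a stand-in for the reduce step groups source keys by destination prefix. It uses only the Python standard library and makes no AWS or Hadoop calls.

```python
import struct
from collections import defaultdict

PNG_MAGIC = b"\x89PNG\r\n\x1a\n"


def png_size(data):
    """Read (width, height) from the IHDR chunk that follows the PNG signature."""
    if len(data) < 24 or data[12:16] != b"IHDR":
        return None
    return struct.unpack(">II", data[16:24])


def jpeg_size(data):
    """Walk JPEG segments until a start-of-frame marker and read (width, height)."""
    i = 2  # skip the SOI marker (FF D8)
    while i + 9 <= len(data):
        if data[i] != 0xFF:
            return None
        marker = data[i + 1]
        # SOF markers are C0..CF except C4 (DHT), C8 (JPG) and CC (DAC).
        if 0xC0 <= marker <= 0xCF and marker not in (0xC4, 0xC8, 0xCC):
            height, width = struct.unpack(">HH", data[i + 5:i + 9])
            return (width, height)
        seg_len = struct.unpack(">H", data[i + 2:i + 4])[0]
        i += 2 + seg_len
    return None


def detect(data):
    """Return (format, (width, height) or None) based on magic bytes."""
    if data.startswith(PNG_MAGIC):
        return "png", png_size(data)
    if data.startswith(b"\xff\xd8"):
        return "jpeg", jpeg_size(data)
    if data[:4] in (b"II*\x00", b"MM\x00*"):
        # TIFF dimensions live in the IFD; not parsed in this sketch.
        return "tiff", None
    return "unknown", None


def map_image(key, data):
    """Map step: emit (destination prefix, source key) for one image."""
    fmt, size = detect(data)
    res = f"{size[0]}x{size[1]}" if size else "unknown-resolution"
    # Hypothetical layout: organized/<format>/<width>x<height>
    return f"organized/{fmt}/{res}", key


def plan_layout(objects):
    """Local stand-in for shuffle/reduce: group source keys by destination prefix."""
    groups = defaultdict(list)
    for key, data in objects:
        prefix, src = map_image(key, data)
        groups[prefix].append(src)
    return {prefix: sorted(keys) for prefix, keys in groups.items()}
```

On a real cluster the map function would read object bytes from S3 and the reduce side would issue the copies into the structured paths; here `plan_layout` only computes which source key belongs under which prefix, which keeps the sketch runnable without any cloud credentials.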

Original language: English
Title of host publication: 2025 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2025 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 43-48
Number of pages: 6
ISBN (Electronic): 9798331538989
Publication status: Published - 2025
Event: 9th IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2025 - Mangalore, India
Duration: 17-10-2025 to 18-10-2025

Publication series

Name: 2025 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2025 - Proceedings

Conference

Conference: 9th IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2025
Country/Territory: India
City: Mangalore
Period: 17-10-25 to 18-10-25

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Hardware and Architecture
  • Electrical and Electronic Engineering
