Skip to main navigation Skip to search Skip to main content

Memory optimization at Edge for Distributed Convolution Neural Network

Research output: Contribution to journalArticlepeer-review

Abstract

Internet of Things (IoT) edge intelligence has emerged by optimizing the deep learning (DL) models deployed on resource-constraint devices for quick decision-making. In addition, edge intelligence reduces network overload and latency by bringing intelligent analytics closer to the source. On the other hand, DL models need a lot of computing resources. As a result, they have high computational workloads and memory footprint, making it impractical to deploy and execute on IoT edge devices with limited capabilities. In addition, existing layer-based partitioning methods generate many intermediate results, resulting in a huge memory footprint. In this article, we propose a framework to provide a comprehensive solution that enables the deployment of convolutional neural networks (CNNs) onto distributed IoT devices for faster inference and reduced memory footprint. This framework considers a pretrained YOLOv2 model, and a weight pruning technique is applied to the pre-trained model to reduce the number of non-contributing parameters. We use the fused layer partitioning method to vertically partition the fused layers of the CNN and then distribute the partition among the edge devices to process the input. In our experiment, we have considered multiple Raspberry Pi as edge devices. Raspberry Pi with a neural computing stick is a gateway device to combine the results from various edge devices and get the final output. Our proposed model achieved inference latency of 5 to (Formula presented.) 7 seconds for (Formula presented.) to (Formula presented.) fused layer partitioning for five devices with a 9% improvement in memory footprint.

Original languageEnglish
Article numbere4648
JournalTransactions on Emerging Telecommunications Technologies
Volume33
Issue number12
DOIs
Publication statusPublished - 12-2022

All Science Journal Classification (ASJC) codes

  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Memory optimization at Edge for Distributed Convolution Neural Network'. Together they form a unique fingerprint.

Cite this