Abstract
Internet of Things (IoT) edge intelligence has emerged by optimizing the deep learning (DL) models deployed on resource-constraint devices for quick decision-making. In addition, edge intelligence reduces network overload and latency by bringing intelligent analytics closer to the source. On the other hand, DL models need a lot of computing resources. As a result, they have high computational workloads and memory footprint, making it impractical to deploy and execute on IoT edge devices with limited capabilities. In addition, existing layer-based partitioning methods generate many intermediate results, resulting in a huge memory footprint. In this article, we propose a framework to provide a comprehensive solution that enables the deployment of convolutional neural networks (CNNs) onto distributed IoT devices for faster inference and reduced memory footprint. This framework considers a pretrained YOLOv2 model, and a weight pruning technique is applied to the pre-trained model to reduce the number of non-contributing parameters. We use the fused layer partitioning method to vertically partition the fused layers of the CNN and then distribute the partition among the edge devices to process the input. In our experiment, we have considered multiple Raspberry Pi as edge devices. Raspberry Pi with a neural computing stick is a gateway device to combine the results from various edge devices and get the final output. Our proposed model achieved inference latency of 5 to (Formula presented.) 7 seconds for (Formula presented.) to (Formula presented.) fused layer partitioning for five devices with a 9% improvement in memory footprint.
| Original language | English |
|---|---|
| Article number | e4648 |
| Journal | Transactions on Emerging Telecommunications Technologies |
| Volume | 33 |
| Issue number | 12 |
| DOIs | |
| Publication status | Published - 12-2022 |
All Science Journal Classification (ASJC) codes
- Electrical and Electronic Engineering
Fingerprint
Dive into the research topics of 'Memory optimization at Edge for Distributed Convolution Neural Network'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver