REDRAW: Fast and Efficient Hardware Accelerator with Reduced Reads and Writes for 3D UNet

Mekie, Joycee

doi:10.23919/DATE56975.2023.10137190

REDRAW: Fast and Efficient Hardware Accelerator with Reduced Reads and Writes for 3D UNet

Source

Proceedings Design Automation and Test in Europe Date

ISSN

15301591

Date Issued

2023-01-01

Author(s)

Glint, Tom

Awasthi, Manu

Mekie, Joycee

DOI

10.23919/DATE56975.2023.10137190

Volume

2023-April

Abstract

Hardware Accelerators (HAs) proposed so far have been designed with a focus on 2D Convolutional Neural Networks (CNNs) and 3D CNNs using temporal data. To the best of our knowledge, there is no existing HA for 3D CNNs using spatial data. 3D UNet is a 3D CNN with significant applications in the medical domain. However, the total on-chip buffer size (>20 MB) required for the complete stationery approach of processing 3D UNet is cost prohibitive. In this work, we analyze the 3D UNet workload and propose a HA with an optimized memory hierarchy with a total on-chip buffer of less than 4 MB while conceding near theoretical minimum memory accesses required for processing 3D UNet. We demonstrate the efficiency of the proposed HA by comparing it with SOTA Simba architecture with the same number of MAC Units and show a 1.3× increase in TOPSwatt for an ISO-area design. Further, we revise the proposed architecture to increase the ratio of compute operations to memory operations and to meet the latency requirement of 3D UNet-based embedded applications. The revised architecture, compared against a dual instance of Simba, has similar latency. Against the dual instance of Simba, the proposed architecture achieves a 1.8 × increase in TOPS/watt in a similar area.

Unpaywall

URI

https://d8.irins.org/handle/IITG2025/26984

Subjects

3D UNet | DNN Accelerator | Modeling