NIMA: near in-memory high-precision accumulation unit for heterogeneous analog/digital deep learning acceleration

Warning The system is temporarily closed to updates for reporting purpose.

Şanlı, İrem and Ferro, Elena and Vasilopoulos, Athanasios and Boesch, Thomas and Sebastian, Abu and Boybat, Irem (2025) NIMA: near in-memory high-precision accumulation unit for heterogeneous analog/digital deep learning acceleration. In: IEEE International Symposium on Circuits and Systems (ISCAS), London, United Kingdom

Full text not available from this repository. (Request a copy)

Abstract

Analog In-Memory Computing (AIMC) crossbars often face size limitations that hinder mapping entire neural network layers onto a single AIMC tile. To overcome this, tall layers are typically distributed across multiple tiles or within a single tile by multiplexing different sets of weights at various time intervals. However, this generates partial vector-matrix multiplication (VMM) results that need to be accumulated to produce the final output, underscoring the need for integrated accumulation capabilities in AIMC systems. In this work, we propose a Near In-Memory High-Precision Accumulation Unit (NIMA) with built-in internal and external tile accumulation functionality, positioned at the periphery of the AIMC crossbar. This unit leverages the affine correction capabilities of existing systems and ensures high precision during partial VMM accumulation. We have physically implemented NIMA in a 14nm CMOS technology, providing comprehensive performance and area evaluations and comparisons with prior art. We investigate and compare different layer mappings and accumulation schemes supported by the proposed unit, evaluating both latency and Mean Absolute Error (MAE). We further assess the precision of the proposed unit on large layers of ResNet9 and ResNet32 for image classification on CIFAR10/CIFAR100 datasets.
Item Type: Papers in Conference Proceedings
Uncontrolled Keywords: Analog in-memory computing; deep neural network inference; near-memory computing; partial sum accumulation
Divisions: Faculty of Engineering and Natural Sciences
Depositing User: İrem Şanlı
Date Deposited: 03 Sep 2025 15:00
Last Modified: 03 Sep 2025 15:00
URI: https://research.sabanciuniv.edu/id/eprint/52105

Actions (login required)

View Item
View Item