Title: FP-ATM: A Flexible Floating Point NOR Adder Tree Macro for In-Memory Computing
Authors: Attuluri, Yathin Kumar; Chudasama, Ruchit; Prasad, Kailash; Mekie, Joycee
Type: Conference Paper
Source type: Conference Proceeding
Pages: 247-252
Publication date: 2024-01-01
Record date: 2025-08-31
ISBN: 9798350384406
DOI: 10.1109/VLSID60093.2024.00047
Scopus ID: 2-s2.0-85190373746
URL: https://d8.irins.org/handle/IITG2025/29161
Keywords: Digital In-Memory Computing | Floating Point | NOR Adder Tree Macro

Abstract: In-memory computing (IMC) has emerged as a promising approach to addressing the von Neumann bottleneck in deep learning applications. This work proposes FP-ATM, a 6T SRAM-based, all-digital macro for multiply-accumulate (MAC) operations that features a flexible NOR adder tree for in-memory computing. The macro is data-aware and supports input activations and weights in the INT8 and BF16 number formats in a convolutional neural network. Multiple macros can be combined in different configurations to support neural networks with different topologies. The macro is based on bit-serial multiplication and parallel adder trees, an architecture that achieves massively parallel MAC operations with high energy efficiency and throughput. The macro achieves a peak energy efficiency of 267.7 TFLOPS/W at 0.65 V, 8.5 times that of the state-of-the-art work. The maximum frequency is 1.67 GHz, and the macro achieves a throughput of 2.67 GFLOPS/Kb at a voltage of 0.9 V.
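
The abstract names bit-serial multiplication feeding parallel adder trees as the basis of the macro. The following is a minimal behavioural sketch of that general technique for signed integer activations, not the authors' circuit: the function names `adder_tree` and `bit_serial_mac`, the LSB-first bit order, and the two's-complement handling of the sign bit are all assumptions made for illustration, and the BF16 path and the NOR-based logic are not modelled.

```python
def adder_tree(values):
    """Sum a list by pairwise reduction, mirroring a binary adder tree."""
    level = list(values)
    if not level:
        return 0
    while len(level) > 1:
        if len(level) % 2:
            level.append(0)  # pad an odd level with a zero operand
        level = [level[i] + level[i + 1] for i in range(0, len(level), 2)]
    return level[0]


def bit_serial_mac(activations, weights, bits=8):
    """Dot product computed one activation bit per step (hypothetical model).

    Activations are signed integers that fit in `bits` bits (two's complement);
    weights are plain Python integers.
    """
    if len(activations) != len(weights):
        raise ValueError("activations and weights must have the same length")
    lo, hi = -(1 << (bits - 1)), (1 << (bits - 1)) - 1
    if any(not lo <= a <= hi for a in activations):
        raise ValueError(f"activations must fit in {bits} signed bits")

    mask = (1 << bits) - 1
    acc = 0
    for b in range(bits):  # assumed order: least significant bit first
        # 1-bit multiply: bit b of each activation gates its weight.
        partial = [w if ((a & mask) >> b) & 1 else 0
                   for a, w in zip(activations, weights)]
        column_sum = adder_tree(partial)
        # In two's complement the most significant bit has negative weight.
        scale = -(1 << b) if b == bits - 1 else (1 << b)
        acc += scale * column_sum
    return acc


if __name__ == "__main__":
    acts = [3, -5, 127, -128]
    wts = [2, 7, -1, 4]
    # The bit-serial result should match a direct dot product.
    print(bit_serial_mac(acts, wts) == sum(a * w for a, w in zip(acts, wts)))
```

Each loop iteration stands in for one step in which every stored weight sees a single activation bit, so the per-step work is only a gate and an addition; the shift-and-accumulate at the end of the step restores the bit's positional weight.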