TY - GEN
T1 - An Exploration of Accuracy Configurable Matrix Multiply-Addition Architectures using HLS
AU - Leon-Vega, Luis G.
AU - Salazar-Villalobos, Eduardo
AU - Castro-Godinez, Jorge
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Low-power consumption and constraint resources limit the implementation of deep learning inference solutions at the edge. Besides, the approximate computing paradigm reports promising techniques for the design of DNN accelerators to deal with inherent limitations of the edge. This paper summarises the automatic generation of generic matrix multiplication-addition (GEMMA) processing elements (PEs), leveraging High-Level Synthesis and emphasising in adaptable matrix size, data bit-width, and data type for accuracy configuration, and their impact on the overall design resource consumption. For generated PEs efficiency evaluation, this work presents a novel Figure of merit that considers computing performance and resource utilisation regarding the FPGA platform underneath. Finally, we analyse the impact of different design configurations in the numerical errors introduced due to the output bit-width preservation regarding the input, and matrix size, data bit-width and type configuration.
AB - Low-power consumption and constraint resources limit the implementation of deep learning inference solutions at the edge. Besides, the approximate computing paradigm reports promising techniques for the design of DNN accelerators to deal with inherent limitations of the edge. This paper summarises the automatic generation of generic matrix multiplication-addition (GEMMA) processing elements (PEs), leveraging High-Level Synthesis and emphasising in adaptable matrix size, data bit-width, and data type for accuracy configuration, and their impact on the overall design resource consumption. For generated PEs efficiency evaluation, this work presents a novel Figure of merit that considers computing performance and resource utilisation regarding the FPGA platform underneath. Finally, we analyse the impact of different design configurations in the numerical errors introduced due to the output bit-width preservation regarding the input, and matrix size, data bit-width and type configuration.
KW - AI accelerators
KW - approximate computing
KW - design automation
KW - High-Level Synthesis
UR - http://www.scopus.com/inward/record.url?scp=85137670951&partnerID=8YFLogxK
U2 - 10.1109/DCAS53974.2022.9845501
DO - 10.1109/DCAS53974.2022.9845501
M3 - Contribución a la conferencia
AN - SCOPUS:85137670951
T3 - Proceedings of the 2022 IEEE Dallas Circuits and Systems Conference, DCAS 2022
BT - Proceedings of the 2022 IEEE Dallas Circuits and Systems Conference, DCAS 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th IEEE Dallas Circuits and Systems Conference, DCAS 2022
Y2 - 17 June 2022 through 19 June 2022
ER -