TY - GEN
T1 - Improving Performance of Error-Tolerant Applications
T2 - 5th Jornadas Costarricenses de Investigacion en Computacion e Informatica, JoCICI 2021 - 5th Costa Rican Conference on Research in Computing and Informatics, JoCICI 2021
AU - Gonzalez-Aragon, Tomas
AU - Castro-Godinez, Jorge
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Trending workloads and applications are leading many of the new advances in computer architecture and design paradigms. For instance, deep learning applications are transforming the way we do computing. On one hand, specific architectures are currently commercialized as neural processing units, specialized hardware accelerators for these types of applications, achieving significant performance improvements. On the other hand, design paradigms, such as approximate computing, exploit existing inherent tolerance to imprecise computations in these applications to reduce their computation complexity and produce energy-efficient implementations. Relevant available approximations are limited to the software layer to improve the performance of deep learning applications when using an off-the-shelf specialized accelerator alongside edge computing platforms. In this work, we present a case study of performance improvement by introducing approximate computing techniques to three deep learning classification applications. Our test platform is a Raspberry Pi 4, as edge computing device, and a Movidius Myriad X, as neural accelerator. Our experimental results show that using a mixture of approximate techniques can achieve a performance improvement from 20x to 48x with no accuracy degradation for a compute-intensive classification application.
AB - Trending workloads and applications are leading many of the new advances in computer architecture and design paradigms. For instance, deep learning applications are transforming the way we do computing. On one hand, specific architectures are currently commercialized as neural processing units, specialized hardware accelerators for these types of applications, achieving significant performance improvements. On the other hand, design paradigms, such as approximate computing, exploit existing inherent tolerance to imprecise computations in these applications to reduce their computation complexity and produce energy-efficient implementations. Relevant available approximations are limited to the software layer to improve the performance of deep learning applications when using an off-the-shelf specialized accelerator alongside edge computing platforms. In this work, we present a case study of performance improvement by introducing approximate computing techniques to three deep learning classification applications. Our test platform is a Raspberry Pi 4, as edge computing device, and a Movidius Myriad X, as neural accelerator. Our experimental results show that using a mixture of approximate techniques can achieve a performance improvement from 20x to 48x with no accuracy degradation for a compute-intensive classification application.
KW - Approximate computing
KW - deep learning
KW - edge computing
KW - neural accelerator
UR - http://www.scopus.com/inward/record.url?scp=85133495712&partnerID=8YFLogxK
U2 - 10.1109/JoCICI54528.2021.9794353
DO - 10.1109/JoCICI54528.2021.9794353
M3 - Contribución a la conferencia
AN - SCOPUS:85133495712
T3 - Proceedings - 5th Jornadas Costarricenses de Investigacion en Computacion e Informatica, JoCICI 2021
BT - Proceedings - 5th Jornadas Costarricenses de Investigacion en Computacion e Informatica, JoCICI 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 25 October 2021 through 29 October 2021
ER -