CLAppED: A Design Framework for Implementing Cross-Layer Approximation in FPGA-based Embedded Systems

With the rising variation and complexity of embedded work-loads, FPGA-based systems are being increasingly used for many applications. The reconfigurability and high parallelism offered by FPGAs are used to enhance the overall performance of these applications. However, the resource constraints of e...

Full description

Saved in:
Bibliographic Details
Published in2021 58th ACM/IEEE Design Automation Conference (DAC) pp. 475 - 480
Main Authors Ullah, Salim, Sahoo, Siva Satyendra, Kumar, Akash
Format Conference Proceeding
LanguageEnglish
Published IEEE 05.12.2021
Subjects
Online AccessGet full text
DOI10.1109/DAC18074.2021.9586260

Cover

More Information
Summary:With the rising variation and complexity of embedded work-loads, FPGA-based systems are being increasingly used for many applications. The reconfigurability and high parallelism offered by FPGAs are used to enhance the overall performance of these applications. However, the resource constraints of embedded platforms can limit the performance in multiple ways. In recent years, Approximate Computing has emerged as a viable tool for improving the performance by utilizing reduced precision data structures and resource-optimized high-performance arithmetic operators. However, most of the related state-of-the-art research has mainly focused on utilizing approximate computing principles individually on different layers of the computing stack. Nonetheless, approximations across different layers of computing stack can substantially enhance the system's performance. To this end, we present a framework to enable the intelligent exploration and highly accurate identification of the feasible design points in the large design space enabled by cross-layer approximations. Our framework proposes a novel polynomial regression-based method to model approximate arithmetic operators. The proposed method enables machine learning models to better correlate approximate operators with their impact on an application's output quality. We use a 2D convolution operator as a test case and present the results for FPGA- based approximate hardware accelerators.
DOI:10.1109/DAC18074.2021.9586260