A scalable FPGA based accelerator for Tiny-YOLO-v2 using OpenCL

Deep Convolution Neural Network (CNN) algorithm have recently gained popularity in many applications such as image classification, video analytic, object recognition and segmentation. Being compute-intensive and memory expensive, CNN computations are common accelerated by GPUs with high power dissip...

Full description

Saved in:

Bibliographic Details
Published in	International journal of reconfigurable and embedded systems Vol. 8; no. 3; p. 206
Main Authors	June Wai, Yap, Yussof, Zulkanain Mohd, Md Salim, Sani Irwan
Format	Journal Article
Language	English
Published	Yogyakarta IAES Institute of Advanced Engineering and Science 01.11.2019
Subjects	Algorithms Artificial neural networks Convolution Graphics processing units Image classification Image segmentation Object recognition Video data
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Deep Convolution Neural Network (CNN) algorithm have recently gained popularity in many applications such as image classification, video analytic, object recognition and segmentation. Being compute-intensive and memory expensive, CNN computations are common accelerated by GPUs with high power dissipations. Recent studies show implementation of CNN on FPGA and it gain higher advantage in term of energy-efficient and flexibility over Software-configurable-GPUs. The proposed framework is verified by implement Tiny-YOLO-v2 on De1SoC. The design development in this project is HLS approach to ease effort from writing complex RTL codes and provide fast verification through emulation and profiling tools provided in the OpenCL SDK. To best of our knowledge, this is the first implementation of Tiny-YOLO-v2 CNN based object detection algorithm on a small scale De1SoC board using Intel FPGA SDK for OpenCL approach.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2089-4864 2089-4864
DOI:	10.11591/ijres.v8.i3.pp206-214