Publication Details

Cascaded Stripe Memory Engines for Multi-Scale Object Detection in FPGA

MUSIL, P.; JURÁNEK, R.; MUSIL, M.; ZEMČÍK, P. Cascaded Stripe Memory Engines for Multi-Scale Object Detection in FPGA. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, vol. 30, no. 1, p. 267-280. ISSN: 1051-8215.
Czech title
Kaskádová paměťová architektura pro detekci objektů různé velikosti ve videu v FPGA
Type
journal article
Language
English
Authors
URL
Keywords

Object Detection, AdaBoost, WaldBoost, Acceleration, FPGA

Abstract

Object detection in embedded systems is important for many contemporary applications that involve vision and scene analysis. In this paper, we propose a novel architecture for object detection implemented in FPGA, based on the Stripe Memory Engine (SME), and point out shortcomings of existing architectures. SME processes a stream of image data so that it stores a narrow stripe of the input image and its scaled versions and uses a detector unit which is efficiently pipelined across multiple image positions within the SME. We show how to process images with up to 4K resolution at high framerates using cascades of SMEs. As a detector algorithm, the SMEs use boosted soft cascade with simple image features that require only pixel comparisons and look-up tables; therefore, they are well suitable for hardware implemenation. We describe the components of our architecture and compare it to several published works in several configurations. As an example, we implemented face detection and license plate detection applications that work with HD images (1280×720 pixels) running at over 60 frames per second on Xilinx Zynq platform. We analyzed their power consumption, evaluated the accuracy of our detectors, and compared them to Haar Cascades from OpenCV that are often used by other authors. We show that our detectors offer better accuracy as well as performance at lower power consumption.

Published
2020
Pages
267–280
Journal
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, vol. 30, no. 1, ISSN 1051-8215
DOI
UT WoS
000521641800022
EID Scopus
BibTeX
@article{BUT161413,
  author="Petr {Musil} and Roman {Juránek} and Martin {Musil} and Pavel {Zemčík}",
  title="Cascaded Stripe Memory Engines for Multi-Scale Object Detection in FPGA",
  journal="IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY",
  year="2020",
  volume="30",
  number="1",
  pages="267--280",
  doi="10.1109/TCSVT.2018.2886476",
  issn="1051-8215",
  url="https://ieeexplore.ieee.org/document/8573854"
}
Files
Back to top