A Model of Dual Fabry-Perot Etalon-Based External-Cavity Tunable Laser Us...
Internal motion within pulsating pure-quartic soliton molecules in a fibe...
Enhanced light emission of germanium light-emitting-diode on 150 mm germa...
The Fabrication of GaN Nanostructures Using Cost-Effective Methods for Ap...
Negative-to-Positive Tunnel Magnetoresistance in van der Waals Fe3GeTe2/C...
Quantum Light Source Based on Semiconductor Quantum Dots: A Review
A High-Reliability RF MEMS Metal-Contact Switch Based on Al-Sc Alloy
Development of a Mode-Locked Fiber Laser Utilizing a Niobium Diselenide S...
Development of Multiple Fano-Resonance-Based All-Dielectric Metastructure...
Traffic Vibration Signal Analysis of DAS Fiber Optic Cables with Differen...
官方微信
友情链接

A Programmable and Flexible Vision Processor

2022-09-19

 

Author(s): Luo, Q (Luo, Qian); Yao, CH (Yao, Chunhe); Ning, K (Ning, Ke); Zheng, XM (Zheng, Xuemin); Zhao, MX (Zhao, Mingxin); Cheng, L (Cheng, Li); Yu, SM (Yu, Shuangming); Liu, J (Liu, Jian); Wu, NJ (Wu, Nanjian); Liu, LY (Liu, Liyuan)

Source: IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS Volume: 69 Issue: 9 Pages: 3884-3888 DOI: 10.1109/TCSII.2022.3181161 Published: SEP 2022

Abstract: Vision chips perform image capture and real-time intelligent image processing by integrating an imager and a vision processor on a single chip, having broad application prospects. This brief proposes a programmable and flexible vision processor with a dual-issue micro-architecture. The processor consists of a reconfigurable vector unit, a flexible memory access network, and a non-maximum suppression (NMS) block. It can efficiently implement both deep neural network (DNN) and traditional computer vision (CV) algorithms. The vector unit performs single-instruction multiple-vector (SIMV) parallel operations with reconfigurable vector width. The flexible memory access network adaptively supports multiple vector operations under different vector widths. A four-MAC processing element (PE) in the vector unit is designed to increase computational power and data reuse rate. The NMS block can speed up the object location processing of the detection networks. The chip is fabricated in a 28nm process. The experimental results show that the maximum clock frequency, peak performance, and peak energy efficiency are 600MHz, 1.2TOPS, and 2.03TOPS/W, respectively. The Mobilenet-Vl processing achieves a throughput of 404 fps under a 256x224 image size and an 87.15%(top-5) accuracy on the ImageNet dataset.

Accession Number: WOS:000848263100056

ISSN: 1549-7747

eISSN: 1558-3791

Full Text: https://ieeexplore.ieee.org/document/9790848



关于我们
下载视频观看
联系方式
通信地址

北京市海淀区清华东路甲35号(林大北路中段) 北京912信箱 (100083)

电话

010-82304210/010-82305052(传真)

E-mail

semi@semi.ac.cn

交通地图
版权所有 中国科学院半导体研究所

备案号:京ICP备05085259-1号 京公网安备110402500052 中国科学院半导体所声明