Institute of Semiconductors, Chinese Academy of Sciences

Optimizing FPGA-based DCN Accelerator with On-Chip Dataflow Reordering and Serial-Parallel Computing Array

2024-05-14

Zhang, Ming; Xu, Jian; He, Jinzhong; Qin, Hong. Source: 2023 International Conference on High Performance Big Data and Intelligent Systems (HDIS 2023), pp. 118-123, 2023.

Abstract:

Deep convolutional neural networks (DCNs) have recently developed rapidly toward lightweight models and edge deployment. However, DCN accelerators struggle to balance computation against data bandwidth, which leads to inefficient computation and high hardware cost. In addition, the variety of network structures makes it difficult to design and reconfigure accelerators flexibly. To address these issues, this paper proposes a parallel-serial channel accelerator system that resolves both the low multiplier utilization caused by small channel counts and the inadequate bandwidth of fully connected layers. The results show that the proposed accelerator maintains high computational performance and efficiency on typical DCNs. Implemented on a Xilinx VCU128 at 200 MHz, it reaches a peak computational performance of 204.5 GOPS, an efficiency of 0.37 GOPS/DSP, and a maximum computing-array utilization of 99.63%, surpassing previous works.
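The figures in the abstract can be related with some simple arithmetic. The sketch below is illustrative only and is not taken from the paper: `implied_dsp_count` simply divides the reported peak throughput by the reported per-DSP efficiency, and `channel_utilization` assumes a hypothetical array with a fixed number of parallel channel lanes to show why layers with few channels can leave multipliers idle. The lane count and the mapping model are assumptions, not the authors' architecture.

```python
import math


def implied_dsp_count(peak_gops: float, gops_per_dsp: float) -> float:
    """DSP count implied by dividing peak throughput by per-DSP efficiency."""
    return peak_gops / gops_per_dsp


def channel_utilization(channels: int, lanes: int) -> float:
    """Fraction of multiplier lanes doing useful work.

    Assumes (hypothetically) that channels are processed in passes of
    `lanes` channels each, and that lanes in a partly filled pass sit idle.
    """
    if channels <= 0 or lanes <= 0:
        raise ValueError("channels and lanes must be positive")
    passes = math.ceil(channels / lanes)
    return channels / (passes * lanes)


if __name__ == "__main__":
    # Figures reported in the abstract: 204.5 GOPS peak, 0.37 GOPS/DSP.
    print(f"implied DSP count: {implied_dsp_count(204.5, 0.37):.1f}")

    # Hypothetical 16-lane array: a 3-channel input layer versus a
    # 64-channel layer.
    for c in (3, 20, 64):
        print(f"channels={c:3d} utilization={channel_utilization(c, 16):.4f}")
```

Under this simplified model, a 3-channel layer on a 16-lane array keeps fewer than a fifth of the lanes busy, which is the kind of small-channel inefficiency the abstract says the parallel-serial design targets.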
