Low Error-Rate Approximate Multiplier Design for DNNs with Hardware-Driven Co-Optimization

Details

Presenter(s)

Yao Lu

Affiliation: Affiliation

State Key Laboratory of ASIC and System, Fudan University
Country

View profile

Author(s)

Yao Lu

Affiliation: Affiliation

State Key Laboratory of ASIC and System, Fudan University

View profile

Jide Zhang

Affiliation: Affiliation

State Key Laboratory of ASIC and System, Fudan University

View profile

Su Zheng

Affiliation: Affiliation

State Key Laboratory of ASIC and System, Fudan University

View profile

Zhen Li

Affiliation: Affiliation

Beijing Institute of Technology

View profile

Lingli Wang

Affiliation: Affiliation

State Key Laboratory of ASIC and System, Fudan University

View profile

Abstract

In this paper, two approximate 3×3 multipliers are proposed and the synthesis results of ASAP-7nm process library justify that they can reduce the area by 31.38\\% and 36.17%, and the power consumption by 36.73% and 35.66% compared with the exact multiplier, respectively. They can be aggregated with a 2×2 multiplier to produce an 8×8 multiplier with low error-rate based on the distribution of DNN weights. We propose a hardware-driven software co-optimization method to improve the DNN accuracy by retraining. Based on the proposed two approximate 3-bit multipliers, three approximate 8-bit multipliers with low error-rate are designed for DNNs. Compared with the exact 8-bit unsigned multiplier, our best design has no DNN accuracy loss for LeNet on MNIST without the retraining. On CIFAR10, the best design has a 0.88% DNN accuracy loss on LeNet, which can be reduced to -0.2% after retraining. For larger DNNs, the DNN accuracy of the best design can achieve significant advantage over other approximate multipliers on CIFAR10.

Slides

Low Error-Rate Approximate Multiplier Design for DNNs with Hardware-Driven Co-Optimization (application/pdf)

Download