A Multilayer Perceptron Training Accelerator using Systolic Array

As the volume and type of network traffic increase, there is a growing demand for machine learning to detect attacks. There is a use to train a model in a short time. For example, a network intrusion detection (NID) system needs to learn new attack patterns that constantly occur in a short period. We propose a training accelerator as a systolic array on a Xilinx U50 Alveo FPGA card to solve this problem. We found that the accuracy is almost the same as conventional training even when the forward and backward paths are run simultaneously by delaying the weight update. Compared to the Intel Core i9 CPU and NVIDIA RTX 3090 GPU, it was three times faster than the CPU and 2.5 times faster than the GPU. The processing speed per power consumption was 11.5 times better than the CPU and 21.4 times better than the GPU. From these results, we can conclude that implementing a training accelerator on FPGAs as a systolic array can achieve high speed and high energy efficiency.

Slides

A Multilayer Perceptron Training Accelerator using Systolic Array (application/pdf)

Download