Skip to main content
Video s3
    Details
    Presenter(s)
    Takeshi Senoo Headshot
    Display Name
    Takeshi Senoo
    Affiliation
    Affiliation
    Tokyo Institute of Technology
    Country
    Country
    Japan
    Author(s)
    Display Name
    Takeshi Senoo
    Affiliation
    Affiliation
    Tokyo Institute of Technology
    Display Name
    Akira Jinguji
    Affiliation
    Affiliation
    Tokyo Institute of Technology
    Display Name
    Ryosuke Kuramochi
    Affiliation
    Affiliation
    Tokyo Institute of Technology
    Display Name
    Hiroki Nakahara
    Affiliation
    Affiliation
    Tokyo Institute of Technology
    Abstract

    As the volume and type of network traffic increase, there is a growing demand for machine learning to detect attacks. There is a use to train a model in a short time. For example, a network intrusion detection (NID) system needs to learn new attack patterns that constantly occur in a short period. We propose a training accelerator as a systolic array on a Xilinx U50 Alveo FPGA card to solve this problem. We found that the accuracy is almost the same as conventional training even when the forward and backward paths are run simultaneously by delaying the weight update. Compared to the Intel Core i9 CPU and NVIDIA RTX 3090 GPU, it was three times faster than the CPU and 2.5 times faster than the GPU. The processing speed per power consumption was 11.5 times better than the CPU and 21.4 times better than the GPU. From these results, we can conclude that implementing a training accelerator on FPGAs as a systolic array can achieve high speed and high energy efficiency.

    Slides
    • A Multilayer Perceptron Training Accelerator using Systolic Array (application/pdf)