Skip to main content
Video s3
    Details
    Author(s)
    Display Name
    Hsinyu Tsai
    Affiliation
    Affiliation
    IBM Research - Almaden
    Display Name
    Pritish Narayanan
    Affiliation
    Affiliation
    IBM Research - Almaden
    Display Name
    Shubham Jain
    Affiliation
    Affiliation
    IBM Corporation
    Display Name
    Stefano Ambrogio
    Affiliation
    Affiliation
    IBM Research - Almaden
    Display Name
    Kohji Hosokawa
    Affiliation
    Affiliation
    IBM Tokyo Research Laboratory
    Display Name
    Masatoshi Ishii
    Affiliation
    Affiliation
    IBM Research - Tokyo
    Display Name
    Charles Mackin
    Affiliation
    Affiliation
    IBM Research - Almaden
    Display Name
    Ching-Tzu Chen
    Affiliation
    Affiliation
    IBM Thomas J. Watson Research Center
    Display Name
    Atsuya Okazaki
    Affiliation
    Affiliation
    IBM Research - Tokyo
    Display Name
    Akiyo Nomura
    Affiliation
    Affiliation
    IBM Research - Tokyo
    Display Name
    Irem Boybat
    Affiliation
    Affiliation
    IBM Research - Zurich
    Affiliation
    Affiliation
    IBM Thomas J. Watson Research Center
    Display Name
    Martin M. Frank
    Affiliation
    Affiliation
    IBM Thomas J. Watson Research Center
    Display Name
    Takeo Yasuda
    Affiliation
    Affiliation
    IBM Research - Tokyo
    Display Name
    Alexander Friz
    Affiliation
    Affiliation
    IBM Research - Almaden
    Display Name
    Yasuteru Kohda
    Affiliation
    Affiliation
    IBM Research - Tokyo
    Display Name
    An Chen
    Affiliation
    Affiliation
    IBM Research - Almaden
    Display Name
    Andrea Fasoli
    Affiliation
    Affiliation
    IBM Research - Almaden
    Display Name
    Malte Rasch
    Affiliation
    Affiliation
    IBM Research
    Affiliation
    Affiliation
    IBM Research Europe
    Display Name
    Jose Luquin
    Affiliation
    Affiliation
    IBM Research
    Affiliation
    Affiliation
    Delft University of Technology and Scientific Director of QuTech
    Display Name
    Geoffrey Burr
    Affiliation
    Affiliation
    IBM Research - Almaden
    Abstract

    We describe a highly heterogeneous and programmable accelerator architecture that combines analog NVM memory-array “Tiles” for weight-stationary, energy-efficient MAC operations, together with heterogeneous special-function Compute-Cores for auxiliary digital computation. Massively parallel vectors of neuron-activation data are exchanged over short distances using a dense and efficient circuit-switched 2D mesh, enabling a wide range of DNN workloads, including CNNs, LSTMs, and Transformers. We also describe a 14-nm inference chip consisting of multiple 512×512 arrays of Phase Change Memory (PCM) devices which implements multiple DNN benchmarks using such a circuit-switched 2D mesh.