Skip to main content
Video s3
    Details
    Presenter(s)
    Hongrui Song Headshot
    Display Name
    Hongrui Song
    Affiliation
    Affiliation
    Nanjing University
    Country
    Author(s)
    Display Name
    Hongrui Song
    Affiliation
    Affiliation
    Nanjing University
    Display Name
    Ya Wang
    Affiliation
    Affiliation
    Nanjing University
    Display Name
    Meiqi Wang
    Affiliation
    Affiliation
    Nanjing University
    Display Name
    Zhongfeng Wang
    Affiliation
    Affiliation
    Nanjing University, China
    Abstract

    We develop a unified compressed version of Vision Transformer (UCViT), whose main focus is on compressing the computation-intensive ViT model by a unified compression method, which converts the dense matrix multiplication to hardware-friendly operations with low bit-width dominated by shift and addition calculations.Meanwhile, to compensate for the accuracy degradation, we additionally introduce a small matrix with relative high-precision associated with the mechanism of multi-head attention in Transformer-based model.

    Slides
    • UCViT: Hardware-Friendly Vision Transformer via Unified Compression (application/pdf)