Skip to main content
Video s3
    Details
    Presenter(s)
    Chih-Hsuan Lin Headshot
    Display Name
    Chih-Hsuan Lin
    Affiliation
    Affiliation
    National Yang Ming Chiao Tung University
    Country
    Country
    Taiwan
    Author(s)
    Display Name
    Yung-Han Ho
    Affiliation
    Affiliation
    NYCU
    Display Name
    Chih-Hsuan Lin
    Affiliation
    Affiliation
    National Yang Ming Chiao Tung University
    Display Name
    Peng-Yu Chen
    Affiliation
    Affiliation
    National Yang Ming Chiao Tung University
    Display Name
    Mu-Jung Chen
    Affiliation
    Affiliation
    National Yang Ming Chiao Tung University
    Display Name
    Chih-Peng Chang
    Affiliation
    Affiliation
    National Yang Ming Chiao Tung University
    Display Name
    Wen-Hsiao Peng
    Affiliation
    Affiliation
    National Chiao Tung University
    Display Name
    Hsueh-Ming Hang
    Affiliation
    Affiliation
    National Yang Ming Chiao Tung University
    Abstract

    This paper proposes a learning-based video compression framework that applies a conditional flow-based model for inter-frame coding and takes YUV 4:2:0 as the input format. Most learning-based video compression models use predictive coding and directly encode the residual signal, which is considered a sub-optimal solution. In addition, those models usually only operate on RGB, which is also regarded as an inefficient format. Furthermore, they require multiple models to fit on different bit rates. To solve these issues, we introduce a conditional flow-based video compression framework to improve the coding efficiency. To adapt to YUV 4:2:0 format, we incorporate lossless space-to-depth and depth-to-space transformation in our design. Lastly, we apply rate-adaption net on both I-frame and P-frame coder to achieve variable-rate coding and can further be extended to rate control applications. Our experimental results show comparable or better performance against x265 for UVG and MCL-JCV common test datasets in terms of PSNR-YUV.

    Slides
    • Learned Video Compression for YUV 4:2:0 Content Using Flow-Based Conditional Inter-Frame Coding (application/pdf)