Skip to main content
Video s3
    Details
    Presenter(s)
    Mareeta Mathai Headshot
    Display Name
    Mareeta Mathai
    Affiliation
    Affiliation
    Santa Clara University
    Country
    Author(s)
    Display Name
    Mareeta Mathai
    Affiliation
    Affiliation
    Santa Clara University
    Display Name
    Ying Liu
    Affiliation
    Affiliation
    Santa Clara University
    Display Name
    Nam Ling
    Affiliation
    Affiliation
    Santa Clara University
    Abstract

    Future frame prediction is a challenging task in the deep learning field due to its inherent uncertainty and complex spatiotemporal dynamics. Significant improvement in prediction accuracy has been achieved by state-of-the-art methods at the expense of complex, computationally intensive deep neural networks, which makes it difficult to deploy in mobile devices. Hence, we propose a lightweight model using 3D separable convolutions, which can predict future video frames with reasonable accuracy-complexity tradeoffs. Experimental studies on three popular video prediction datasets demonstrate that compared to state-of-the-art methods, our proposed method significantly reduces the model size and computational complexity with tolerable accuracy loss.

    Slides
    • A Lightweight Model with Separable CNN and LSTM for Video Prediction (application/pdf)