Skip to main content
Video s3
    Details
    Presenter(s)
    Ling Zhang Headshot
    Display Name
    Ling Zhang
    Affiliation
    Affiliation
    ShanghaiTech University
    Country
    Author(s)
    Display Name
    Ling Zhang
    Affiliation
    Affiliation
    ShanghaiTech University
    Display Name
    Wei Zhou
    Affiliation
    Affiliation
    ShanghaiTech University
    Display Name
    Xiangyu Zhang
    Affiliation
    Affiliation
    ShanghaiTech University
    Display Name
    Xin Lou
    Affiliation
    Affiliation
    ShanghaiTech University
    Abstract

    In the existing near-/in-sensor computing architectures for vision tasks, the affect of the image signal processing (ISP) pipeline, which is of great importance to the final vision performance, is always ignored. In this work, we propose a synthesized RAW image-based end-to-end computer vision paradigm, taking the affect of ISP pipeline into account. Experimental results show that by training/tuning the CNN models using synthesized RAW images, it is possible to design an end-to-end (from RAW image to vision task) vision system that directly consumes RAW image data from the sensor with negligible vision performance degradation. By skipping the ISP pipeline, an image sensor can be directly integrated with the back-end vision processor without a complex image processor in the middle, making near-/in-sensor computing a practical approach.

    Slides
    • An End-to-End Computer Vision System Architecture (application/pdf)