Skip to main content
Video s3
    Details
    Author(s)
    Display Name
    Aman Sinha
    Affiliation
    Affiliation
    National Yang Ming Chiao Tung University
    Display Name
    Yuhao Fang
    Affiliation
    Affiliation
    National Yang Ming Chiao Tung University
    Display Name
    Bo-Cheng Lai
    Affiliation
    Affiliation
    National Yang Ming Chiao Tung University
    Abstract

    Genomic big data analysis pipelines are bottlenecked with massive data movement between CPU and memory hierarchy. Recently developed Stacked Embedded DRAM (SEDRAM) with high density hybrid bonding offers not only high bandwidth concurrent data accesses, but also highly parallel distributed data processing on the logic layer. However, the complex logical flow and data dependencies pose challenges to existing Near-DRAM Processing (NDP) solutions for analysis such as string pattern matching using FM-Index. In this work, we propose REGAL, a highly scalable and re-programmable solution on SEDRAM memory system. REGAL architecture maximizes intra-query parallelism and enhances occupancy of all components. The efficient data layout and mapping minimizes round-trip communications between processing engines. The programmability of REGAL further enables effective prefetching to support variations in data characteristics and involved algorithms. REGAL demonstrates up-to 17.3x and 70.6x speedup and energy reductions compared to multithreaded CPU implementation for FM-Index queries, while achieving performance comparable to state-of-the-art fixed-function NDP implementation.