Video Not Available
Details
Abstract
This paper proposes an error-surface-based FME algorithm and the corresponding hardware implementation. This method requires no iteration and interpolation, thus reducing the area and power consumption and increasing the throughput of the hardware. The experimental results show that the corresponding BDBR loss is only 0.47% compared to VTM 16.0 in LD-P configuration. The hardware can support 13 different sizes of CU varying from 128×128 to 8×8. The measured throughput can reach 4K@30fps at 400MHz, with a gate count of 192k and power consumption of 12.64 mW.