Approximate computing has become a promising approach for error-tolerant applications, providing significant enhancements in area, power, and delay by easing stringent accuracy constraints. This paper presents three novel high-performance approximate multiplier (AM) architectures that employ optimized approximate 4–2 compressors and a pre-computation scheme for the least significant bits (LSBs) of the final product to minimize critical path delay. The proposed multipliers are developed for both signed and unsigned arithmetic and target applications where energy efficiency and speed are prioritized over exact precision.
The underlying approximate compressors, derived using by introducing controlled inaccuracies only in rare input conditions and Karnaugh map-based logic reduction, thereby achieving a favorable trade-off between hardware cost and error. These compressors, implemented using AOI, OAI, NAND, and NOR logic primitives, achieve a 55%–72% reduction in ADP, a 52%–67% reduction in PDP, and a 53%–80% reduction in PADP compared to exact counterparts. The proposed multipliers significantly reduce the ADP, PDP, and PADP compared to the state-of-the-art.
All proposed multiplier architectures are synthesized using the Cadence® Genus in the 90 nm CMOS library and evaluated using standard design metrics. Monte Carlo simulations confirm low error rates and high computational reliability. The unsigned multipliers are applied to image blending tasks, yielding favorable visual results. At the same time, the signed variants are employed in neural network applications, achieving inference accuracy of 96%–98% with enhanced speed and energy efficiency.
扫码关注我们
求助内容:
应助结果提醒方式:
