A high-performance and low-power 32-bit multiply-accumulate unit (MAC) is described in this paper. The fast mixed-length encoding scheme used in the MAC leverages the advantage of a 16-bit encoding scheme without addi...
详细信息
A high-performance and low-power 32-bit multiply-accumulate unit (MAC) is described in this paper. The fast mixed-length encoding scheme used in the MAC leverages the advantage of a 16-bit encoding scheme without adding extra delay to the faster four-stage Wallace tree of a 12-bit encoding scheme. With this new encoding scheme, one-cycle throughput for 16-bit x 16-bit and 32-bit x 16-bit MAC instructions was achieved at very high frequencies. To handle media streams more efficiently, the single-instruction-multiple-data (SIMD) and the multiply-with-implicit-accumulate (MIA) features were added. A mixture of static CMOS logic and complementary pass-gate logic (CPL) was used to achieve the high-speed and low-power goals. Several power-saving techniques were also implemented in this MAC.
暂无评论