Hand-held ultrasound devices have been widely used in the field of healthcare and power-efficient, real-time imaging is essential. This work presents the world's first ultrasound imaging processor supporting advan...
详细信息
Hand-held ultrasound devices have been widely used in the field of healthcare and power-efficient, real-time imaging is essential. This work presents the world's first ultrasound imaging processor supporting advanced modes, including vector flow imaging and elastography imaging. Plane-wave beamforming is utilized to ensure that the pulse repetition frequency (PRF) is sufficiently high for the advanced mode. The storage size and power consumption are minimized through algorithm-architecture co-optimization. The proposed plane-wave beamforming reduces the storage size of the required delay values by 43.7%. By exchanging the processing order, the storage size is reduced by 78.1% for elastography imaging. Parallel beamforming and interleaved firing are employed to achieve real-time imaging for all the supported modes. Fabricated in 40-nm CMOS technology, the proposed processor integrates 4.7M logic gates in core area of 3.24mm(2). This work achieves a 20.3x higher beamforming rate with 5.3-to-29.1x lower power consumption than the state-of-the-art design. It also has 60% lower hardware complexity (in terms of gate count), in addition to the capability for supporting the advanced mode.
High-mobility communication technology enables important applications in the near future. In such scenarios, the wireless channel exhibits high Doppler spread, which makes orthogonal frequency division multiplexing (O...
详细信息
High-mobility communication technology enables important applications in the near future. In such scenarios, the wireless channel exhibits high Doppler spread, which makes orthogonal frequency division multiplexing (OFDM) adopted in current wireless system suffer from severe inter-carrier interference (ICI). The orthogonal time frequency space (OTFS) technique is a promising modulation to address this issue. It demonstrates higher resilience to Doppler spread than OFDM in terms of bit error rate (BER) at the expense of higher detection complexity. This work presents the first high-throughput multi-user multi-input multi-output (MU-MIMO) detector for OTFS communication systems. A low-complexity message-passing (MP) detection algorithm is proposed to achieve 93% lower computational complexity by leveraging the structure of Gram matrix. A memory-efficient residual noise (RN) update scheme is devised to reduce the memory size for storing the partial interference by 94%. The proposed MP detector achieves a 60% reduction in latency by employing mean computation unit (MCU) and dual-mode multiplier. In addition, a 91% memory access reduction and an 89% memory size reduction in the channel memory bank are achieved, respectively, by leveraging layer ordering, partial Gram matrix saving, and block diagonal approximation. The chip supports up to 32 users, 256 receive antennas, and 256-QAM modulation. Fabricated in a 40-nm CMOS technology, the chip integrates 6.76 M gates in area of 6.47 mm(2) and it delivers a maximal throughput of 6.4 Gb/s. The power consumption is 131 mW at 200 MHz from a 0.9 V supply. In comparison to state-of-the-art MU-MIMO OFDM detectors, this work achieves 3.3-to-21.3x higher maximal throughput and 2.2-to-67.0x lower normalized energy, in addition to higher resilience to Doppler spread.
暂无评论