The paper introduces an innovative time-dependent finite impulse response (FIR) filter designed for utilization in time-mode signalprocessing systems like mobile phones and digital transmitters-receivers. The filter&...
详细信息
A promising nanotechnology, quantum dot cellular automata (QCA) has the potential to replace the standard CMOS era owing to its high speed and low power consumption. A crucial component of virtual systems is an counte...
详细信息
Flow-based generative super-resolution (SR) models learn to produce a diverse set of feasible SR solutions, called the SR space. Diversity of SR solutions increases with the temperature (t) of latent variables, which ...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
Flow-based generative super-resolution (SR) models learn to produce a diverse set of feasible SR solutions, called the SR space. Diversity of SR solutions increases with the temperature (t) of latent variables, which introduces random variations of texture among sample solutions, resulting in visual artifacts and low fidelity. In this paper, we present a simple but effective image ensembling/fusion approach to obtain a single SR image eliminating random artifacts and improving fidelity without significantly compromising perceptual quality. We achieve this by benefiting from a diverse set of feasible photorealistic solutions in the SR space spanned by flow models. We propose different image ensembling and fusion strategies which offer multiple paths to move sample solutions in the SR space to more desired destinations in the perception-distortion plane in a controllable manner depending on the fidelity vs. perceptual quality requirements of the task at hand. Experimental results demonstrate that our image ensembling/fusion strategy achieves more promising perception-distortion trade-off compared to sample SR images produced by flow models and adversarially trained models in terms of both quantitative metrics and visual quality.
A convenient and effective mobile tool for enabling the painless removal of background from photographs is the image Background Remover Android Application. The essential job of background removal in image analysis an...
详细信息
Limited by the sensor resolution of plenoptic cameras, it is challenging to obtain spatial and angular high resolution light field (LF). In this paper, we propose a method for spatial and angular super-resolution (SR)...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
Limited by the sensor resolution of plenoptic cameras, it is challenging to obtain spatial and angular high resolution light field (LF). In this paper, we propose a method for spatial and angular super-resolution (SR) simultaneously while preserving parallax structure, which generates a high resolution (HR) dense LF from a low resolution (LR) sparse one with detailed textures. The proposed framework consists of four parts, including a depth estimation module to get scene geometry, an image warping process to generate intermediate warped views, an epipolar plane image (EPI) generation module to recover structural features, and a spatial-angular SR module to fully explore the informative features in 4D LF. Finally, the high quality HR scenes can be effectively reconstructed. Experimental results over several LF datasets demonstrate that our approach not only outperforms the existing spatial and angular SR methods separately, but also achieves better quantitative and qualitative results than the one-stage and two-stage spatial-angular approaches.
The panoramic view cameras offer more broad perspectives and continuous information for person re-identification (ReID). However, the panoramic-view videos suffer from objects distortion and bring more occlusion due t...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
The panoramic view cameras offer more broad perspectives and continuous information for person re-identification (ReID). However, the panoramic-view videos suffer from objects distortion and bring more occlusion due to the fixed or moving capture points. This paper proposes a novel Bayesian Transformer Network (BTN) to adaptively capture the occlusion clues as Bayesian prior to guide the discriminative pedestrian-related feature extraction in the high-occlusion scenes. The Bayesian prior is built via a pre-trained CNN, which could recognize different occluded scenarios based on the severeness of noisy backgrounds. Moreover, to fully explore the occlusion prior, we propose to embed the semantic labels into a well-designed transformer network. By fostering the collaborative occlusion clues between the person and background, our method could achieve outstanding performance on both public benchmarks and panoramic view videos, which verifies the advantages of our BTN framework over existing methods.
The joint design of mismatched filter and waveform is an important technique in suppressing sidelobes of phase coded waveform. Conventionally, an implied constraint that the mainlobe response is of the same width as a...
详细信息
Cardiac magnetic resonance (CMR) imaging is frequently recommended for patients at intermediate risk of cardiovascular disease to triage them for medication or invasive aggressive treatment. Mitral annulus (MA) motion...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
Cardiac magnetic resonance (CMR) imaging is frequently recommended for patients at intermediate risk of cardiovascular disease to triage them for medication or invasive aggressive treatment. Mitral annulus (MA) motion and velocities represent the cardiac contraction and relaxation, and hold potential to improve the detection of subtle cardiac dysfunction. However, conventional interpretation of CMR images requires expert manipulation and is often operator-dependent. In this paper, we propose an end-to-end MA Point Tracking Network (MANet) to automatically detect and track MA motion during cardiac cycle. The MANet model consists of MA point detection module and motion tracking module. In MA point detection, we design the convolutional-based feature extraction and elastic regression to detect MA points frame by frame of each CMR video. Then, in MA tracking, we adopt the Deep SORT model to capture spatio-temporal continuity between frames and fine-tune the coordinate position of MA points. 171 CMR videos with 4275 frames are used in comparison experiments, and the results demonstrate that our MANet model achieves promising performance in reference to clinical ground truth (r=0.71, P<0.001). This work provides an important preamble for cardiac motion tracking and cardiac function evaluation.
The global prevalence of visual impairment and blindness due to diabetic retinopathy can be significantly reduced through improved diabetes management. Diabetic retinopathy (DR) screening is crucial as it can prevent ...
详细信息
This work significantly advances optimized Quadrature Mirror Filter (QMF) bank design methodologies for promising signalprocessing and communication systems applications. An innovative approach to design a two-channe...
详细信息
暂无评论