Low-light image enhancement is crucial for human vision and computer vision task, attracting significant attention. However, most current enhancement methods are supervised and lack the ability to adjust based on ligh...
详细信息
Noisy color image and guided near-infrared (NIR) image can be jointly employed to eliminate noise and enhance details. Existing methods mostly rely on explicit designed filters and hand-crafted objective function opti...
详细信息
ISBN:
(纸本)9781479981311
Noisy color image and guided near-infrared (NIR) image can be jointly employed to eliminate noise and enhance details. Existing methods mostly rely on explicit designed filters and hand-crafted objective function optimization. These methods usually introduce erroneous structures from guidance signal. Besides, they are time-consuming and not suitable for real time applications. In this paper, we come up with a learning based method. The noisy color image and NIR image are fused, then fed into a fully convolutional neural network. The network learns a directly map from degraded image to restored sharp image. Our architecture can effectively eliminate image noise and transfer detail structure from guided image. Our trained network accepts any resolution of input image and runs in constant time. We evaluate the presented approach on both synthetic and real images. Results show that our approach outperforms the state-of-art methods.
image denoising is one key concept in image restoration and it is widely used in various imageprocessing applications. There are many traditional methods for image denoising existing, all these methods are based on f...
image denoising is one key concept in image restoration and it is widely used in various imageprocessing applications. There are many traditional methods for image denoising existing, all these methods are based on filtering in spatial domain and frequency domain. This work focuses on removing of impulse noise, which converts the pixel values to zero or maximum. Proposed method comprises the convolutional neural network (CNN)to remove the impulse noise. The Residual CNN(RCNN) is used in the proposed method. The structure of the network consists of three stages that is convolutional layers followed by the residual block and finally convolutional layers. The skip connections in RCNN reduce the gradient vanish problem in traditional CNN based denoising methods. The proposed network trained by using dataset of 12 images. The stochastic gradient descent momentum (SGDM) optimizer used to optimize the weights. The RCNN trained using SGDM optimizer takes less time for convergence into minimum. The proposed network is tested with various testing images. The proposed RCNN based image denoising gives better results than the traditional median filter-based image denoising with respect to PSNR and SSIM.
In recent years, deep-learning based aesthetics assessment methods have shown promising results. However, existing methods can only achieve limited success because 1) most of the methods take one fixed-size patch as t...
详细信息
In recent years, deep-learning based aesthetics assessment methods have shown promising results. However, existing methods can only achieve limited success because 1) most of the methods take one fixed-size patch as the training example, which loses the fine grained details and the holistic layout information, and 2) most of the methods ignore ordinal issues in image aesthetic assessment, ie. image scored 5.3 is more likely to be in the high quality class than image scored 4.5. To address these challenges, we presents a novel convolutional networks with two branches to encode global and local features. The first branch not only captures the spatial layout information but also feedbacks the top-down neural attention. The second branch selects the important attended region to extract the fine details features. A sobel-based attention layer is integrated with the second branch to enhance fine details encoding. Regarding the second problem, we combine the strength of classification approach and regression approach by a multi-task learning framework. Extensive experiments on challenging Aesthetic and Visual Analysis (AVA) dataset and *** dataset indicate the effectiveness of the proposed method.
Speech synthesis methods can create realistic-sounding speech, which may be used for fraud, spoofing, and mis-information campaigns. Forensic methods that detect synthesized speech are important for protection against...
详细信息
Speech synthesis methods can create realistic-sounding speech, which may be used for fraud, spoofing, and mis-information campaigns. Forensic methods that detect synthesized speech are important for protection against such attacks. Forensic attribution methods provide even more information about the nature of synthesized speech signals because they identify the specific speech synthesis method (i.e., speech synthesizer) used to create a speech signal. Due to the increasing number of realistic-sounding speech synthesizers, we propose a speech attribution method that generalizes to new synthesizers not seen during training. To do so, we investigate speech synthesizer attribution in both a closed set scenario and an open set scenario. In other words, we consider some speech synthesizers to be "known" synthesizers (i.e., part of the closed set) and others to be "unknown" synthesizers (i.e., part of the open set). We represent speech signals as spectrograms and train our proposed method, known as compact attribution transformer (CAT), on the closed set for multi-class classification. Then, we extend our analysis to the open set to attribute synthesized speech signals to both known and unknown synthesizers. We utilize a t-distributed stochastic neighbor embedding (tSNE) on the latent space of the trained CAT to differentiate between each unknown synthesizer. Additionally, we explore poly-1 loss formulations to improve attribution results. Our proposed approach successfully attributes synthesized speech signals to their respective speech synthesizers in both closed and open set scenarios.
The reflection problem often occurs when imaging through a semitransparent material such as glass. It degrades the image quality and affects the subsequent analyses on the image. Traditional single-image based reflect...
详细信息
The reflection problem often occurs when imaging through a semitransparent material such as glass. It degrades the image quality and affects the subsequent analyses on the image. Traditional single-image based reflection removal methods assume the reflection is blurry. Deep neural networks (DNNs) are, then, used to identify the blurry reflection and remove it. However, it is often that the blurry reflection still contains strong edges. They will be treated as the background and kept in the image. In this letter, we propose a novel two-stage DNN based reflection removal algorithm. In the first stage, we include a new feature reduction term in the loss function when training the network. Due to its strong reflection suppression ability, the reflection components in the image can he more effectively suppressed. However, it will also attenuate the gradient values of the background image. For recovering the background, in the second stage, we first estimate a reflection gradient confidence map based on the initial estimation result and use it to identify the strong background gradients. Then, we use a generative adversarial network to reconstruct the background image from its gradients. Experimental results show that the proposed two-stage approach can give a superior performance compared with the state-of-the-art DNN based methods.
Recently, the advancements of Internet-of-Things (IoT) have expanded its application in underwater environment which leads to the development of a new field of Internet of Underwater Things (IoUT). It offers a broader...
详细信息
Radar systems emit a time-varying signal and measure the response of a radar-reflecting surface. In the case of narrowband, monostatic radar signal domain, all spatial information is projected into a radar cross-secti...
详细信息
Radar systems emit a time-varying signal and measure the response of a radar-reflecting surface. In the case of narrowband, monostatic radar signal domain, all spatial information is projected into a radar cross-section (RCS) scalar. The authors address the challenging problem of determining shape class using monostatic RCS estimates collected as a time series from a rotating object tumbling with unknown motion parameters under detectability limitations and signal noise. Previous shape classification methods have relied on image-like synthetic aperture radar or multistatic (multiview) radar configurations with known geometry. Convolutional neural networks (CNNs) have revolutionised learning tasks in the computer vision domain by leveraging images and video rich with high-resolution two-dimensional (2D) or 3D spatial information. They show that a feed-forward CNN can be trained to successfully classify object shape using only noisy monostatic RCS signals with unknown motion. They construct datasets containing over 100,000 simulated RCS signals belonging to different shape classes. They introduce deep neural network architectures that produce 2% classification error on testing data. They also introduce a refinement network that transforms simulated signals to appear more realistic and improve training utility. The results are a pioneering step toward the recognition of more complex targets using narrowband, monostatic radar.
The convolutional neural network (CNN) and other neural networks (NNs) provide promising tools for robotized characterization of tumor cells. However, the tumor growth areas in ultrasound images are normally obscure, ...
详细信息
The convolutional neural network (CNN) and other neural networks (NNs) provide promising tools for robotized characterization of tumor cells. However, the tumor growth areas in ultrasound images are normally obscure, with uncertain edges. It is not acceptable to prepare ultrasound images straightforwardly with the CNN. To solve the problem, this paper puts forward a faster region-convolutional neural network (R-CNN) to identify tumor cells with the aid of auto encoders Taking two fully-connected layers with dropout and ReLU enactments as the base, the proposed faster R-CNN adopts 3D convolutional and max pooling layers, enabling the user to extract features from potential tumor growth areas. In addition, the thin and deep layers of the network were connected to facilitate the identification of blurry or small tumor growth areas. Experimental results show that the proposed faster R-CNN with auto encoders outperformed traditional data mining and artificial intelligence (AI) methods in prediction accuracy of tumor cells.
Accurate analysis of meibomian gland morphology based on meibography images is of great importance for the diagnosis of dry eye disease. However, it is still a difficult task due to the time-consuming and variability ...
详细信息
暂无评论