The generative adversarial network(GAN)is first proposed in 2014,and this kind of network model is machine learning systems that can learn to measure a given distribution of data,one of the most important applications...
详细信息
The generative adversarial network(GAN)is first proposed in 2014,and this kind of network model is machine learning systems that can learn to measure a given distribution of data,one of the most important applications is style *** transfer is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output ***-GAN is a classic GAN model,which has a wide range of scenarios in style *** its unsupervised learning characteristics,the mapping is easy to be learned between an input image and an output ***,it is difficult for CYCLE-GAN to converge and generate high-quality *** order to solve this problem,spectral normalization is introduced into each convolutional kernel of the *** convolutional kernel reaches Lipschitz stability constraint with adding spectral normalization and the value of the convolutional kernel is limited to[0,1],which promotes the training process of the proposed ***,we use pretrained model(VGG16)to control the loss of image content in the position of l1 *** avoid overfitting,l1 regularization term and l2 regularization term are both used in the object loss *** terms of Frechet Inception Distance(FID)score evaluation,our proposed model achieves outstanding performance and preserves more discriminative *** results show that the proposed model converges faster and achieves better FID scores than the state of the art.
We propose an unsupervised person search method for video surveillance. This method considers both the spatial features of persons within each frame and the temporal relationship of the same person among different fra...
详细信息
Many vision-related processing tasks, including edge detection and image segmentation, can be performed more easily when all objects in the scene are in good focus. However, in practice, this may not be always feasibl...
详细信息
Knowing the locations of nodes in wireless sensor networks (WSN) is essential for many applications. Nodes in a WSN can have multiple capabilities and exploiting one or more of the capabilities can help to solve the l...
详细信息
Knowing the locations of nodes in wireless sensor networks (WSN) is essential for many applications. Nodes in a WSN can have multiple capabilities and exploiting one or more of the capabilities can help to solve the localization problem. In this paper, we assume that each node in a WSN has the capability of distance measurement and present a location computation technique called linear intersection for node localization. We also propose an applied localization model using linear intersection and do some concerned experiments to estimate the location computation algorithm.
Linear Discriminant Analysis (LDA) is one of the most used feature extraction techniques for face recognition. However, it often suffers from the small sample size problem with high dimension setting. Random Subspace ...
详细信息
Linear Discriminant Analysis (LDA) is one of the most used feature extraction techniques for face recognition. However, it often suffers from the small sample size problem with high dimension setting. Random Subspace Method (RSM) is a popular combining technique to improve weak classifier. Nevertheless, it remains a problem how to construct an optimal random subspace for discriminant analysis. In this paper, we propose an improved random sampling LDA for face recognition. Firstly, AdaBoost is adopted to select Gabor feature and remove redundant information. Secondly, in the selected Gabor feature space, we combine principal component analysis and RSM approaches to construct optimal random subspaces for LDA. After that, direct LDA (D-LDA) and R-LDA is applied in each subspace, respectively. Final results are obtained by combining all the LDA classifiers using a fusion rule. Experiments with both the ORL and FERET face databases demonstrate the effectiveness of our proposed method, and it shows promising results compared with previous approaches.
DSP/FPGA-based parallel architecture oriented to real-time Image processing applications is presented. The architecture is structured with high performance DSPs interconnected by FPGA. Within FPGA a FIFO interconnecti...
详细信息
Bone scintigraphy is widely used to diagnose bone diseases. Accurate hotspot segmentation is a critical task for tumor metastasis diagnosis. In this paper, we propose an interactive approach to detect and extract hots...
详细信息
ISBN:
(纸本)9781479999897
Bone scintigraphy is widely used to diagnose bone diseases. Accurate hotspot segmentation is a critical task for tumor metastasis diagnosis. In this paper, we propose an interactive approach to detect and extract hotspots in thoracic region based on a new multiple instance learning (MIL) method called EM-MILBoost. We convert the segmentation problem to a multiple instance learning task by constructing positive and negative bags according to the input bounding box. In order to be robust against noisy input, we train a region-level hotspot classifier with EM-MILBoost and develop several segmentation strategies based on it. The experimental results demonstrate that our method outperforms other methods and is robust against various noisy input.
Segmentation becomes a difficult task if the objects and background are not homogeneous and having overlapping characteristics. Cattle segmentation from its background is required in several typical applications, such...
详细信息
Segmentation becomes a difficult task if the objects and background are not homogeneous and having overlapping characteristics. Cattle segmentation from its background is required in several typical applications, such as: the automatic cattle race classification. The cattle's fur detection which is inspired from the human skin detection is investigated in this paper for cattle and background segmentation in automatic beef cattle race classification. The Gaussian mixture model that was used in skin detection has been adopted to model Bali cow and Hybrid Ongole cow in this beef cattle race classification. The RGB color space and two texture descriptors are used as the features set. The addition of texture descriptor has increased the performance of the fur detection and automatic race classification. The GMM performs well but the noise and the complexity of the background lead to misclassification.
A considerable amount of research work has been done for texture classification using local or global feature extraction methods. Inspired by Weber's Law, a simple and robust Weber Local Descriptor (WLD) is a rece...
详细信息
A considerable amount of research work has been done for texture classification using local or global feature extraction methods. Inspired by Weber's Law, a simple and robust Weber Local Descriptor (WLD) is a recently developed for local feature extraction. This WLD method did not consider the contrast information. In order to improve texture classification accuracy, we propose a hybrid approach that combines the WLD with contrast information in this paper. It utilizes the histogram of two complementary features WLD and the image variance calculated with the Probability Weighted Moments. Support vector machine is used for classification. The comparison of the proposed method with state of art methods like local binary pattern and WLD is experimental investigated on two publically available dataset, named as Brodatz and KTH-TIPS2-a. Results show that our proposed method outperforms over the state of art methods for texture classification.
Batik, as a cultural heritage from Indonesia, has a lot of motifs based on certain patterns. This paper discusses feature extraction methods for the recognition of batik motifs in digital images. In this study, the us...
详细信息
Batik, as a cultural heritage from Indonesia, has a lot of motifs based on certain patterns. This paper discusses feature extraction methods for the recognition of batik motifs in digital images. In this study, the use of several feature extraction methods have been compared in terms of their performance with several scenarios for testing level accuracy. The methods include Gray Level Co-occurrence Matrices (GLCM), Canny Edge Detection, and Gabor filters. The experimental results show that the use of GLCM features has performed the best with a classification accuracy reaching 80%.
暂无评论