The ability to synthesize convincing human speech has become easier due to the availability of speech generation tools. This necessitates the development of forensics methods that can authenticate and attribute speech...
详细信息
Purpose: The advancement of high-content optical microscopy has enabled the acquisition of very large three-dimensional (3D) image datasets. The analysis of these image volumes requires more computational resources th...
详细信息
Person re-identification (re-ID) has wide applications in surveillance and security. It is also challenging due to viewpoint, occlusion and illumination variations across different cameras. One solution to unsupervise...
Person re-identification (re-ID) has wide applications in surveillance and security. It is also challenging due to viewpoint, occlusion and illumination variations across different cameras. One solution to unsupervised person re-ID problems is synthetic data augmentation. Generative neural networks have been used to translate images from the source domain into the target domain. In this paper, we introduce a new virtual-human image dataset that can be used as the source domain for person re-ID. This new dataset has images labeled by person identity, background, viewpoint and illumination intensity. We also explore GAN-based and Diffusion-based generative methods for unpaired image-to-image translation and provide qualitative and quantitative evaluation for the synthetic results.
Unsupervised person re-identification (re-ID) aims to learn identity information from a source domain (e.g. one surveillance system) and apply it to a target domain (e.g. a different surveillance system). This is chal...
Unsupervised person re-identification (re-ID) aims to learn identity information from a source domain (e.g. one surveillance system) and apply it to a target domain (e.g. a different surveillance system). This is challenging due to occlusion, viewpoint, and illumination variations between the different domains (i.e. systems). In this paper, we propose a neural network architecture, known as Synthetic Model Bank (SMB), to address illumination variation in unsupervised person re-ID. The basic idea of SMB is to use synthetic data for training different re-ID models for different illumination conditions. From our experiments, the proposed SMB outperforms other synthetic augmentation methods on several re-ID benchmarks.
Unmanned Aerial Vehicles (UAVs) have the ability to acquire high resolution RGB images from plant fields. A field used for plant research is usually divided into smaller groups of plants known as "plots" to ...
Unmanned Aerial Vehicles (UAVs) have the ability to acquire high resolution RGB images from plant fields. A field used for plant research is usually divided into smaller groups of plants known as "plots" to evaluate varieties or management practices. In this paper, we propose an optimization-based, rotation-adaptive approach for extracting plots in a UAV RGB orthomosaic image. From the experiments, our proposed method is robust against range/plot rotation and achieves higher segmentation accuracy compared with existing plot extraction approaches.
The sorghum panicle is an important trait related to grain yield and plant development. Detecting and counting sorghum panicles can provide significant information for plant phenotyping. Current deep-learning-based ob...
The sorghum panicle is an important trait related to grain yield and plant development. Detecting and counting sorghum panicles can provide significant information for plant phenotyping. Current deep-learning-based object detection methods for panicles require a large amount of training data. The data labeling is time-consuming and not feasible for real application. In this paper, we present an approach to reduce the amount of training data for sorghum panicle detection via semi-supervised learning. Results show we can achieve similar performance as supervised methods for sorghum panicle detection by only using 10% of original training data.
As voice synthesis systems and deep learning tools continue to improve, so does the possibility that synthesized speech can be used for nefarious purposes. Methods that determine if audio signals contain synthesized o...
As voice synthesis systems and deep learning tools continue to improve, so does the possibility that synthesized speech can be used for nefarious purposes. Methods that determine if audio signals contain synthesized or authentic speech are needed. In this paper, we investigate three transformers to detect synthesized speech: Compact Convolutional Transformer (CCT), Patchout faSt Spectrogram Transformer (PaSST), and Self-Supervised Audio Spectrogram Transformer (SSAST). We show that each transformer independently detects synthesized speech well. Then, we propose an ensemble of transformers that can provide even better performance. Finally, we explore how much of an audio signal is needed for high synthesized speech detection. Evaluated on the ASVspoof2019 dataset, we demonstrate that our transformer ensemble detects synthesized speech from shorter segments of audio signals, even on a highly imbalanced dataset.
The diagnosis of faults in grid-connected photovoltaic (GCPV) systems is a challenging task due to their complex nature and the high similarity between faults. To address this issue, we propose a wrapper approach call...
详细信息
Electric vehicles(EVs)are widely deployed throughout the world,and photovoltaic(PV)charging stations have emerged for satisfying the charging demands of EV *** paper proposes a multi-objective optimal operation method...
详细信息
Electric vehicles(EVs)are widely deployed throughout the world,and photovoltaic(PV)charging stations have emerged for satisfying the charging demands of EV *** paper proposes a multi-objective optimal operation method for the centralized battery swap charging system(CBSCS),in order to enhance the economic efficiency while reducing its adverse effects on power *** proposed method involves a multi-objective optimization scheduling model,which minimizes the total operation cost and smoothes load fluctuations,***,we modify a recently proposed multi-objective optimization algorithm of non-sorting genetic algorithm III(NSGA-III)for solving this scheduling ***,simulation studies verify the effectiveness of the proposed multi-objective operation method.
We introduce Probabilistic Coordinate Fields (PCFs), a novel geometric-invariant coordinate representation for image correspondence problems. In contrast to standard Cartesian coordinates, PCFs encode coordinates in c...
详细信息
暂无评论