In order to address the issues of real-time performance and the low dependency between feature channels in fabric defect detection networks, this paper proposes the ESE_YOLOv5 network based on YOLOv5. Firstly, to addr...
In order to address the issues of real-time performance and the low dependency between feature channels in fabric defect detection networks, this paper proposes the ESE_YOLOv5 network based on YOLOv5. Firstly, to address the relative redundancy of the neck detection network feature channels, a relatively lightweight and efficient convolution module is adopted to ensure accuracy while reducing computation and parameter volume. Furthermore, the Efficient Squeeze-Excitation (ESE) module is introduced into the backbone to optimize the dependency of feature channels, which enhances the model's feature extraction capacity and improves detection accuracy. Experimental results show that compared to YOLOv5, the proposed ESE_YOLOv5 model reduces computation and parameter volume while improving accuracy, meeting the needs of fabric defect detection for recognizing fabric defects that have similar characteristics to the background while maintaining real-time performance.
Generalized eigenvalue problem (GEP) plays a significant role in signal processing and machine learning. This paper proposes a consensus-based distributed algorithm for GEP in multi-agent systems, where data samples a...
详细信息
Electroencephalography (EEG)-based brain-computer interfaces (BCIs) enable neural interaction by decoding brain activity for external communication. Motor imagery (MI) decoding has received significant attention due t...
详细信息
When encountering the distribution shift between the source(training) and target(test) domains, domain adaptation attempts to adjust the classifiers to be capable of dealing with different domains. Previous domain ada...
详细信息
When encountering the distribution shift between the source(training) and target(test) domains, domain adaptation attempts to adjust the classifiers to be capable of dealing with different domains. Previous domain adaptation research has achieved a lot of success both in theory and practice under the assumption that all the examples in the source domain are welllabeled and of high quality. However, the methods consistently lose robustness in noisy settings where data from the source domain have corrupted labels or features which is common in reality. Therefore, robust domain adaptation has been introduced to deal with such problems. In this paper, we attempt to solve two interrelated problems with robust domain adaptation:distribution shift across domains and sample noises of the source domain. To disentangle these challenges, an optimal transport approach with low-rank constraints is applied to guide the domain adaptation model training process to avoid noisy information influence. For the domain shift problem, the optimal transport mechanism can learn the joint data representations between the source and target domains using a measurement of discrepancy and preserve the discriminative information. The rank constraint on the transport matrix can help recover the corrupted subspace structures and eliminate the noise to some extent when dealing with corrupted source data. The solution to this relaxed and regularized optimal transport framework is a convex optimization problem that can be solved using the Augmented Lagrange Multiplier method, whose convergence can be mathematically proved. The effectiveness of the proposed method is evaluated through extensive experiments on both synthetic and real-world datasets.
A non-invasive brain-computer interface (BCI) enables direct interaction between the user and external devices, typically via electroencephalogram (EEG) signals. However, decoding EEG signals across different headsets...
详细信息
Recently, large-scale pre-trained language-image models like CLIP have shown extraordinary capabilities for understanding spatial contents, but naively transferring such models to video recognition still suffers from ...
详细信息
Consistency models have demonstrated powerful capability in efficient image generation and allowed synthesis within a few sampling steps, alleviating the high computational cost in diffusion models. However, the consi...
Temporal concept shift (TCS) is an unavoidable problem in physiological signal-based emotion recognition tasks, i.e., the data distribution of physiological signals is constantly changing over time, which gradually de...
Temporal concept shift (TCS) is an unavoidable problem in physiological signal-based emotion recognition tasks, i.e., the data distribution of physiological signals is constantly changing over time, which gradually degrades the model accuracy. To this end, we propose a method based on a combination of domain adaptation and incremental learning to reduce the impact of temporal concept drift. In this paper, domain adaptation is used to reduce the distribution differences and incremental learning is used to prevent the learned knowledge from being forgotten. Finally, we validate the effectiveness of our approach on two real datasets.
We present 3D Cinemagraphy, a new technique that mar-ries 2D image animation with 3D photography. Given a single still image as input, our goal is to generate a video that contains both visual content animation and ca...
We present 3D Cinemagraphy, a new technique that mar-ries 2D image animation with 3D photography. Given a single still image as input, our goal is to generate a video that contains both visual content animation and camera motion. We empirically find that naively combining existing 2D image animation and 3D photography methods leads to obvious artifacts or inconsistent animation. Our key insight is that representing and animating the scene in 3D space offers a natural solution to this task. To this end, we first convert the input image into feature-based layered depth images using predicted depth values, followed by unprojecting them to a feature point cloud. To animate the scene, we perform motion estimation and lift the 2D motion into the 3D scene flow. Finally, to resolve the problem of hole emer-gence as points move forward, we propose to bidirectionally displace the point cloud as per the scene flow and synthe-size novel views by separately projecting them into target image planes and blending the results. Extensive experiments demonstrate the effectiveness of our method. A user study is also conducted to validate the compelling rendering results of our method.
Machine learning has achieved great success in electroencephalogram (EEG) based brain-computer interfaces (BCIs). Most existing BCI studies focused on improving the decoding accuracy, with only a few considering the a...
详细信息
暂无评论