Convolutional neural networks have been widely used for human pose estimation tasks, but with some issues. It is limited to local receptive fields and it is difficult to capture global information. To address this pro...
Convolutional neural networks have been widely used for human pose estimation tasks, but with some issues. It is limited to local receptive fields and it is difficult to capture global information. To address this problem, we propose the GM-HRNet network. The network aggregates the multi-stage feature information of HRNet, and makes full use of the criss-cross attention and channel attention to obtain the context information in the space and channel dimensions, and realizes the modeling of the global relationship, so as to effectively locate the keypoints of the human body. In this paper, HRNet is used as a benchmark to conduct experiments on MPII dataset. Experimental results show that the accuracy of GM-HRNet is 1.3 percent higher than that of HRNet under the same experimental conditions, which validates the effectiveness of the GM-HRNet model.
Explicit information of lesions can provide visual instructions for diabetic retinopathy (DR) grading on fundus images. However, pixel-level lesion annotations are extremely difficult and time-consuming to acquire. In...
详细信息
Time series anomaly detection often faces significant challenges due to the high dimensionality, interdependence, and label sparsity of the data. Traditional methods struggle to model the intricate relationships betwe...
详细信息
As metasurface technology has been developing rapidly over the past decades, multi-multiplexing and tunability are evolving into hot spots in its development. Here, a metasurface fit for vortex beam generation, beam f...
详细信息
As metasurface technology has been developing rapidly over the past decades, multi-multiplexing and tunability are evolving into hot spots in its development. Here, a metasurface fit for vortex beam generation, beam focusing, linear-to-circular polarization conversion, and absorption is proposed in this study, contingent upon the phase change properties of vanadium dioxide. When the temperature is 25°C, the vortex beam and focusing generated when the circularly polarized wave is incident on the metasurface, the calculated outcomes demonstrate that the cross-polarization reflective coefficient is close to the co-polarization coefficient at the frequency range of 0.5–0.9 THz and 1.2–1.5 THz, implementing the linear-to-circular polarization conversion. When heated up to 68°C, it functions as an absorber at 1.55 THz when the y-polarized wave is incident vertically on the metasurface. The designed metasurface is likely to be used in the fields of terahertz communication and imaging systems.
Automatic Piano Transcription is to transcribe raw audio files into annotated piano rolls. In recent studies, jointly estimating pitch, onset, offset, and velocity of each note is commonly used. The previous state-of-...
详细信息
There are some defects on the surface of steel that are difficult to detect, so we propose an improved algorithm based on YOLOv8 for detecting defects that are difficult to be detected. This improvement includes the i...
详细信息
This paper is concerned with a Nash equilibrium(NE)tracking issue in online games with bandit feedback,where cost functions vary with time and agents only have access to the values of these functions at two points dur...
详细信息
This paper is concerned with a Nash equilibrium(NE)tracking issue in online games with bandit feedback,where cost functions vary with time and agents only have access to the values of these functions at two points during each round.A partial-decision information setting is considered,in which agents have only access to the decisions of their *** primary objective of this paper is to develop a distributed online NE tracking algorithm that ensures sublinear growth of regret with respect to the total round T,under both the bandit feedback and partial-decision information *** utilizing a two-point estimator together with the leader-following consensus method,a new distributed online NE tracking algorithm is established with the estimated gradient and local estimated decisions based on the projection gradient-descent ***,sufficient conditions are derived to guarantee an improved upper bound of dynamic regret compared to existing bandit ***,a simulation example is presented to demonstrate the effectiveness of the proposed algorithm.
Multimodal recommender systems (MRSs) aim to integrate information from multiple modalities, for better capturing users' preferences. However, existing MRSs usually face the challenge of data sparsity, especially ...
详细信息
Getting dense, uniform, time-series point cloud data is critical for effective rendering. However, due to the limited computational power of edge devices, existing methods cannot achieve real-time results, which affec...
详细信息
Aiming at the problem of model misdetection and missed detection caused by the small defect and unclear features of industrial metal surfaces, this paper studies a large number of metal surface defects, and proposes a...
Aiming at the problem of model misdetection and missed detection caused by the small defect and unclear features of industrial metal surfaces, this paper studies a large number of metal surface defects, and proposes a metal surface micro defect detection algorithm based on improved YOLOv5. First, this paper uses the FenceMask data enhancement method for regularization to solve the problem of few sample images. Then an enhanced multi-scale feature fusion pyramid network DS-FPN is proposed by introducing depthwise separable convolution, dilated convolution and spatial attention mechanism, so that the model can improve the ability to extract keyinformation of images without adding additional parameters and calculations. An adaptive channel spatial attention mechanism SCBAM is also proposed, which adds non-local information to the original interaction with only local information, improving the feature expression ability of the model. Finally, through experimental verification, the detection accuracy of the model in this paper on the public data set GC10-DET has reached 74.9%, which is 4.7% higher than the original YOLOv5 model.
暂无评论