In this paper, the robust containment control problem of the leader-following multi-agent systems with input saturation and input additive disturbance is addressed, where the followers can be informed by multiple lead...
详细信息
In this paper, the robust containment control problem of the leader-following multi-agent systems with input saturation and input additive disturbance is addressed, where the followers can be informed by multiple leaders. With the help of the lowand-high gain feedback technique and the high-gain observer approach, a distributed control algorithm for each agent is firstly designed by using the observed output information, then sufficient conditions are provided to guarantee the semi-global robust containment of the system. Finally, some numerical simulations are given to verify the correctness of the theoretical results.
Modeling spatio-temporal sequences is an important topic yet challenging for existing neural networks. Most of the current spatio-temporal sequence prediction methods usually capture features separately in temporal an...
详细信息
Person re-identification has gradually become a hot research topic in many fields, such as security, criminal investigation and video analysis. In this paper, we propose a novel feature extraction framework for video-...
Person re-identification has gradually become a hot research topic in many fields, such as security, criminal investigation and video analysis. In this paper, we propose a novel feature extraction framework for video-based person re-identification, namely, the partial attribute-driven network (PADNet). The proposed method is based on the deep-learning architecture and incorporates the attribute and identity learning of the pedestrian. Existing attribute research always focuses on the feature representation at the global-level. Unlike them, first, the pedestrian is automatically partitioned to several body parts in our work. Then the pedestrian and his/her body parts are annotated by the global and partial attributes, respectively. Finally, we employ a four-branch multi-label network to explore the spatial-temporal cues of videos by utilizing these labeled samples. Extensive experiments are conducted on two video-based datasets, including PRID2011 and iLIDS-VID. The experimental results demonstrate the superiority and effectiveness of the proposed PADNet over the state-of-the-art approaches.
When deploying a Chinese neural Text-to-Speech (TTS) system, one of the challenges is to synthesize Chinese utterances with English phrases or words embedded. This paper looks into the problem in the encoder-decoder f...
详细信息
This paper presents a joint parallel loop filtering algorithm based on multi-thread load balancing in HEVC decoding, which implements the parallel processing of deblocking filtering (DBF) and sample adaptive compensat...
This paper presents a joint parallel loop filtering algorithm based on multi-thread load balancing in HEVC decoding, which implements the parallel processing of deblocking filtering (DBF) and sample adaptive compensation (SAO). Because of the diversity of video, the texture of different regions in an image is also different, which leads to various CTU partition methods. Therefore, the number of the boundary to be filtered is greatly different, resulting the computation load among multiple threads unbalanced in parallel processing. To solve this problem, an area division scheme is proposed, which divides the image into multiple areas, and the number of boundaries to be filtered in each area is similar. Then, the mapping relationship table is used to allocate these areas to multiple threads for parallel processing, so as to achieve the load balancing among the filtering threads. Finally, the cache technology is used to combine DBF and SAO to reduce the delay between them and improve the overall parallelism of the loop filter. Experimental results show that the performance of the proposed load balancing joint filtering algorithm is 8.15% higher than the previous scheme.
Target tracking is currently a hot research topic in Computer Vision and has a wide range of use in many research fields. However, due to factors such as occlusion, fast motion, blur and scale variation, tracking meth...
Target tracking is currently a hot research topic in Computer Vision and has a wide range of use in many research fields. However, due to factors such as occlusion, fast motion, blur and scale variation, tracking method still needs to be deeply studied. In this paper, we propose a block target tracking method based on multi-convolutional layer features and Kernel correlation filter. Our method divides the tracking process into two parts: target position estimation and target scale estimation. First, we block the target frame based on the condition number. Second, we extract the features by the convolutional layer and apply it to the kernel correlation filter to get the center position of different block targets. With the reliability of different blocks measured by the Barker coefficient, the overall target position center is obtained. Then, the affine transformation is adopted to achieve the scale adaptation. The algorithm in this paper is evaluated by the public video sequences in OTB-2013. Numerous experimental results demonstrate that the proposed tracking method can achieve target scale adaptation and effectively improve the tracking accuracy.
Recently, Transformer-based methods have shown impressive performance in single image super-resolution (SISR) tasks due to the ability of global feature extraction. However, the capabilities of Transformers that need ...
详细信息
Exudate detection in fundus images is an important task for the screening of people with diabetic retinopathy. In this paper, Convolutional Neural Network (CNN) is used to detect the exudates in fundus images. An auxi...
详细信息
Compared to conventional speech synthesis, end-to-end speech synthesis has achieved much better naturalness with more simplified system building pipeline. End-to-end framework can generate natural speech directly from...
详细信息
ISBN:
(数字)9781728103068
ISBN:
(纸本)9781728103075
Compared to conventional speech synthesis, end-to-end speech synthesis has achieved much better naturalness with more simplified system building pipeline. End-to-end framework can generate natural speech directly from characters for English. But for other languages like Chinese, recent studies have indicated that extra engineering features are still needed for model robustness and naturalness, e.g, word boundaries and prosody boundaries, which makes the front-end pipeline as complicated as the traditional approach. To maintain the naturalness of generated speech and discard language-specific expertise as much as possible, in Mandarin TTS, we introduce a novel self-attention based encoder with learnable Gaussian bias in Tacotron. We evaluate different systems with and without complex prosody information and results show that the proposed approach has the ability to generate stable and natural speech with minimum language-dependent front-end modules.
A siamese network tracking algorithm based on hierarchical attention mechanism is proposed in this paper. In order to obtain more robust target tracking results, different layer features are fused effectively. In the ...
A siamese network tracking algorithm based on hierarchical attention mechanism is proposed in this paper. In order to obtain more robust target tracking results, different layer features are fused effectively. In the process of extracting features, attention mechanism is used to recalibrate the feature map, and AdaBoost algorithm is used to weight the target feature map, which improves the reliability of the response map. Besides, the Inception module is also introduced which not only increases the width of the network and the adaptability of the siamese network to the scale, but also reduces the parameters and improves the speed of network training. Experimental results show that this method can effectively solve the impact of background clutter and improve the accuracy of tracking.
暂无评论