We introduce the problem of detecting a group of students in classroom videos. The problem requires detecting students from different angles and separating the group from other groups in long videos (one to one and a half hours). We use multiple image representations to solve the problem: FM components to separate each group from background groups, AM-FM components to detect the back of the head, and YOLO for face detection. We validate our approach on classroom videos from four different groups. Our use of multiple representations is shown to be significantly more accurate than the use of YOLO alone.
ISBN: (digital) 9798331506100
ISBN: (print) 9798331506117
As an actuating component of robots, the electric cylinder must be controlled precisely for the robot itself to be controlled accurately. This article proposes a position closed-loop control method for electric cylinders based on third-order active disturbance rejection control (ADRC). First, the electric cylinder system is modeled and its transfer function is determined. Second, a PID controller and an ADRC controller are designed separately, and their stability is verified with the Routh criterion and the Lyapunov method, respectively. Finally, the control performance of PID and ADRC is compared in three sets of experiments. The results show that, compared with PID, ADRC is less affected by increases in input signal frequency, has stronger anti-interference ability, and can effectively resist the influence of internal uncertainty.
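To make the ADRC idea concrete, the following is a minimal sketch of a linear ADRC loop with a third-order extended state observer (ESO), using the common bandwidth parameterization. It is not the controller from the article: the second-order plant, its parameters `a`, `b`, `d`, the bandwidths `wc` and `wo`, and the function name `simulate_ladrc` are all assumptions chosen for illustration.

```python
def simulate_ladrc(reference=1.0, t_end=5.0, dt=0.001):
    """Simulate a linear ADRC position loop and return the final output.

    Hypothetical plant (an assumption, not the article's model):
        y'' = -a*y' + b*u + d
    The ESO treats f = -a*y' + d as an unknown "total disturbance".
    """
    a, b, d = 2.0, 5.0, 3.0        # plant damping, input gain, constant disturbance
    b0 = 5.0                       # controller's estimate of the input gain
    wc, wo = 10.0, 50.0            # controller and observer bandwidths (rad/s)
    kp, kd = wc ** 2, 2.0 * wc     # PD gains placing both closed-loop poles at -wc
    beta1, beta2, beta3 = 3.0 * wo, 3.0 * wo ** 2, wo ** 3  # ESO poles at -wo

    y = dy = 0.0                   # plant position and velocity
    z1 = z2 = z3 = 0.0             # ESO estimates of y, y', and f

    for _ in range(round(t_end / dt)):
        # Control law: PD on the estimated states, minus the estimated disturbance.
        u = (kp * (reference - z1) - kd * z2 - z3) / b0

        # ESO derivatives, driven by the output estimation error.
        e = y - z1
        z1_dot = z2 + beta1 * e
        z2_dot = z3 + b0 * u + beta2 * e
        z3_dot = beta3 * e

        # Plant acceleration.
        ddy = -a * dy + b * u + d

        # Forward-Euler integration of plant and observer.
        y += dt * dy
        dy += dt * ddy
        z1 += dt * z1_dot
        z2 += dt * z2_dot
        z3 += dt * z3_dot

    return y


if __name__ == "__main__":
    print(simulate_ladrc())
```

In this sketch the controller never uses `a` or `d` directly; the third observer state absorbs them, which is the mechanism behind the disturbance rejection the abstract describes.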
When deploying a Chinese neural Text-to-Speech (TTS) system, one of the challenges is to synthesize Chinese utterances with English phrases or words embedded. This paper looks into the problem in the encoder-decoder f...
At present, deep learning technology is widely used in ship target detection in synthetic aperture radar (SAR) images. However, high-resolution remote sensing SAR images cover a larger area and have larger image sizes...
In this study, we present a Danger Model immune algorithm based path planning algorithm (DMIA-PP) for robot path planning. Unlike the traditional immune algorithm, the system is not based on self-nonself mecha...
Color correlogram for content-based image retrieval (CBIR) characterizes not only the color distribution of pixels, but also the spatial correlation of pairs of colors. Color not only reflects the material of surface,...
ISBN: (print) 9781424456536
Intra-predictive transforms are block-based transforms that can exploit both intra- and inter-block correlations. This paper analyzes the coding gains of intra-predictive transforms for the Gaussian process. A tight upper bound on the coding gain is derived and shown to exceed the coding gains of both the discrete cosine transform and the Karhunen-Loève transform. The optimal intra-predictive transform that achieves the upper bound is also presented. Actual coding results on images verify the effectiveness of the optimal intra-predictive transform.
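For readers unfamiliar with the term, the sketch below computes the classical transform coding gain, the ratio of the arithmetic mean to the geometric mean of the transform-coefficient variances. This is the textbook definition and is assumed here to be the quantity the abstract refers to; the paper's upper bound and its optimal transform are not reproduced. The function name `coding_gain` is hypothetical.

```python
import math


def coding_gain(variances):
    """Classical transform coding gain for a list of positive coefficient variances.

    Returns arithmetic_mean / geometric_mean, which is 1.0 when all variances
    are equal and grows as the transform compacts energy into fewer coefficients.
    """
    n = len(variances)
    arithmetic_mean = sum(variances) / n
    # The geometric mean is computed in the log domain to avoid overflow.
    geometric_mean = math.exp(sum(math.log(v) for v in variances) / n)
    return arithmetic_mean / geometric_mean


if __name__ == "__main__":
    print(coding_gain([1.0, 1.0, 1.0, 1.0]))  # equal variances, no gain
    print(coding_gain([4.0, 1.0]))            # unequal variances, gain above 1
```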
Image fusion is an important approach to producing a single complete image that preserves all relevant information from different *** this paper, we propose a support value transform-based multi-focus image fusion method, in which the fused saliency features are represented by support values. Based on the mapped least squares support vector machine, the support value transform is developed as a multi-scale analysis *** fusing results on multi-focus images demonstrate that the proposed image fusion method is effective and efficient.
By studying the Brushlet-domain coefficients of texture images, we found that the magnitudes of the Brushlet-domain coefficients roughly follow a Rayleigh distribution. And there are correlations betwee...
A noise erosion operator based on a partial differential equation (PDE) is introduced, which removes noise and preserves edges well for two-dimensional (2D) gradient data. The operator is applied to estimate a new diffusion coefficient. Experimental results demonstrate that anisotropic diffusion based on this new erosion operator can efficiently reduce noise and sharpen object boundaries.
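As background, the sketch below implements classic Perona-Malik anisotropic diffusion, the baseline scheme in which a gradient-dependent diffusion coefficient slows smoothing across edges. It does not implement the erosion operator from the abstract, whose definition is not given here; the exponential coefficient, the parameter values, and the function name `perona_malik` are assumptions for illustration.

```python
import math


def perona_malik(img, n_iter=10, kappa=0.1, step=0.2):
    """Classic Perona-Malik diffusion on a 2D list of floats.

    kappa sets the edge threshold: differences much larger than kappa are
    treated as edges and barely diffused. step must stay at or below 0.25
    for this explicit 4-neighbour scheme to remain stable.
    """
    h, w = len(img), len(img[0])
    cur = [row[:] for row in img]
    for _ in range(n_iter):
        nxt = [row[:] for row in cur]
        for y in range(h):
            for x in range(w):
                centre = cur[y][x]
                flux = 0.0
                for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < h and 0 <= nx < w:
                        grad = cur[ny][nx] - centre
                        # Diffusion coefficient: near 1 in flat areas, near 0 at edges.
                        g = math.exp(-((grad / kappa) ** 2))
                        flux += g * grad
                nxt[y][x] = centre + step * flux
        cur = nxt
    return cur


if __name__ == "__main__":
    # A vertical step edge with one noisy pixel in the dark half.
    image = [[0.0] * 3 + [1.0] * 3 for _ in range(6)]
    image[1][1] = 0.05
    smoothed = perona_malik(image, n_iter=20, kappa=0.2, step=0.2)
    print(smoothed[1][1], smoothed[2][2], smoothed[2][3])
```

The usage example shows the intended behaviour: the isolated noisy pixel is flattened while the step edge between the two halves stays sharp. A method like the one in the abstract would replace the coefficient `g` with one estimated from its erosion operator.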