Audiovisual speech synchrony detection is an important part of talking-face verification systems. Prior work has primarily focused on visual features and joint-space models, while standard mel-frequency cepstral coeff...
详细信息
ISBN:
(纸本)9781538663967
Audiovisual speech synchrony detection is an important part of talking-face verification systems. Prior work has primarily focused on visual features and joint-space models, while standard mel-frequency cepstral coefficients (MFCCs) have been commonly used to present speech. We focus more closely on audio by studying the impact of context window length for delta feature computation and comparing MFCCs with simpler energy-based features in lip-sync detection. We select state-of-the-art hand-crafted lip-sync visual features, space-time auto-correlation of gradients (STACOG), and canonical correlation analysis (CCA), for joint-space modeling. To enhance joint space modeling, we adopt deep CCA (DCCA), a nonlinear extension of CCA. Our results on the XM2VTS data indicate substantially enhanced audiovisual speech synchrony detection, with an equal error rate (EER) of 3.68%. Further analysis reveals that failed lip region localization and beard-edness of the subjects constitutes most of the errors. Thus, the lip motion description is the bottleneck, while the use of novel audio features or joint-modeling techniques is unlikely to boost lip-sync detection accuracy further.
data Mining, which is known as knowledge discovery in databases has been defined as the nontrivial extraction of implicit, previous unknown and potentially useful information from data. It uses machine learning, stati...
详细信息
ISBN:
(纸本)0769529941
data Mining, which is known as knowledge discovery in databases has been defined as the nontrivial extraction of implicit, previous unknown and potentially useful information from data. It uses machine learning, statistical and visualization techniques to discover and present knowledge in a form which is easily comprehensible to human. In the paper the authors first introduce the idea, basic concept and process of data mining, then, an example and methods of the application of data mining in physical statistics are analyzed. data mining is applied in physical training and evaluation, such as constitution data analyzing, PE industry and competitive sports. Thus;we think data mining becomes an important task of the scientific research of sports topic in future.
Text has become a major part of our life. We use text as way to communicate and express ourselves in this sophisticated world. Since we are predicting the next probable "word" we will be using text as input ...
详细信息
ISBN:
(纸本)9781665428644
Text has become a major part of our life. We use text as way to communicate and express ourselves in this sophisticated world. Since we are predicting the next probable "word" we will be using text as input data for our algorithm. Predictive typing is basically predicting the most probable word following the given set of words or a particular word. In this contemporary world where mobile devices and computers dominate communication, like emails, social media and even blogs and newsletters, we require a method for speeding up the responding process to keep ourselves with the fast-paced world. Predictive typing is a widely used procedure for faster and eloquent communication but the robustness and scalability along with the speed is always a concern. The main aim of our algorithm is to develop a next word predictor that helps to significantly reduce the number of key-strokes by the user along with accuracy. This predictive typing algorithm's core is based on the N-grams model.
The analog data acquisition system developed in this paper is integrated into the remote interface unit of aircraft. It uses the high certainty and high real-time performance of the IEEE-1394B bus to realize informati...
详细信息
Discrete Trigonometric Sine Transforms of type 7 (DST-7) recently got popular in coding of video. There is a direct relation between DCT-2 and DST-7 transforms. The relation permits joint computation of these two tran...
详细信息
MEMS sensors have been used in many applications including navigation systems. However, these sensors suffer from highly noisy measurements. If left untreated, these errors will significantly degrade the ultimate navi...
详细信息
ISBN:
(纸本)9781728120850
MEMS sensors have been used in many applications including navigation systems. However, these sensors suffer from highly noisy measurements. If left untreated, these errors will significantly degrade the ultimate navigational solution. Hence, applying a pre-filtering technique becomes a necessity to de-noise these sensor signals to improve the overall system performance. While wavelet denoising is the most common technique for sensor data pre-filtering, it may not be suitable for real-time implementations. This paper explores another method;namely, Savitzky-Golay filters, which can provide competitive denoising performance with a less computationally demanding algorithm. The purpose of the paper is to examine the performance of the new method against wavelet de-noising with respect to both positioning and attitude accuracy and computations time. We applied the filter to denoise MEMS-based inertial sensors data in a tightly coupled integrated INS/GPS system. Our results showed that the new method outperformed the wavelet denoising approach. Moreover, the new method demands much less computations time, which makes it more suitable for embedded systems and real-time applications.
作者:
Zheng, YangfengSUN YAT SEN UNIV
Sch Data & Comp Sci Guangzhou Higher Educ Mega Ctr Univ Town 132 East Waihuan Rd Guangzhou 510006 Peoples R China
As the size of training dataset of face recognition models becomes larger and larger, we are interested in a method called Average-Half-Face (AHF), which could halve the size of training samples. The AHF method divide...
详细信息
As the size of training dataset of face recognition models becomes larger and larger, we are interested in a method called Average-Half-Face (AHF), which could halve the size of training samples. The AHF method divides a full face into two halves and then averages them together (reversing the columns of one of the halves). We preprocess the dataset with the method of AHF, and train them on two different models, Eigenfaces and Convolutional Neural Network (CNN). We compare the prediction results with those models trained on the original dataset. Previous researches showed that AHF is superior to Full-Face (FF), while our experiment results further showed that in some cases AHF also boosts CNN. The application of AHF can bring both saving in storage and reduction on training cost time.
Motivated by the remarkable performance achieved using deep learning strategies in solving action recognition tasks, an effective, yet simple method is proposed for encoding the spatiotemporal information of skeleton ...
详细信息
ISBN:
(纸本)9781538663967
Motivated by the remarkable performance achieved using deep learning strategies in solving action recognition tasks, an effective, yet simple method is proposed for encoding the spatiotemporal information of skeleton sequences into color texture images, referred to as Skeletal Optical Flows (SOFs). SOFs collectively represent the kinetic energy, predefined angles and pair-wise displacements between joints over consecutive frames of skeleton data, as color variations to capture meaningful temporal information and make them highly interpretable. A novel Convolutional Neural Network with Correctness-Vigilant Regularizer (CVR-CNN) is then employed to exploit the discriminative features of SOFs for human action recognition. Empirical results show that the efficiency of the proposed method is superior in terms of the generalizability of the generated model, the training convergence speed, and the resulting classification accuracy on commonly used action recognition datasets, such as MHAD, HDM05 and NTU RGB+D.
Biological network inference has always been one of the central topics in systems biology. Network inference can be regarded as a process of determining relations between nodes with efficient measurements. For gene re...
详细信息
ISBN:
(纸本)9781450376259
Biological network inference has always been one of the central topics in systems biology. Network inference can be regarded as a process of determining relations between nodes with efficient measurements. For gene regulatory networks, transcriptomic data such as single cell RNA sequencing (sc RNA-seq) have increasingly act as the main information source in reconstructing network structures. Although many methods have been proposed towards this challenge, most of them do not focus on sing-cell data and omit the characteristics of gene regulatory networks. Here, we presented a new method names ICGNI to solve these problems about gene functional clustering, network inference with single-cell data and hub genes finding. Three single cell datasets were used to evaluate the performance of our method with satisfying results.
This paper proposes an autonomous approach for repairing leakage at the international Space Station (ISS) using the NASA Astrobee robot with the 3rd Kibo Robot Programming Challenge (KRPC) guidelines provided by JAXA....
详细信息
ISBN:
(纸本)9798350307573;9798350307566
This paper proposes an autonomous approach for repairing leakage at the international Space Station (ISS) using the NASA Astrobee robot with the 3rd Kibo Robot Programming Challenge (KRPC) guidelines provided by JAXA. We bring into play image processing and analysis from the Astrobee Nav Cam data, circle detection algorithms, AR tag detection techniques, autonomous navigation, and a Visual Projective Localization And Mapping (VPLAM) approach for irradiating lasers with excellent precision at the Astrobee Robot Simulator. In addition, we discuss the various capabilities of the Astrobee Robots onboard the ISS and how they can facilitate the delivery of such an autonomous mission in emergencies. Lastly, we walk through ways to deal with randomness and uncertainty while operating, ensuring robustness and accuracy. Ultimately, we scored a precision of up to 0.03 cm on the KRPC online simulator.
暂无评论