In this paper, we present a scheme of similarity measure learning based on kernel optimization. Employing a data-dependent kernel model, the proposed scheme optimizes the spatial distribution of the training data in t...
详细信息
In this paper, we present a scheme of similarity measure learning based on kernel optimization. Employing a data-dependent kernel model, the proposed scheme optimizes the spatial distribution of the training data in the feature space, aiming to maximize the class separability of the data in the feature space. The learned similarity measure, derived from the optimized kernel, exhibits a favorable feature to the task of pattern classification, that the spatial resolution of the embedding space is expanded around the boundary areas, and shrunk around the homogeneous areas. Experiments demonstrate that using the learned similarity measure can substantially improve the performances of the K-nearest-neighbor classifier.
How does man's vision system work? In some cross research fields like neurobiology, psychology and robotics researchers have been work hard to answer the question for long time. Now on visual cortex neuroscience h...
详细信息
How does man's vision system work? In some cross research fields like neurobiology, psychology and robotics researchers have been work hard to answer the question for long time. Now on visual cortex neuroscience has accumulate much experimental data and some theories like information redundancy reduction, sparse coding have given their interpretation of experiments, but understanding information processing as a whole , especially to make a representation of image with basic conceptions of receptive field and direction column, is still a difficult task. In our work, together with consideration of psychology and sparse coding a multi-resolution statistics scheme is given, signal grads statistics is carried out according to resolution level, strength and direction in space respectively. By comparing the distribution of nerve cell on visual cortex with one of neural network which works follow multi-resolution statistics, the similarity of both tell the arithmetic meanings of receptive field and direction column. With modern neuroscience experimental means the validation of the point of view in this article may be done in principle.
This paper presents a high-speed video transfer scheme and a real-time infrared spots detection algorithm designed for field programmable gate array (FPGA) implementation. Rather than IEEE 1394a, two IEEE 1394b interf...
详细信息
This paper presents a high-speed video transfer scheme and a real-time infrared spots detection algorithm designed for field programmable gate array (FPGA) implementation. Rather than IEEE 1394a, two IEEE 1394b interfaces are alternatively used to ensure high-resolution image transfer in real time. In order to execute fast infrared spots detection, a parallel algorithm that processes four pixels per clock cycle is proposed. It detects infrared spots in a single pass over a frame and its implementation is only composed of combinatorial logic and registers. Furthermore, the execution time of the algorithm is independent of image content. A prototype system is implemented in an FPGA device. It is capable of transferring 1024 × 768 images smoothly at 60 fps and detecting infrared sports in a 1024 × 768 image within 1.966ms, demonstrating its superiority over the existing multi-pass algorithms and some other one-pass algorithms. Details of software and hardware architecture are discussed in this paper.
In multi-camera surveillance systems, it is important to track the same person across multiple cameras. It is also desirable to recognize the individuals who have been previously observed in a single-camera system. Th...
详细信息
In multi-camera surveillance systems, it is important to track the same person across multiple cameras. It is also desirable to recognize the individuals who have been previously observed in a single-camera system. The method that represents a object image using a bag of visual words has been commonly used in image retrieval applications. For recognizing people, it can outperform the methods mainly based on global appearance like color histogram, and fit better to low-quality images compared to biometric features such as face and gait. In this paper we study the details in feature extraction, vocabulary building and classifier learning of the bag-of-features approach for classifying tracks of different individuals. Based on this approach, we design a online system applying incremental support vector machine learning with a decision scheme to distinguish reoccurrences from new targets. We get promising results from the evaluation with more than 100 tracks of 50 different people.
In this paper, we describe an experimental investigation to evaluate the significance of different facial regions of a person in the task of gender classification. For this purpose we use a support vector machine (SVM...
详细信息
In this paper, we describe an experimental investigation to evaluate the significance of different facial regions of a person in the task of gender classification. For this purpose we use a support vector machine (SVM) classifier on face images for gender classification. We perform experiments using different facial regions of varying resolution so that the significance of facial regions in this application can be assessed. According to the results obtained, the upper region of the face proved to be the most significant for the task of gender classification. Moreover, the changes in the resolution of the facial region images do not produce significant changes in the result. Based on the significance of different facial regions, we propose a gender classification method based on fusion of multiple facial regions and show that this method is able to compensate for facial expressions and lead to better overall performance.
A novel approach of pose estimation is proposed for the object with surface of revolution(SOR). The silhouette of the object is the only information necessary for this method and no cross section circle (latitude circ...
详细信息
ISBN:
(纸本)9781424456536;9781424456543
A novel approach of pose estimation is proposed for the object with surface of revolution(SOR). The silhouette of the object is the only information necessary for this method and no cross section circle (latitude circle) is needed. In this article, we explain the property of tangent circle and use it to establish constraint between two images of object with different poses. Such constraint can help to solve the pose of object in both images. We test our method with a simulation experiment and use it to estimate the pose for both rigid body and articulated object.
Faults of sensor data will always present in sensor networks because of unreliable communication links, measurement interference and harsh environment. Developing fusion algorithms that can tolerate faults is necessar...
Faults of sensor data will always present in sensor networks because of unreliable communication links, measurement interference and harsh environment. Developing fusion algorithms that can tolerate faults is necessary for reliable sensor network applications. In this paper, we study the fault tolerant fusion for moving vehicle classification based on Marzullo's interval fusion algorithm. The unreliable sensor data are represented using interval estimations. To reduce communication cost, quantized interval representation is adopted. Simulation results demonstrate the validity of the interval fusion algorithm. By using quantized representation, the communication cost is reduced.
We address the issue of markless human motion capture by voxel labeling. We explore the problem of pose estimation from voxel cloud based on a predefined human model. First, voxels are labeled into different individua...
详细信息
ISBN:
(纸本)9780769552378;9780769538839
We address the issue of markless human motion capture by voxel labeling. We explore the problem of pose estimation from voxel cloud based on a predefined human model. First, voxels are labeled into different individual parts of the body. Then, the joints are extracted from labeled voxels. Finally, the joint angles are estimated from the joints. Tested on the voxel data in our experiments, our algorithm is insensitive to noise and achieve fine accuracy in pose estimation.
B-factor reflects the atom's uncertainty about its average position within a crystal structure and is highly correlated with protein functions. In this article, we propose a novel approach to predict the real valu...
详细信息
B-factor reflects the atom's uncertainty about its average position within a crystal structure and is highly correlated with protein functions. In this article, we propose a novel approach to predict the real value of B-factor. We firstly extract features from the protein sequences and their evolution information, then apply random forest tree to select the important features, which are further inputted to a two-stage support vector regression (SVR) for prediction. Our results have revealed that a systematic analysis of the importance of different features makes us have deep insights into the different contributions of features and is very necessary for developing effective B-factor prediction tools. We thus develop an online Web server, which is freely available at http://***/bioinf/PredBF for academic use.
This paper is about a vision-based system that automatically monitors intermodal freight trains for the quality of how the loads (containers) are placed along the train. An accurate and robust algorithm to segment the...
详细信息
暂无评论