Analyzing semi-structured Chinese documents is among the most difficult tasks because of their complex structure and the semantics of Chinese. Based on the generic characteristics of semi-structured documents and the specific characteristics of resume documents, this paper studies resume block analysis based on pattern matching and also proposes multi-level information identification and feedback control algorithms. Building on this research, a resume parser system was implemented for ChinaHR, which is the biggest recruitment website. It can automatically read, analyze, retrieve, and store resume information. Experimental results show that the accuracy and efficiency of the system generally satisfy practical requirements. As research on semi-structured document processing, this work can guide further research on resume analysis and serve as a reference for other forms of semi-structured documents.
Ship detection using high-resolution remote sensing images is an important task, which contributes to sea surface regulation. The complex background and special visual angle make ship detection rely on high-quality d...
Human parsing is an essential branch of semantic segmentation: a fine-grained segmentation task that identifies the constituent parts of the human body. The challenge of human parsing is to extract effective sema...
A novel text-independent speaker identification (SI) method is proposed. This method uses the Mel-frequency Cepstral coefficients (MFCCs) and the dynamic information among adjacent frames as feature sets to capture sp...
In this paper we propose an end-to-end speech language identification (SLD) approach for short utterances based on a Long Short-Term Memory (LSTM) neural network, which is especially suitable for SLD applications in intelligen...
Recommender systems are increasingly important with the development of e-commerce, news, and multimedia applications. Traditional recommendation algorithms, such as collaborative-filtering-based and graph-based methods, mainly use items' original attributes and the relationships between items and users, ignoring the chronological order of items in browsing sessions. In recent years, RNN-based methods have shown their superiority in dealing with sequential data, and several modified RNN models have been proposed. However, these RNN models use only the sequence order of items and neglect how long each item was browsed. It is widely accepted that users tend to spend more time on items that interest them, and that these items are closely related to the users' current target. Items' browsing time is therefore an important feature for recommendation. In this paper, we propose a modified RNN-based recommender system called TA4Rec, which recommends the item most likely to be clicked next. Our main contribution is a method that calculates time-attention factors from the browsing duration of items and adds these factors to the RNN-based model. We conduct experiments on the RecSys Challenge 2015 dataset, and the results show that TA4Rec clearly improves on the classic session-based recommendation method.
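The abstract does not give the formula for the time-attention factors, so the following is only an illustrative sketch of the general idea, not TA4Rec itself. It assumes the factors are a softmax over log-compressed dwell times within one session and that they scale item embeddings before an RNN sees them; the function names, the `temperature` parameter, and the log compression are all hypothetical choices.

```python
import math


def time_attention_factors(dwell_seconds, temperature=1.0):
    """Turn per-item dwell times into weights that sum to 1.

    Assumption (not from the paper): log1p compresses long dwell times,
    and a softmax normalizes the result within the session.
    """
    if not dwell_seconds:
        raise ValueError("session must contain at least one item")
    if any(t < 0 for t in dwell_seconds):
        raise ValueError("dwell times must be non-negative")
    logits = [math.log1p(t) / temperature for t in dwell_seconds]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def weight_embeddings(embeddings, factors):
    """Scale each item embedding by its factor (hypothetical way of
    feeding the factors into a sequence model)."""
    return [[f * v for v in vec] for vec, f in zip(embeddings, factors)]
```

Under these assumptions, an item browsed for 60 seconds receives a larger factor than items browsed for 5 seconds, so it contributes more to whatever the downstream model computes from the weighted embeddings.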
An efficient reconfigurable VLSI architecture is proposed for the 1-D 5/3 and 9/7 wavelet transforms adopted in the JPEG2000 proposal, based on the lifting scheme. An embedded decimation technique based on folding and time multiplexing, together with an embedded boundary data extension technique, is adopted to optimize the design. These techniques significantly reduce the required numbers of multipliers, adders, and registers, as well as the number of external memory accesses, and thereby lower the hardware cost and power consumption of the design. The architecture generates one output per clock cycle, and the detail component and the approximation of the input signal are available alternately. Experimental simulation and comparison results demonstrate that the proposed architecture has lower hardware complexity and is therefore suited to embedded applications. The architecture is simple, regular, and scalable, and well suited for VLSI implementation.
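For readers unfamiliar with the lifting scheme, the sketch below shows one level of the reversible integer 5/3 transform in software. It is a reference illustration of the predict and update steps with symmetric boundary extension, not the hardware architecture described above; the function names are illustrative, and the code assumes an even-length input.

```python
def lift53_forward(x):
    """One level of the reversible integer 5/3 lifting transform.

    Returns (approx, detail). Assumes an even-length list of integers;
    boundaries use symmetric (mirror) extension.
    """
    n = len(x)
    if n < 2 or n % 2:
        raise ValueError("expects an even-length signal with at least 2 samples")
    half = n // 2
    even, odd = x[0::2], x[1::2]

    # Predict: each odd sample minus the floor-average of its even neighbors.
    detail = []
    for i in range(half):
        right = even[i + 1] if i + 1 < half else even[i]  # mirror at the right edge
        detail.append(odd[i] - (even[i] + right) // 2)

    # Update: each even sample plus a rounded quarter of neighboring details.
    approx = []
    for i in range(half):
        left = detail[i - 1] if i > 0 else detail[0]  # mirror at the left edge
        approx.append(even[i] + (left + detail[i] + 2) // 4)
    return approx, detail


def lift53_inverse(approx, detail):
    """Undo lift53_forward exactly by reversing the two lifting steps."""
    half = len(approx)
    even = []
    for i in range(half):
        left = detail[i - 1] if i > 0 else detail[0]
        even.append(approx[i] - (left + detail[i] + 2) // 4)
    x = []
    for i in range(half):
        right = even[i + 1] if i + 1 < half else even[i]
        x.append(even[i])
        x.append(detail[i] + (even[i] + right) // 2)
    return x
```

Because each lifting step only adds an integer function of the other half of the samples, the inverse subtracts the same quantity and recovers the input exactly. The approximation and detail outputs here correspond to the two components that the architecture is said to deliver alternately.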
With the huge amount of observed air quality and component data, analyzing and tracing the pollutant diffusion path is a great challenge. Partitioning the air pollution sources (air quality observation stations) into subnetworks helps considerably in tracing the diffusion path. Conventional methods for clustering air pollution sources, which are based on geography or pollutant levels, correlate weakly with pollution transmission links. To overcome this problem, this paper introduces a method for clustering air pollution sources via the activation force (AF) model. We model the connections between pollution sources by AF so that the relationships among the observation stations and the coincidence of the transmission links can be modeled effectively. With the affinity matrix obtained via AF modeling, we cluster the air pollution sources using a modularity measure. Compared with plain K-means clustering based on the air quality index of pollutants, the proposed approach shows several advantages in air pollution network clustering.
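To make the modularity measure concrete, the sketch below computes Newman's modularity Q for a given partition of a weighted affinity matrix. It assumes a symmetric, non-negative matrix, which may not hold for an AF-derived affinity matrix, and it only scores a partition rather than searching for the best one; the function name and the toy matrix are illustrative.

```python
def modularity(affinity, communities):
    """Newman modularity Q of a partition.

    affinity:    symmetric matrix (list of lists) of non-negative weights
    communities: communities[i] is the community label of node i
    """
    n = len(affinity)
    two_m = sum(sum(row) for row in affinity)  # total weight, each edge counted twice
    if two_m == 0:
        raise ValueError("affinity matrix has no edges")
    strength = [sum(row) for row in affinity]  # weighted degree of each node
    q = 0.0
    for i in range(n):
        for j in range(n):
            if communities[i] == communities[j]:
                # observed weight minus the weight expected by chance
                q += affinity[i][j] - strength[i] * strength[j] / two_m
    return q / two_m


# Toy example: stations 0-1 are linked, stations 2-3 are linked.
toy = [
    [0, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 1],
    [0, 0, 1, 0],
]
```

On the toy matrix, grouping the stations as `[0, 0, 1, 1]` scores higher than putting all four in one group, which is the behavior a modularity-based clustering relies on when it searches over partitions.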
Content-based image retrieval (CBIR) is an application of computer vision techniques to the image retrieval problem, that is, the problem of searching for digital images in large databases. In this paper, we apply an image segmentation technique to an image retrieval system designed for use on mobile devices. For an image captured by a mobile device, the segmentation technique uses edge detection and region merging mechanisms to extract the ROI from a complex background. The proposed method automatically merges the regions initially segmented by mean shift segmentation, and then effectively extracts the object contour by labeling the regions as either background or ***. With no user interaction, the experimental results show that the method is more effective than other automatic segmentation methods.
In a topic search system, some of the web pages obtained by crawling are inconsistent with user ***. To address this situation, this paper studies content-based web filtering. The paper proposes a dual feature selection method based on the CHI statistical method and N-grams, and then performs binary text classification with an SVM to achieve web filtering. The experiments show that the proposed web filtering method achieves better results.
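The abstract does not describe how the dual feature selection combines CHI and N-grams, so the sketch below shows only the standard building block: scoring character N-gram features with the chi-square (CHI) statistic for a binary relevant/irrelevant labeling and keeping the top-scoring ones. The function names and the toy documents are illustrative; the selected features would then be passed to a classifier such as an SVM, which is omitted here.

```python
from collections import Counter


def char_ngrams(text, n=2):
    """Set of character n-grams that occur in the text."""
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def chi_square_scores(docs, labels, n=2):
    """CHI score of every n-gram for a binary labeling.

    For a term t: a = relevant docs containing t, b = irrelevant docs
    containing t, c = relevant docs without t, d = irrelevant docs without t.
    CHI = N * (a*d - c*b)^2 / ((a+c) * (b+d) * (a+b) * (c+d)).
    """
    total = len(docs)
    pos_total = sum(1 for y in labels if y)
    neg_total = total - pos_total
    pos_df, neg_df = Counter(), Counter()
    for text, y in zip(docs, labels):
        (pos_df if y else neg_df).update(char_ngrams(text, n))
    scores = {}
    for term in set(pos_df) | set(neg_df):
        a, b = pos_df[term], neg_df[term]
        c, d = pos_total - a, neg_total - b
        denom = (a + c) * (b + d) * (a + b) * (c + d)
        scores[term] = total * (a * d - c * b) ** 2 / denom if denom else 0.0
    return scores


def select_top_k(scores, k):
    """Keep the k highest-scoring terms; ties are broken alphabetically."""
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


# Toy corpus: label 1 = on-topic page, label 0 = off-topic page.
toy_docs = ["aaaa", "aaab", "bbbb", "bbba"]
toy_labels = [1, 1, 0, 0]
toy_scores = chi_square_scores(toy_docs, toy_labels)
```

In the toy corpus the bigrams `aa` and `bb` each appear in every page of exactly one class, so they receive the highest CHI scores and are the ones `select_top_k(toy_scores, 2)` keeps.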