With two-dimensional structures, mathematical expressions deliver more information than normal text not only with symbols but also their spatial arrangements. Users of mathematical retrieval systems need more search m...
详细信息
Ordinal decision tree (ODT) can effectively deal with monotonic classification problems. However, it is difficult for the existing ordinal decision tree algorithms to learning ODT from large data sets. Based on the va...
详细信息
ISBN:
(纸本)9781479986989
Ordinal decision tree (ODT) can effectively deal with monotonic classification problems. However, it is difficult for the existing ordinal decision tree algorithms to learning ODT from large data sets. Based on the variable consistency dominance based rough set approach (VC-DRSA), an ordinal random forest algorithm is proposed in this paper. Combining with the computing framework of MapReduce, the proposed ordinal random forest algorithm is paralleled on the platform of Hadoop, which improves the efficiency of the proposed algorithm. The feasibility and effectiveness of the proposed algorithm is verified by the experimental results.
Patch-level features are essential for achieving good performance in computer vision tasks. Besides well-known pre-defined patch-level descriptors such as scaleinvariant feature transform (SIFT) and histogram of orien...
详细信息
In this paper, we propose a new method based on Chinese keyword search to select the WAV or MP3 files in audio post-production. First, we listen to each file and label it with Chinese characters, and then classify and...
详细信息
A core set extreme learningmachine(CSELM) approach is proposed in order to deal with large datasets classification problem. In the first stage, the core set can be obtained efficiently by using the generalized core v...
详细信息
A core set extreme learningmachine(CSELM) approach is proposed in order to deal with large datasets classification problem. In the first stage, the core set can be obtained efficiently by using the generalized core vector machine(GCVM) algorithm. For the second stage, the extreme learningmachine(ELM) can be used to implement classification for much larger datasets. Experiments show that the CSELM has comparable performance with SVM and ELM implementations, but is faster on large datasets.
This paper proposes an unsupervised bottom-up boundary detection algorithm, which is an improved surround suppression model based on orientation contrast. First, the candidate boundary set is obtained by the edge focu...
详细信息
ISBN:
(纸本)9781467322164
This paper proposes an unsupervised bottom-up boundary detection algorithm, which is an improved surround suppression model based on orientation contrast. First, the candidate boundary set is obtained by the edge focusing algorithm. Second, the orientation contrast map is constructed using the response of Gabor filter. The suppression term is computed on orientation contrast map using steerable filter, which can effectively differentiate step edge from texture edge. Using low-level image features, the boundary map can be used as preprocessing step for image segmentation and/or object detection. The detection approach has been validated on Rug dataset and the average of figure of merit shows an improvement of 15%.
In many database applications, ranking queries may reference both text and numeric attributes, where the ranking functions are based on both semantic distances/similarities for text attributes and numeric distances fo...
详细信息
In many database applications, ranking queries may reference both text and numeric attributes, where the ranking functions are based on both semantic distances/similarities for text attributes and numeric distances for numeric attributes. In this paper, we propose a new method for evaluating such type of ranking queries over a relational database. By statistics and training, this method builds a mechanism that combines the semantic and numeric distances, and the mechanism can be used to balance the effects of text attributes and numeric attributes on matching a given query and tuples in database search. The basic idea of the method is to create an index based on WordNet to expand the tuple words semantically for text attributes and on the information of numeric attributes. The candidate results for a query are retrieved by the index and a simple SQL selection statement, and then top-N answers are obtained. The results of extensive experiments indicate that the performance of this new strategy is efficient and effective.
In this paper, we propose a new method based on index to realize IR-style Chinese keyword search with ranking strategies in relational databases. This method creates an index by using the related information of tuple ...
详细信息
In this paper, we propose a new method based on index to realize IR-style Chinese keyword search with ranking strategies in relational databases. This method creates an index by using the related information of tuple words and presents a ranking strategy in terms of the nature of Chinese words. For a Chinese keyword query, the index is used to match query search words and the tuple words in index quickly, and to compute similarities between the query and tuples by the ranking strategy, and then the set of identifiers of candidate tuples is generated. Thus, we retrieve top-N results of the query using SQL selection statements and output the ranked answers according to the similarities. The experimental results show that our method is efficient and effective.
In this paper, we propose a new method based on index to realize IR-style Chinese keyword search with ranking strategies in relational databases. This method creates an index by using the related information of tuple ...
详细信息
In this paper, we propose a new method based on index to realize IR-style Chinese keyword search with ranking strategies in relational databases. This method creates an index by using the related information of tuple words and presents a ranking strategy in terms of the nature of Chinese words. For a Chinese keyword query, the index is used to match query search words and the tuple words in index quickly, and to compute similarities between the query and tuples by the ranking strategy, and then the set of identifiers of candidate tuples is generated. Thus, we retrieve top-N results of the query using SQL selection statements and output the ranked answers according to the similarities. The experimental results show that our method is efficient and effective.
In many database applications, ranking queries may reference both text and numeric attributes, where the ranking functions are based on both semantic distances/similarities for text attributes and numeric distances fo...
详细信息
In many database applications, ranking queries may reference both text and numeric attributes, where the ranking functions are based on both semantic distances/similarities for text attributes and numeric distances for numeric attributes. In this paper, we propose a new method for evaluating such type of ranking queries over a relational database. By statistics and training, this method builds a mechanism that combines the semantic and numeric distances, and the mechanism can be used to balance the effects of text attributes and numeric attributes on matching a given query and tuples in database search. The basic idea of the method is to create an index based on WordNet to expand the tuple words semantically for text attributes and on the information of numeric attributes. The candidate results for a query are retrieved by the index and a simple SQL selection statement, and then top-N answers are obtained. The results of extensive experiments indicate that the performance of this new strategy is efficient and effective.
暂无评论