For high dimensional data, the redundant attributes of samplers will not only increase the complexity of the calculation, but also affect the accuracy of final result. The existing attribute reduction methods are enco...
详细信息
Web service composition is one of the fundamental technologies in implementing Service Oriented Architecture (SOA) based applications. Description Logic based Web service composition can easily express the static know...
详细信息
Increasingly widespread use of mobile devices for processing monetary transactions and accessing business secrets has created a great demand on securing mobile devices. Poorly designed authentication mechanisms (e.g.,...
详细信息
Grammar learning has been a bottleneck problem for a long time. In this paper, we propose a method of seman- tic separator learning, a special case of grammar learning. The method is based on the hypothesis that some ...
详细信息
Grammar learning has been a bottleneck problem for a long time. In this paper, we propose a method of seman- tic separator learning, a special case of grammar learning. The method is based on the hypothesis that some classes of words, called semantic separators, split a sentence into sev- eral constituents. The semantic separators are represented by words together with their part-of-speech tags and other infor- mation so that rich semantic information can be involved. In the method, we first identify the semantic separators with the help of noun phrase boundaries, called subseparators. Next, the argument classes of the separators are learned from cor- pus by generalizing argument instances in a hypernym space. Finally, in order to evaluate the learned semantic separators, we use them in unsupervised Chinese text parsing. The exper- iments on a manually lab.led test set show that the proposed method outperforms previous methods of unsupervised text parsing.
In this demo paper we present a multiscale browsing interface for handheld devices, in which the user can interactively change the scale of the storyboard to easily adjust the amount of information desired. Convention...
详细信息
Recently, more and more approaches are emerging to solve the cross-view matching problem where reference samples and query samples are from different views. In this paper, inspired by Graph Embedding, we propose a uni...
详细信息
Real-world problems usually exhibit dual-heterogeneity, i.e., every task in the problem has features from multiple views, and multiple tasks are related with each other through one or more shared views. To solve these...
详细信息
A motivation model is proposed in the paper. Based on the model we develop a motivational system for mind model CAM. Through the application in automatic navigation of animal robots shows the motivation system is usef...
详细信息
When hierarchical phrase-based statistical machine translation systems are used for language translation, sometimes the translations' content words were lost: source-side content words is empty when translated int...
详细信息
Cross-domain learning targets at leveraging the knowledge from source domains to train accurate models for the test data from target domains with different but related data distributions. To tackle the challenge of da...
详细信息
ISBN:
(纸本)9781577356332
Cross-domain learning targets at leveraging the knowledge from source domains to train accurate models for the test data from target domains with different but related data distributions. To tackle the challenge of data distribution difference in terms of raw features, previous works proposed to mine high-level concepts (e.g., word clusters) across data domains, which shows to be more appropriate for classification. However, all these works assume that the same set of concepts are shared in the source and target domains in spite that some distinct concepts may exist only in one of the data domains. Thus, we need a general framework, which can incorporate both shared and distinct concepts, for cross-domain classification. To this end, we develop a probabilistic model, by which both the shared and distinct concepts can be learned by the EM process which optimizes the data likelihood. To validate the effectiveness of this model we intentionally construct the classification tasks where the distinct concepts exist in the data domains. The systematic experiments demonstrate the superiority of our model over all compared baselines, especially on those much more challenging tasks.
暂无评论