检索结果-内蒙古大学图书馆

Combining the missing link: An incremental topic model of document content and hyperlink

作者： Ma, Huifang Li, Zhixin Shi, Zhongzhi Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences 100080 Beijing China Graduate School of the Chinese Academy of Sciences 100039 Beijing China

ISBN: (纸本)3642163262

The content and structure of linked information such as sets of web pages or research paper archives are dynamic and keep on changing. Even though different methods are proposed to exploit both the link structure and the content information, no existing approach can effectively deal with this evolution. We propose a novel joint model, called Link-IPLSI, to combine texts and links in a topic modeling framework incrementally. The model takes advantage of a novel link updating technique that can cope with dynamic changes of online document streams in a faster and scalable way. Furthermore, an adaptive asymmetric learning method is adopted to freely control the assignment of weights to terms and citations. Experimental results on two different sources of online information demonstrate the time saving strength of our method and indicate that our model leads to systematic improvements in the quality of classification. © 2010 IFIP.

关键词： Websites

来源：评论

学校读者我要写书评

暂无评论

An efficient data indexing approach on Hadoop using Java persistence API

引用

作者： Lai, Yang Zhongzhi, Shi Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences Beijing 100190 China Graduate University of Chinese Academy of Sciences Beijing 100039 China

ISBN: (纸本)3642163262

Data indexing is common in data mining when working with high-dimensional, large-scale data sets. Hadoop, a cloud computing project using the MapReduce framework in Java, has become of significant interest in distributed data mining. To resolve problems of globalization, random-write and duration in Hadoop, a data indexing approach on Hadoop using the Java Persistence API (JPA) is elaborated in the implementation of a KD-tree algorithm on Hadoop. An improved intersection algorithm for distributed data indexing on Hadoop is proposed, it performs O(M+logN), and is suitable for occasions of multiple intersections. We compare the data indexing algorithm on open dataset and synthetic dataset in a modest cloud environment. The results show the algorithms are feasible in large-scale data mining. © 2010 IFIP.

关键词： Data mining

来源：评论

学校读者我要写书评

暂无评论

Collaboration in agent grid based on dynamic description logics

引用

作者： Chen, Limin Shi, Zhongzhi Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences Beijing 100190 China Graduate University of Chinese Academy of Sciences Beijing 100049 China

ISBN: (纸本)3642163262

The global expansion of the Web brings the global computing;and the increasing number of problems with increasing complexity & sophistication also makes collaboration desirable. In this paper, we presented a semantics-based framework for collaborative problem solving in agent grid by coupling joint intention and dynamic description logics (DDL), our previous work to extend description logics (DL) with a dynamic dimension to model the dynamic world. Capabilities and attitudes of agents were captured by actions, and formulas in DDL respectively. Thus representation components in our framework were conferred with well-defined semantics by relating them to some domain ontologies. We could employ reasoning on actions in DDL to help agents to find proper colleagues when collaboration is necessary, and the philosophy underlying Joint Intention to bind their actions to achieve their overall goal. The main strengths of our framework include: i) finding probably helpful agents in a semantically accurate way due to the employment of semantic information;ii) going much closer to industrial implementations while retaining the main express power of classical joint intention model. © 2010 IFIP.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

The description logic for relational databases

引用

作者： Yue, Ma Yuming, Shen Yuefei, Sui Cungen, Cao Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences Beijing 100190 China Graduate University of Chinese Academy of Science Beijing 100039 China

ISBN: (纸本)3642163262

Description logics are widely used to express structured data and provide reasoning facility to query and integrate data from different databases. This paper presents a many-sorted description logic MDL to represent relational databases. We give a translation from relational databases to the description logic MDL, and show this translation completely and faithfully captures the information in the relational database. Moreover, we show that some relational algebra operations could be expressed in MDL. © 2010 IFIP.

关键词： Computer circuits

来源：评论

学校读者我要写书评

暂无评论

Requirement driven service composition: An ontology-based approach

引用

作者： Cai, Guangjun Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences Beijing 100190 China Graduate University of Chinese Academy of Sciences Beijing 100049 China

ISBN: (纸本)3642163262

Service-oriented computing is a new computing paradigm that utilizes services as fundamental elements for developing applications. Service composition plays a very important role in it. This paper focuses on service composition triggered by service requirement. Here, the processes modeling the requirement should be treated in parallel with describing service and a same ontology should be adopted for allowing the understanding between the requirement and services. An effect-based approach has been proposed based on our previous work on service description. This approach could be promising for tackling the challenge of services composition. © 2010 IFIP.

关键词： Ontology

来源：评论

学校读者我要写书评

暂无评论

Statistical translation model based on source syntax structure

PACLIC 24 - Proceedings of the 24th Pacific Asia Conference ...

引用

PACLIC 24 - Proceedings of the 24th Pacific Asia Conference on Language, information and Computation 2010年 61-61页

作者： Liu, Qun Liu, Yang Mi, Haitao Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences Beijing 100190 China

ISBN: (纸本)9784905166009

Syntax-based statistical translation model is proved to be better than phrasebased model, especially for language pairs with very different syntax structures, such as Chinese and English. In this talk I will introduce a serial of statistical translation models based on source syntax structure. The tree-based model uses the one best syntax tree for translation. The forest-based model uses a compact forest which encodes exponential number of syntax trees in a polynomial spaces and lead to better performance. The joint parsing and translation model produces source parse trees, using the source side of the translation rules instead of separate parsing rules, and generate translations on the target side simultaneously, which outperforms the forest-based model. Some extensions of these models are introduced also. © 2010 by Qun Liu, Yang Liu, and Haitao Mi.

关键词： Syntactics

来源：评论

学校读者我要写书评

暂无评论

The ICT Statistical Machine Translation System for IWSLT 2010 7

The ICT Statistical Machine Translation System for IWSLT 201...

引用

7th International Workshop on Spoken Language Translation, IWSLT 2010

作者： Xiong, Hao Xie, Jun Yu, Hui Liu, Kai Luo, Wei Mi, Haitao Liu, Yang Lü, Yajuan Liu, Qun Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences No.6 Kexueyuan South Road Haidian District P.O. Box 2704 Beijing100080 China

This paper illustrates the ICT Statistical Machine Translation system used in the evaluation campaign of the International Workshop on Spoken Language Translation 2010. We participate in the DIALOG tasks for Chinese-to-English and English-to-Chinese translation respectively. For both tasks, our system has achieved significant improvement with several effective methods as follows: 1) refining the data preprocessing, including Chinese word segmentation, named entity recognition, etc. 2) reducing the number of Out-of-Vocabulary(OOV) on the final test set by applying a fuzzy matching strategy. 3) considering generating a better input for the decoder from the N-best lists of ASR output as a special kind of translation task for the ASR task. 4) improving the performance of every single decoder, and reranking the n-best list for the final results submitted. © IWSLT 2010. All rights reserved.

关键词： Computer aided language translation

来源：评论

学校读者我要写书评

暂无评论

引用

2010 IEEE 5th International Conference on Bio-Inspired computing: Theories and Applications, BIC-TA 2010

作者： Zhang, Zhujin Wang, Shuo Zhang, Xingyi Zhang, Zheng Department of Control Science and Engineering Key Laboratory of Image Processing and Intelligent Control Huazhong University of Science and Technology Wuhan 430074 China School of Computer and Information Science University of South Australia Adelaide SA 5095 Australia Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education School of Computer Science and Technology Anhui University Hefei 230039 China

ISBN: (纸本)9781424464388

Randí et al. proposed a significant graphical representation for DNA sequences, which is very compact and avoids loss of information. In this paper, we build a fast algorithm for this graphical representation with time complexity O(n2), and find another important advantage in the representation: no degeneracy. Moreover, we propose a new method to do similarity analysis of DNA sequences based on the representation. The approach adopts four elements of covariance matrix as a descriptor, and is illustrated on the first exon of beta-globin genes from 11 different species. © 2010 IEEE.

关键词： DNA sequences

来源：评论

学校读者我要写书评

暂无评论

Manifold alignment via corresponding projections

Manifold alignment via corresponding projections

引用

2010 21st British Machine Vision Conference, BMVC 2010

作者： Zhai, Deming Li, Bo Chang, Hong Shan, Shiguang Chen, Xilin Gao, Wen School of Computer Science and Technology Harbin Institute of Technology China Digital Media Research Center Institute of Computing Technology CAS China Key Laboratory of Intelligent Information Processing Chinese Academy of Sciences China Institute of Digital Media Peking University China

ISBN: (纸本)1901725405

In this paper, we propose a novel manifold alignment method by learning the underlying common manifold with supervision of corresponding data pairs from different observation sets. Different from the previous algorithms of semi-supervised manifold alignment, our method learns the explicit corresponding projections from each original observation space to the common embedding space everywhere. Benefiting from this property, our method could process new test data directly rather than re-alignment. Furthermore, our approach doesn't have any assumption on the data structures, thus it could handle more complex cases and get better results compared with previous work. In the proposed algorithm, manifold alignment is formulated as a minimization problem with proper constraints, which could be solved in an analytical manner with closed-form solution. Experimental results on pose manifold alignment of different objects and faces demonstrate the effectiveness of our proposed method. © 2010. The copyright of this document resides with its authors.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Class compactness for data clustering

Class compactness for data clustering

引用

IEEE International Conference on information Reuse and Integration (IRI)

作者： Yuqing Song Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy and Sciences China

In this paper we introduce a compactness based clustering algorithm. The compactness of a data class is measured by comparing the inter-subset and intra-subset distances. The class compactness of a subset is defined as the ratio of the two distances. A subset is called an isolated cluster (or icluster) if its class compactness is greater than 1. All iclusters make a containment tree. We introduce monotonic sequences of iclusters to simplify the structure of the icluster tree, based on which a clustering algorithm is designed. The algorithm has the following advantages: it is effective on data sets with clusters nonlinearly separated, of arbitrary shapes, or of different densities. The effectiveness of the algorithm is demonstrated by experiments.

关键词： Clustering algorithms Kernel Partitioning algorithms Algorithm design and analysis Shape Pediatrics Joining processes

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：