Search Results - Inner Mongolia University Library

2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022

Authors: Guo, Shasha; Zhang, Jing; Wang, Yanling; Zhang, Qianyi; Li, Cuiping; Chen, Hong. Affiliations: School of Information, Renmin University of China, Beijing, China; Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education, China

Existing methods on knowledge base question generation (KBQG) learn a one-size-fits-all model by training together all subgraphs without distinguishing the diverse semantics of subgraphs. In this work, we show that making use of the past experience on semantically similar subgraphs can reduce the learning difficulty and promote the performance of KBQG models. To achieve this, we propose a novel approach to model diverse subgraphs with meta-learner (DSM). Specifically, we devise a graph contrastive learning-based retriever to identify semantically similar subgraphs, so that we can construct the semantics-aware learning tasks for the meta-learner to learn semantics-specific and semantics-agnostic knowledge on and across these tasks. Extensive experiments on two widely-adopted benchmarks for KBQG show that DSM derives new state-of-the-art performance and benefits the question answering tasks as a means of data augmentation. Codes and datasets are available online. © 2022 Association for Computational Linguistics.

Keywords: Semantics

COP: Privacy-preserving multidimensional partition in DAS paradigm

2009 International Conference on Extending Database Technology/International Conference on Database Theory Workshops, EDBT/ICDT '09

Authors: Wang, Jieping; Du, Xiaoyong; Wang, Haocong; Yang, Pingping. Affiliations: Key Laboratory of Data Engineering and Knowledge Engineering, MOE, China; School of Information, Renmin University of China, Beijing 100872, China

ISBN: (Print) 9781605586502

Database-as-a-Service (DAS) is an emerging database management paradigm in which a partition-based index is an effective way to query encrypted data. However, previous research either focuses on one-dimensional partitioning or ignores the distribution characteristics of multidimensional data, especially sparsity and locality. In this paper, we propose Cluster-based Onion Partition (COP), which is designed to reduce both false positives and dead space at the same time. COP consists of two steps. First, it partitions the covered space level by level, like peeling an onion; second, at each level, a clustering algorithm based on local density is proposed to achieve a locally optimal secure partition. Extensive experiments on real and synthetic datasets show that COP is a secure multidimensional partition with much less efficiency loss than previous top-down or bottom-up counterparts. Copyright 2009 ACM.

Keywords: database systems

Efficient Duplicate Detection on Cloud Using a New Signature Scheme

12th International Conference on Web-Age Information Management

Authors: Rong, Chuitian; Lu, Wei; Du, Xiaoyong; Zhang, Xiao. Affiliations: Key Labs. of Data Engineering and Knowledge Engineering, MOE, China; School of Information, Renmin University of China, China; Shanghai Key Laboratory of Intelligent Information Processing, China

ISBN: (Print) 9783642235344; 9783642235351

Duplicate detection has been well recognized as a crucial task for improving data quality. Related work on this problem mainly proposes efficient approaches for a single machine. However, as data volume increases, the performance of duplicate identification is still far from satisfactory. Hence, we address the problem of duplicate detection over MapReduce, a shared-nothing paradigm. We argue that the performance of duplicate detection with MapReduce mainly depends on the number of candidate record pairs. In this paper, we propose a new signature scheme with a new pruning strategy over MapReduce to minimize the number of candidate record pairs. Our experimental results over both real and synthetic datasets demonstrate that our proposed signature-based method is efficient and scalable.

Keywords: duplicate detection; MapReduce; Cloud

Maintaining materialized relations incrementally to improve performance of ontology query

7th International Conference on Web-Age Information Management Workshops, WAIM 2006

Authors: Li, Man; Du, Xiaoyong; Wang, Shan. Affiliations: School of Information, Renmin University of China, China; Key Laboratory of Data Engineering and Knowledge Engineering, MOE, 100872 Beijing, China

ISBN: (Print) 0769527051

For ontology-based applications, the efficiency of ontology queries is vital. Unlike existing approaches, the paper improves the performance of ontology queries by materializing some derived relations. Experimental results show that the overall performance of ontology queries can be improved greatly by maintaining materialized relations, and that the technique scales well. The challenge is how to maintain the materialized relations incrementally as ontologies are updated. Because transitive relations are in common use in ontologies, the paper proposes a novel algorithm for maintaining transitive materialized relations incrementally, based on a special weighted materialized relation transitive graph, which better handles the coexistence of multiple derived paths. The paper also proves the correctness of the algorithm. © 2006 IEEE.

Keywords: Ontology
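
The abstract above describes keeping a materialized transitive relation up to date as the ontology changes. As a rough illustration only, the sketch below handles the simplest case, inserting one new edge into a transitive relation such as subclass-of. It is a textbook technique, not the paper's weighted-graph algorithm: it does not handle deletions or track multiple derivation paths. The class and method names are hypothetical.

```python
from collections import defaultdict


class TransitiveClosure:
    """Insertion-only incremental maintenance of a materialized transitive relation.

    Hypothetical illustration; not the algorithm from the paper.
    """

    def __init__(self):
        self.desc = defaultdict(set)  # node -> every node reachable from it
        self.anc = defaultdict(set)   # node -> every node that reaches it

    def add_edge(self, u, v):
        """Insert the base fact u -> v and return the newly derived pairs."""
        # Everything that reaches u (plus u itself) now reaches
        # everything reachable from v (plus v itself).
        sources = self.anc[u] | {u}
        targets = self.desc[v] | {v}
        added = []
        for a in sources:
            for b in targets:
                if b not in self.desc[a]:
                    self.desc[a].add(b)
                    self.anc[b].add(a)
                    added.append((a, b))
        return added

    def reaches(self, a, b):
        """Answer a transitive query from the materialized relation."""
        return b in self.desc[a]
```

Only the pairs that pass through the new edge are touched, so a query such as `reaches("Dog", "LivingThing")` is a set lookup rather than a graph traversal. Supporting deletions would need extra bookkeeping about how each pair was derived, which is the problem the abstract says the weighted graph addresses.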

Search result diversification based on hierarchical intents

24th ACM International Conference on Information and Knowledge Management, CIKM 2015

Authors: Hu, Sha; Dou, Zhicheng; Wang, Xiaojie; Sakai, Tetsuya; Wen, Ji-Rong. Affiliations: Beijing Key Laboratory of Big Data Management and Analysis Methods, China; Key Laboratory of Data Engineering and Knowledge Engineering, MOE, China; School of Information, Renmin University of China, China; Waseda University, Japan

ISBN: (Print) 9781450337946

A large percentage of queries issued to search engines are broad or ambiguous. Search result diversification aims to solve this problem, by returning diverse results that can fulfill as many different information needs as possible. Most existing intent-aware search result diversification algorithms formulate user intents for a query as a flat list of subtopics. In this paper, we introduce a new hierarchical structure to represent user intents and propose two general hierarchical diversification models to leverage hierarchical intents. Experimental results show that our hierarchical diversification models outperform state-of-the-art diversification methods that use traditional flat subtopics. © 2015 ACM.

Keywords: Search engines

WATuning: A Workload-Aware Tuning System with Attention-Based Deep Reinforcement Learning

Journal of Computer Science & Technology, 2021, Vol. 36, No. 4, pp. 741-761

Authors: Jia-Ke Ge; Yan-Feng Chai; Yun-Peng Chai. Affiliations: Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education, Renmin University of China, Beijing 100872, China; School of Information, Renmin University of China, Beijing 100872, China; College of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030027, China

Configuration tuning is essential to optimize the performance of systems (e.g., databases, key-value stores). High performance usually indicates high throughput and low *** present, most of the tuning tasks of systems are performed manually (e.g., by database administrators), but it is hard for them to achieve high performance through tuning in various types of systems and in various *** recent years, there have been some studies on tuning traditional database systems, but all these methods have some *** this article, we put forward a tuning system based on attention-based deep reinforcement learning named WATuning, which can adapt to the changes of workload characteristics and optimize the system performance efficiently and ***, we design the core algorithm named ATT-Tune for WATuning to achieve the tuning task of *** algorithm uses workload characteristics to generate a weight matrix and acts on the internal metrics of systems, and then ATT-Tune uses the internal metrics with weight values assigned to select the appropriate ***, WATuning can generate multiple instance models according to the change of the workload so that it can complete targeted recommendation services for different types of ***, WATuning can also dynamically fine-tune itself according to the constantly changing workload in practical applications so that it can better fit the actual environment to make *** experimental results show that the throughput and the latency of WATuning are improved by 52.6% and decreased by 31%, respectively, compared with the throughput and the latency of CDBTune, which is an existing optimal tuning method.

Keywords: attention mechanism; auto-tuning system; reinforcement learning (RL); workload-aware

Accuracy estimation of link-based similarity measures and its application

Frontiers of Computer Science, 2016, Vol. 10, No. 1, pp. 113-123

Authors: Yinglong ZHANG; Cuiping LI; Chengwang XIE; Hong CHEN. Affiliations: School of Software, East China Jiaotong University, Nanchang 330045, China; Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education and Department of Computer Science, Renmin University of China, Beijing 100872, China; Intelligent Optimization and Information Processing Laboratory, East China Jiaotong University, Nanchang 330013, China

Link-based similarity measures play a significant role in many graph-based applications. Consequently, measuring node similarity in a graph is a fundamental problem of graph data mining. Personalized PageRank (PPR) and SimRank (SR) have emerged as the most popular and influential link-based similarity measures. Recently, a novel link-based similarity measure, penetrating rank (P-Rank), which enriches SR, was proposed. In practice, PPR, SR and P-Rank scores are calculated by iterative methods. As the number of iterations increases, so does the overhead of the calculation. Ideally, similarity would be computed with the minimum number of iterations sufficient to guarantee a desired accuracy. However, the existing upper bounds are too coarse to be useful in general. Therefore, we focus on designing accurate and tight upper bounds for PPR, SR, and P-Rank in the paper. Our upper bounds are designed based on the following intuition: the smaller the difference between two consecutive iteration steps is, the smaller the difference between the theoretical and iterative similarity scores becomes. Furthermore, we demonstrate the effectiveness of our upper bounds in the scenario of top-k similar nodes queries, where our upper bounds help accelerate the query. We also run a comprehensive set of experiments on real-world data sets to verify the effectiveness and efficiency of our upper bounds.

Keywords: personalized PageRank; SimRank; P-Rank; upper bound
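
The intuition in the abstract above (a small change between consecutive iterations implies a small distance to the exact scores) can be illustrated for PPR with the standard contraction-mapping bound. This is a generic a posteriori bound and not necessarily the tighter bound derived in the paper. With damping factor c, the PPR update is a contraction with factor c in the L1 norm, so the L1 error after step k is at most c/(1-c) times the L1 difference between steps k and k-1. The function name, the adjacency-list input format and the handling of dangling nodes are assumptions made for this sketch.

```python
def personalized_pagerank(out_links, source, c=0.85, tol=1e-8, max_iter=1000):
    """Power iteration for PPR that stops once a guaranteed error bound is met.

    out_links[u] lists the nodes that u links to. Returns (scores, bound, steps),
    where bound is an upper limit on the L1 distance to the exact PPR vector.
    """
    n = len(out_links)
    x = [0.0] * n
    x[source] = 1.0
    bound = float("inf")
    steps = 0
    for steps in range(1, max_iter + 1):
        x_new = [0.0] * n
        x_new[source] = 1.0 - c  # restart mass returns to the source
        for u, targets in enumerate(out_links):
            if targets:
                share = c * x[u] / len(targets)
                for v in targets:
                    x_new[v] += share
            else:
                # Assumption: a walk stuck at a dangling node jumps to the source.
                x_new[source] += c * x[u]
        diff = sum(abs(a - b) for a, b in zip(x_new, x))
        # Contraction bound: ||exact - x_new||_1 <= c / (1 - c) * ||x_new - x||_1
        bound = c / (1.0 - c) * diff
        x = x_new
        if bound <= tol:
            break
    return x, bound, steps
```

Calling `personalized_pagerank([[1, 2], [2], [0, 3], []], source=0, tol=1e-3)` stops as soon as the returned bound drops below the requested tolerance, so the iteration count adapts to the graph instead of being fixed in advance.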

Selecting Good Expansion Terms for Improving XML Retrieval Performance

International Conference on Control Engineering and Communication Technology (ICCECT)

Author: Minjuan Zhong. Affiliation: Jiangxi Key Laboratory of Data and Knowledge Engineering, Jiangxi University of Finance and Economics, Nanchang, China

In this paper, we study how to perform XML query expansion effectively using high-quality pseudo-relevance documents. A solution for selecting good expansion information is presented, in which various features that affect term weight, such as term element frequency, term inverse element frequency, semantic weight of tag, and level information, are analyzed, and the terms with high weight values are selected as expansion terms. Experimental results show that the proposed expansion method is feasible. Compared to the original query and to a traditional expansion method that considers no structural features, our method achieves better retrieval performance.

Keywords: XML; Semantics; Analytical models; Vectors; Information retrieval; Computational modeling; Educational institutions

Intelligent Fast Cell Association Scheme Based on Deep Q-Learning in Ultra-Dense Cellular Networks

China Communications, 2021, Vol. 18, No. 2, pp. 259-270

Authors: Jinhua Pan; Lusheng Wang; Hai Lin; Zhiheng Zha; Caihong Kai. Affiliations: Key Laboratory of Knowledge Engineering with Big Data, Ministry of Education, School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China; Anhui Province Key Laboratory of Industry Safety and Emergency Technology, Hefei 230601, China; Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering, Wuhan University, Wuhan 430072, China

To support dramatically increased traffic loads, communication networks become *** cell association (CA) schemes are time-consuming, forcing researchers to seek fast *** paper proposes a deep Q-learning based scheme, whose main idea is to train a deep neural network (DNN) to calculate the Q values of all the state-action pairs and the cell holding the maximum Q value is *** the training stage, the intelligent agent continuously generates samples through the trial-and-error method to train the DNN until *** the application stage, state vectors of all the users are input to the trained DNN to quickly obtain a satisfactory CA result for a scenario with the same BS locations and user *** demonstrate that the proposed scheme provides satisfactory CA results in a computational time several orders of magnitude shorter than traditional ***, performance metrics, such as capacity and fairness, can be guaranteed.

Keywords: ultra-dense cellular networks (UDCN); cell association (CA); deep Q-learning; proportional fairness; Q-learning

An approach for personalized tag recommendation based on interest transfer model

9th Web Information Systems and Applications Conference, WISA 2012

Authors: Liu, Yue; Yang, Nan; Yang, Gang. Affiliations: School of Information, Renmin University of China, Beijing, China; Key Laboratory of Data Engineering and Knowledge Engineering, Ministry of Education, Beijing, China

ISBN: (Print) 9781467330541

Recently, social tagging systems have become more and more popular in many Web 2.0 applications. In such systems, users are allowed to annotate a particular resource with a freely chosen set of tags. These user-generated tags can represent users' interests more concisely and in a form closer to human understanding. Interests change over time. Thus, how to describe users' interests and their interest transfer paths has become a big challenge for personalized recommendation systems. In this approach, we propose a variable-length time interval division algorithm and a user interest model based on time intervals. Then, in order to draw users' interest transfer paths over a specific time period, we propose an interest transfer model. After that, we apply a classical community partition algorithm to separate users into communities. Finally, we present a novel method to measure users' similarities based on the interest transfer model and provide personalized tag recommendations according to similar users' interests in their next time intervals. Experimental results demonstrate higher precision and recall with our approach than with classical user-based collaborative filtering methods. © 2012 IEEE.

Keywords: Collaborative filtering
