检索结果-内蒙古大学图书馆

Improving Parameter Estimation and Defensive Ability of Latent Dirichlet Allocation Model Training Under Rényi Differential Privacy

引用

Journal of Computer Science & Technology 2022年第6期37卷 1382-1397页

作者： Tao Huang Su-Yun Zhao Hong Chen Yi-Xuan Liu Key Laboratory of Data Engineering and Knowledge Engineering(Renmin University of China) Ministry of Education Beijing 100087China School of Information Renmin University of ChinaBeijing 100087China

Latent Dirichlet allocation(LDA)is a topic model widely used for discovering hidden semantics in massive text *** Gibbs sampling(CGS),as a widely-used algorithm for learning the parameters of LDA,has the risk of privacy ***,word count statistics and updates of latent topics in CGS,which are essential for parameter estimation,could be employed by adversaries to conduct effective membership inference attacks(MIAs).Till now,there are two kinds of methods exploited in CGS to defend against MIAs:adding noise to word count statistics and utilizing inherent *** two kinds of methods have their respective *** sampled from the Laplacian distribution sometimes produces negative word count statistics,which render terrible parameter estimation in *** inherent privacy could only provide weak guaranteed privacy when defending against *** is promising to propose an effective framework to obtain accurate parameter estimations with guaranteed differential *** key issue of obtaining accurate parameter estimations when introducing differential privacy in CGS is making good use of the privacy budget such that a precise noise scale is *** is the first time that R′enyi differential privacy(RDP)has been introduced into CGS and we propose RDP-LDA,an effective framework for analyzing the privacy loss of any differentially private ***-LDA could be used to derive a tighter upper bound of privacy loss than the overestimated results of existing differentially private CGS obtained byε-*** RDP-LDA,we propose a novel truncated-Gaussian mechanism that keeps word count statistics *** we propose distribution perturbation which could provide more rigorous guaranteed privacy than utilizing inherent *** validate that our proposed methods produce more accurate parameter estimation under the JS-divergence metric and obtain lower precision and recall when defending against MIAs.

关键词： latent Dirichlet allocation parameter estimation membership inference attack Rényi differential privacy

来源：评论

学校读者我要写书评

暂无评论

knowledge Graph for China's Genealogy1

引用

IEEE Transactions on knowledge and data engineering 2023年第1期35卷 634-646页

作者： Wu, Xindong Jiang, Tingting Zhu, Yi Bu, Chenyang Key Laboratory of Knowledge Engineering with Big Data Hefei University of Technology Ministry of Education Anhui Hefei230009 China Hefei University of Technology Research Institute of Big Knowledge Anhui Hefei230009 China Mininglamp Technology Mininglamp Academy of Sciences Beijing100102 China Hefei University of Technology School of Computer Science and Information Engineering Anhui Hefei230009 China Hefei University of Technology Ministry of Education Key Laboratory of Knowledge Engineering with Big Data Anhui Hefei230009 China

Genealogical knowledge graphs depict the relationships of family networks and the development of family histories. They can help researchers to analyze and understand genealogical data, search for genealogical descendant paths, and explore the origins of a family. However, the heterogenous, autonomous, complex, and evolving natures of genealogical data bring challenges to the development of contemporary genealogical knowledge graph models. Applying existing methods to genealogical data may be improper because general knowledge graph models lack in-depth domain knowledge. In this paper, we propose a genealogical knowledge graph model named Huapu-KG that combines HAO intelligence (human intelligence + artificial intelligence + organizational intelligence) to implement the construction and applications of genealogical knowledge graphs. Furthermore, challenges in constructing genealogical knowledge graphs are demonstrated, and experiments conducted on real-world genealogical datasets verify the feasibility and effectiveness of our proposed model. © 1989-2012 IEEE.

关键词： History

来源：评论

学校读者我要写书评

暂无评论

Hierarchical All-Pairs SimRank Calculation 28th

Hierarchical All-Pairs SimRank Calculation

引用

28th International Conference on database Systems for Advanced Applications, DASFAA 2023

作者： Zhang, Liangfu Li, Cuiping Zhang, Xue Chen, Hong Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education School of Information Renmin University of China Beijing China

ISBN: (纸本)9783031306747

All-pairs SimRank calculation is a classic SimRank problem. However, all-pairs algorithms suffer from efficiency issues and accuracy issues. In this paper, we convert the non-linear simrank calculation into a new simple closed formulation of linear system. And we come up with a sequence of novel algorithms to efficiently solve the linear system with accuracy guarantees. To reduce the memory consumption and improve the computational efficiency, we build a hierarchical framework to calculate the all-pairs SimRank scores, which includes locally coarse calculation and globally refine calculation. We first solve the local linear systems generated from the subgraphs, then we refine the SimRank scores on the full graph from the residuals of the local structures. We also show that our algorithms outperform the state-of-the-art all-pairs SimRank computation algorithms on real graphs. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： Computational efficiency

来源：评论

学校读者我要写书评

暂无评论

PCQPR: Proactive Conversational Question Planning with Reflection

PCQPR: Proactive Conversational Question Planning with Refle...

引用

2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024

作者： Guo, Shasha Liao, Lizi Zhang, Jing Li, Cuiping Chen, Hong School of Information Renmin University of China Beijing China Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education China Singapore Management University Singapore

ISBN: (纸本)9798891761643

Conversational Question Generation (CQG) enhances the interactivity of conversational question-answering systems in fields such as education, customer service, and entertainment. However, traditional CQG, focusing primarily on the immediate context, lacks the conversational foresight necessary to guide conversations toward specified conclusions. This limitation significantly restricts their ability to achieve conclusion-oriented conversational outcomes. In this work, we redefine the CQG task as Conclusion-driven Conversational Question Generation (CCQG) by focusing on proactivity, not merely reacting to the unfolding conversation but actively steering it towards a conclusion-oriented question-answer pair. To address this, we propose a novel approach, called Proactive Conversational Question Planning with self-Refining (PCQPR). Concretely, by integrating a planning algorithm inspired by Monte Carlo Tree Search (MCTS) with the analytical capabilities of large language models (LLMs), PCQPR predicts future conversation turns and continuously refines its questioning strategies. This iterative self-refining mechanism ensures the generation of contextually relevant questions strategically devised to reach a specified outcome. Our extensive evaluations demonstrate that PCQPR significantly surpasses existing CQG methods, marking a paradigm shift towards conclusion-oriented conversational question-answering systems. © 2024 Association for Computational Linguistics.

关键词： Computational linguistics

来源：评论

学校读者我要写书评

暂无评论

Integrating Physical Prediction Methods and AI-based Satellite data Analysis Methods in Earthquake Damage Estimation 8th

Integrating Physical Prediction Methods and AI-based Satelli...

引用

8th International Symposium on Reliability engineering and Risk Management, ISRERM 2022

作者： Miyamoto, Takashi Department of Civil and Environmental Engineering University of Yamanashi Japan Smart Data and Knowledge Services German Research Center for Artificial Intelligence Germany

ISBN: (纸本)9789811851841

In order to estimate the damage distribution immediately after an earthquake, both physical prediction methods and data-driven methods that analyze sensing data obtained from satellites are used. However, the former has the problem of prediction accuracy, while the latter has the problem of difficulty in detecting detailed damage patterns such as partial destruction. Therefore, we presents a method that improves the detection accuracy of detailed damage distribution of structures such as total and partial collapse by integrating both methods. As an integration scheme of the two methods, a data assimilation method based on Bayes’ theorem is adopted in this study. We proposed a method to update the damage probability of each structure obtained from physical simulations by conditioning it on the observed data obtained from satellite image analysis, and verified its effectiveness. © 2022 ISRERM Organizers. Published by Research Publishing, Singapore.

关键词： Prediction models

来源：评论

学校读者我要写书评

暂无评论

Enhancing Extractive Question Answering in Multiparty Dialogues with Logical Inference Memory Network 31

Enhancing Extractive Question Answering in Multiparty Dialog...

引用

31st International Conference on Computational Linguistics, COLING 2025

作者： Zhou, Shu Zhao, Rui Zhou, Zhengda Yi, Haohan Zheng, Xuhui Wang, Hao Nanjing University China Key Laboratory of Data Engineering and Knowledge Services in Jiangsu Provincial Universities Nanjing University China University of Technology Sydney Australia

ISBN: (纸本)9798891761964

Multiparty dialogue question answering (QA) within machine reading comprehension (MRC) presents significant challenges due to the complex interplay of information across multiple speakers and the need for advanced logical reasoning. While existing models often focus on separating dialogue information based on speakers and utterances, they rarely address the crucial aspect of logical inference, leading to suboptimal performance in understanding and answering questions. To bridge this gap, we introduce the Logical Inference Memory Network (LIMN), a novel architecture designed for extractive QA in multiparty dialogues. LIMN incorporates a unique inference module pretrained on plain text QA datasets (like SQuAD 2.0), enabling it to transfer robust logical reasoning abilities to the dialogue domain. This module generates representations that are specifically attuned to logical inference, which are then integrated into the dialogue context. Furthermore, we propose a key-utterance-based interaction mechanism that dynamically focuses on the most relevant utterances within the dialogue, enhancing the model's ability to pinpoint answers. To ensure robust performance, LIMN employs a multitask learning strategy that jointly optimizes for answer extraction, answerability prediction, key-utterance identification, and masked speaker prediction. Extensive experiments on the Molweni and FriendsQA benchmarks, encompassing 25,000 and 10,000 questions respectively, demonstrate that LIMN achieves state-of-the-art results, affirming the effectiveness of incorporating logical inference in multiparty dialogue QA. © 2025 Association for Computational Linguistics.

关键词： Memory architecture

来源：评论

学校读者我要写书评

暂无评论

Intelligent Assistant for Multivariant Analysis 26

Intelligent Assistant for Multivariant Analysis

引用

26th International Conference of the Catalan Association for Artificial Intelligence, CCIA 2024

作者： Angerri, Xavier Delgado, Oscar Gibert, Karina Knowledge Engineering and Machine Learning Group Intelligent Data Science and Artificial Intelligence Research Center Universtitat Politècnica de Catalunya Spain

ISBN: (纸本)9781643685434

When a knowledge Discovery from data (KDD) (Fayyad, Piatetsky-Shapiro, & Smyth, 1996) process is being applied to get knowledge, several methods could be used (Gibert, et al., 2018). A simple and fast way to obtain preliminary insights from data before using KDD models is by generating a basic descriptive analysis. It is one of the most popular ways to describe experimental data and should be the beginning of all data projects. Nevertheless some of the main knowledge that can be extracted in a descriptive analysis is hidden due to underlying multivariate structures which could be elicited through multivariate analysis techniques. Moreover, the domain expert is key for a proper interpretation of descriptive results. At the same time, there is a lack of automatic reporting techniques that can report and help in the interpretation of complex patterns and the use of advanced multivariate techniques. This paper shows the tool developed to generate automatic interpretation of Multiple Correspondence Analysis (MCA) and Principal Components Analysis (PCA) by using RMarkdown. This tool generates a Word document which contains the automatic interpretation of the results, built on the basis of regular expressions ellaborating over the R analytical outputs (either numerical or graphical results). The proposal is being applied with some real data, like INSESS database on social vulnerabilities of the Catalan population. In conclusion, the developed tool contributes to facilitate the factorial methods results, avoiding the misinterpretation of the results and the involuntary skipping of conclusions due to the large amount of knowledge that can be extracted from a complete factorial analysis. Also, this software enables non-expert users to read multivariate analysis results in a friendly way. Moreover, this tool saves time in the interpretation step and is a basis to support the expert to start the report with the results, even the output of the software could become the report or

关键词： automatic interpretation Automatic reporting explainability

来源：评论

学校读者我要写书评

暂无评论

Face micro-expression recognition algorithm based on ResNet depth model 8

Face micro-expression recognition algorithm based on ResNet ...

引用

8th International Conference on Intelligent Computing and Signal Processing, ICSP 2023

作者： Wang, Liquan Zhan, Shu Hefei University of Technology Key Laboratory of Big Data Knowledge Engineering Ministry of Education School of Computer and Information Engineering Hefei China

ISBN: (纸本)9798350302455

Micro Expression (ME) is the subtle facial expressions that people show when they express their inner feelings. To address the problem that micro-expression recognition is difficult and less accurate due to the small number of samples and uneven distribution of different categories, we propose a model framework to improve the accuracy of micro-expression recognition. The peak frames containing more key expression information in the micro-expression video sequences are extracted;SE-ResNeXt-50, an improved residual network with SE module, is used to extract features from the peak frames of micro-expressions, where the SE module can better learn the key information in the features, and ResNeXt simplifies the structure by replacing the dense structure with the sparse structure through group convolution, which improves the recognition efficiency. The recognition efficiency is improved by replacing the dense structure with the sparse structure by group convolution. At the same time, the Focal Loss loss function can better solve the model performance problem caused by the imbalance of micro-expression data. Simulation experiments are conducted on the micro-expression dataset CASME, and it is found that the improved residual network and peak frame improve the accuracy and F1 value of micro-expression recognition. The improved residual network and peak frame can reduce the effect of small data set, make the model have good fitting effect, and improve the performance of different categories, improve the recognition accuracy of micro-expressions, and have better recognition performance for micro-expression recognition. © 2023 IEEE.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

Layer-Wise Learning Rate Optimization for Task-Dependent Fine-Tuning of Pre-Trained Models: An Evolutionary Approach

引用

ACM Transactions on Evolutionary Learning and Optimization 2024年第4期4卷 1-23页

作者： Bu, Chenyang Liu, Yuxin Huang, Manzong Shao, Jianxuan Ji, Shengwei Luo, Wenjian Wu, Xindong Key Laboratory of Knowledge Engineering with Big Data Ministry of Education and School of Computer Science and Information Engineering Hefei University of Technology Hefei China School of Artificial Intelligence and Big Data Hefei University Hefei China Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies School of Computer Science and Technology Harbin Institute of Technology Shenzhen China

The superior performance of large-scale pre-Trained models, such as Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-Trained Transformer (GPT), has received increasing attention in both academic and industrial research and has become one of the current research hotspots. A pre-Trained model refers to a model trained on large-scale unlabeled data, whose purpose is to learn general language representation or features for fine-Tuning or transfer learning in subsequent tasks. After pre-Training is complete, a small amount of labeled data can be used to fine-Tune the model for a specific task or domain. This two-stage method of "pre-Training+fine-Tuning"has achieved advanced results in natural language processing (NLP) tasks. Despite widespread adoption, existing fixed fine-Tuning schemes that adapt well to one NLP task may perform inconsistently on other NLP tasks given that different tasks have different latent semantic structures. In this article, we explore the effectiveness of automatic fine-Tuning pattern search for layer-wise learning rates from an evolutionary optimization perspective. Our goal is to use evolutionary algorithms to search for better task-dependent fine-Tuning patterns for specific NLP tasks than typical fixed fine-Tuning patterns. Experimental results on two real-world language benchmarks and three advanced pre-Training language models show the effectiveness and generality of the proposed framework. © 2024 held by the owner/author(s).

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Bootstrap-Based Layerwise Refining for Causal Structure Learning

IEEE Transactions on Artificial Intelligence

引用

IEEE Transactions on Artificial Intelligence 2024年第6期5卷 2708-2722页

作者： Xiang, Guodu Wang, Hao Yu, Kui Guo, Xianjie Cao, Fuyuan Song, Yukun Hefei University of Technology Key Laboratory of Knowledge Engineering with the Big Data of Ministry of Education Hefei230601 China Hefei University of Technology School of Computer Science and Information Engineering Hefei230601 China Shanxi University School of Computer and Information Technology Taiyuan030006 China

Learning causal structures from observational data is critical for causal discovery and many machine learning tasks. Traditional constraint-based methods first adopt conditional independence (CI) tests to learn a global skeleton layer by layer and then orient the undirected edges to obtain a causal structure. However, the reliability of these statistical tests largely depends on the quality of data samples. In real-life scenarios, the presence of data noise or limited samples often makes many CI tests unreliable at each layer in the skeleton learning phase, leading to an inaccurate skeleton. As the number of layers increases, the inaccurate skeleton will continue to impair the skeleton construction of subsequent layers. Furthermore, an unreliable skeleton hampers the skeleton orientation procedure, resulting in an unsatisfactory causal structure. In this article, we propose a Bootstrap-based layerwise refining (BLR) algorithm for causal structure learning, which includes two new procedures to solve the above problems. First, BLR utilizes a novel layerwise skeleton refining procedure to construct the global skeleton layer by layer based on the bootstrap sampling. Second, BLR employs a collective skeleton orientation procedure that incorporates scoring techniques to collectively orient the global skeleton. The experimental results show that BLR outperforms the state-of-the-art methods on the benchmark Bayesian Network datasets. © 2020 IEEE.

关键词： Refining

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：