Open-Domain Question Answering (ODQA) systems often struggle with the quality of retrieved passages, which may contain conflicting information and be misaligned with the reader's needs. Existing retrieval methods ...
Directed evolution (DE) has been the most effective method for protein engineering that optimizes biological functionalities through a resource-intensive process of screening or selecting among a vast range of mutations. To mitigate this extensive procedure, recent advancements in machine learning-guided methodologies center around the establishment of a surrogate sequence-function model. In this paper, we propose latent-based DE (LDE), an evolutionary algorithm designed to prioritize the exploration of high-fitness mutants in the latent space. At its core, LDE is a regularized variational autoencoder (VAE), harnessing the capabilities of the state-of-the-art protein language model, ESM-2, to construct a meaningful latent space of sequences. From this encoded representation, we present a novel approach for efficient traversal of the fitness landscape, employing a combination of gradient-based methods and DE. Experimental evaluations conducted on eight protein sequence design tasks demonstrate the superior performance of our proposed LDE over previous baseline algorithms. Our implementation is publicly available at https://***/HySonLab/LatentDE.
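The combination of gradient steps and evolutionary selection in latent space can be sketched as follows. This is a minimal illustration, not the paper's implementation: the quadratic `fitness` stands in for a learned sequence-function surrogate, the 2-D vectors stand in for the VAE latent built on ESM-2 embeddings, and all names (`optimum`, `grad_fitness`, step sizes) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(42)

# Toy differentiable surrogate over a 2-D latent space; in LDE this would be
# a learned fitness model on the regularized VAE latent (assumption).
optimum = np.array([1.5, -0.5])

def fitness(z):                       # higher is better, peaked at `optimum`
    return -np.sum((z - optimum) ** 2, axis=-1)

def grad_fitness(z):                  # analytic gradient of the toy surrogate
    return -2.0 * (z - optimum)

# Hybrid loop: a gradient-ascent step on every candidate, then a Gaussian
# "mutation" with greedy selection -- a minimal gradient + DE combination.
pop = rng.standard_normal((16, 2))    # initial latent population
for _ in range(50):
    pop = pop + 0.1 * grad_fitness(pop)              # gradient step
    mutants = pop + 0.05 * rng.standard_normal(pop.shape)
    keep = fitness(mutants) > fitness(pop)           # keep the fitter variant
    pop[keep] = mutants[keep]

best = pop[np.argmax(fitness(pop))]   # best latent found; decode via the VAE
```

In the full method the selected latents would be decoded back to sequences; here the loop simply converges toward the surrogate's peak.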
Multi-person pose estimation based on monocular cameras is one of the hot research topics in computer vision. Current monocular multi-person 3D pose estimation methods often treat individuals as independent entities f...
With the rapid advancements of various machine learning models, there is a significant demand for model-agnostic explanation techniques, which can explain these models across different architectures. Mainstream model-...
Quantum Computing (QC) offers significant potential to enhance scientific discovery in fields such as quantum chemistry, optimization, and artificial intelligence. Yet QC faces challenges due to the noisy intermediate...
Graph Contrastive Learning (GCL) has recently emerged as a promising graph self-supervised learning framework for learning discriminative node representations without labels. The widely adopted objective function of GCL benefits from two key properties: alignment and uniformity, which align representations of positive node pairs while uniformly distributing all representations on the hypersphere. The uniformity property plays a critical role in preventing representation collapse and is achieved by pushing apart augmented views of different nodes (negative pairs). As such, existing GCL methods inherently rely on increasing the quantity and quality of negative samples, resulting in heavy computational demands, memory overhead, and potential class collision issues. In this study, we propose a negative-free objective to achieve uniformity, inspired by the fact that points distributed according to a normalized isotropic Gaussian are uniformly spread across the unit hypersphere. Therefore, we can minimize the distance between the distribution of learned representations and the isotropic Gaussian distribution to promote the uniformity of node representations. Our method also distinguishes itself from other approaches by eliminating the need for a parameterized mutual information estimator, an additional projector, asymmetric structures, and, crucially, negative samples. Extensive experiments over seven graph benchmarks demonstrate that our proposal achieves competitive performance with fewer parameters, shorter training times, and lower memory consumption compared to existing GCL methods.
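One simple way to realize "minimize the distance between the representation distribution and an isotropic Gaussian" is moment matching: push the batch mean toward zero and the batch covariance toward the identity. The abstract does not specify which distributional distance the paper uses, so the loss below is an illustrative surrogate, not the authors' objective; all names are assumptions.

```python
import numpy as np

def gaussian_uniformity_loss(z):
    """Moment-matching surrogate for a negative-free uniformity term.

    Penalizes deviation of the embedding batch z (shape n x d) from a
    standard isotropic Gaussian: squared norm of the mean plus the squared
    Frobenius distance of the sample covariance from the identity.
    """
    n, d = z.shape
    mean = z.mean(axis=0)                       # should approach 0
    centered = z - mean
    cov = centered.T @ centered / (n - 1)       # should approach I
    return np.sum(mean ** 2) + np.sum((cov - np.eye(d)) ** 2)

rng = np.random.default_rng(0)
gaussian_batch = rng.standard_normal((4096, 8))   # near-ideal embeddings
collapsed_batch = np.ones((4096, 8))              # fully collapsed embeddings
```

A collapsed batch (all embeddings identical) incurs a much larger loss than a Gaussian-distributed one, which is exactly the collapse-prevention role that negative pairs play in standard GCL objectives.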
Traditional cell viability judgment methods are invasive and damaging to cells. Moreover, even under a microscope, it is difficult to distinguish live cells from dead cells by the naked eye alone. With the development...
An event-based social network (EBSN) is a new type of social network that combines online and offline networks. In recent years, an important task in EBSN recommendation systems has been to design better and more reas...
ISBN: (print) 9783031800832; 9783031800849
Current technology trends in high-performance computing (HPC) are pushing us towards accelerated systems. While GPU-based systems are the most common option, not all applications work well on such architectures. Solutions like programmable hardware in the form of FPGAs (Field Programmable Gate Arrays) can be a powerful alternative. However, the complexity of developing specialized computing units in FPGAs, which are optimized for a specific task, often limits their broad utilization. In this paper, we follow a co-design methodology to identify the key computational routines and to replace them by using user-friendly libraries that wrap complex FPGA access mechanisms. This simplifies the usage of specialized compute units in FPGAs. To demonstrate our approach, we focus on performance improvements for an HPC/BigData application called (MP)N, which is built around a widely used data analytics algorithm computing the matrix profile for multidimensional time series. In this application, we identify a sorting kernel as one of the key time consumers and accelerate it by designing a parallel sorting library and using it to offload sorting batches to the FPGA. At the same time, we enable efficient utilization of CPU resources through overlap and pipelining. We achieve a 2-fold run time improvement for computing a 128-dimensional time series of 7 million records, with the performance gap increasing as the number of records grows, highlighting the potential of CPU-FPGA co-design in HPC.
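The overlap-and-pipeline pattern described above, where the CPU processes one sorted batch while the next batch is offloaded, can be sketched with a one-worker executor standing in for the FPGA. This is a structural illustration only: `device_sort` and `cpu_postprocess` are hypothetical stand-ins for the paper's FPGA sorting library and the CPU-side matrix-profile work, not its API.

```python
import concurrent.futures as cf
import random

def device_sort(batch):
    """Stand-in for the offloaded FPGA sorting kernel (assumption)."""
    return sorted(batch)

def cpu_postprocess(sorted_batch):
    """Stand-in CPU work that overlaps with the next in-flight offload."""
    return sorted_batch[0], sorted_batch[-1]  # e.g. min/max of the batch

rng = random.Random(0)
batches = [[rng.random() for _ in range(1000)] for _ in range(8)]

results = []
# One worker models a single in-flight offload to the device.
with cf.ThreadPoolExecutor(max_workers=1) as device:
    pending = device.submit(device_sort, batches[0])
    for nxt in batches[1:]:
        done = pending.result()                    # collect finished batch
        pending = device.submit(device_sort, nxt)  # start next offload...
        results.append(cpu_postprocess(done))      # ...while the CPU works
    results.append(cpu_postprocess(pending.result()))
```

The key point is that `cpu_postprocess` on batch *i* runs concurrently with the sort of batch *i+1*, so device and CPU time overlap instead of serializing.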
Recently, many works have proposed various financial large language models (FinLLMs) by pre-training from scratch or fine-tuning open-sourced LLMs on financial corpora. However, existing FinLLMs exhibit unsatisfactory...