ISBN (digital): 9798350349610
ISBN (print): 9798350349627
In recent years, deep reinforcement learning (DRL) approaches have generated highly successful controllers for a myriad of complex domains. However, the opaque nature of these models limits their applicability in aerospace systems and safety-critical domains, in which a single mistake can have dire consequences. In this paper, we present novel advancements in both the training and verification of DRL controllers, which can help ensure their safe behavior. We showcase a design-for-verification approach utilizing k-induction and demonstrate its use in verifying liveness properties. In addition, we give a brief overview of neural Lyapunov Barrier certificates and summarize their capabilities on a case study. Finally, we describe several other novel reachability-based approaches which, despite failing to provide the guarantees of interest, could be effective for the verification of other DRL systems and could be of further interest to the community.
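To make the design-for-verification idea concrete, the sketch below applies the k-induction schema to a toy closed-loop system with a hand-written stand-in controller, checking a simple bounded-state invariant (the paper itself targets liveness properties of actual DRL controllers). The dynamics, initial set, property, and induction depth are all illustrative assumptions, not the paper's benchmark or tooling.

```python
# Minimal k-induction sketch with z3 (illustrative only, not the paper's tool).
from z3 import Real, Solver, And, Not, If, unsat

K = 3  # induction depth (an arbitrary illustrative choice)

def controller(x):
    # Stand-in for a learned policy: brake when the state is high, otherwise push up.
    return If(x > 5, -1.0, 0.5)

def step(x):
    # Closed-loop dynamics: x' = x + u(x).
    return x + controller(x)

def prop(x):
    # Property P(x): the state stays within [0, 10].
    return And(x >= 0, x <= 10)

def k_induction(k):
    xs = [Real(f"x_{i}") for i in range(k + 1)]
    trans = And([xs[i + 1] == step(xs[i]) for i in range(k)])

    # Base case: from the initial set, P holds on the first k states.
    base = Solver()
    base.add(xs[0] >= 0, xs[0] <= 1)  # initial set
    base.add(trans)
    base.add(Not(And([prop(xs[i]) for i in range(k)])))
    if base.check() != unsat:
        return False

    # Inductive step: k consecutive P-states are always followed by a P-state.
    ind = Solver()
    ind.add(trans)
    ind.add(And([prop(xs[i]) for i in range(k)]))
    ind.add(Not(prop(xs[k])))
    return ind.check() == unsat

print(f"property proved by {K}-induction: {k_induction(K)}")
```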
Knowledge representation is becoming an effective way of performing information extraction. However, many studies have ignored its application in the zero-shot setting. In this paper, we propose a novel framework for prompting language models based on external ontology knowledge, called Knowledge-Based Prompt Tuning for Zero-shot Relation Triplet Extraction (KBPT), which encourages further investigation of low-resource regimes to address the data scarcity problem in Relation Triplet Extraction (RTE). The core task of zero-shot relation triplet extraction is to extract, from an input sentence, multiple triplets consisting of head entities, tail entities, and relation labels, where the extracted relation labels do not appear in the training set. The fundamental idea of prompt tuning is to construct a prompt template and append it to the input text as the input to a pre-trained language model (PLM), thus transforming the classification task into masked language model prediction. Our proposed model, however, does not rely on masked language model prediction; instead, it uses a well-designed prompt template in a structured text format to generate synthetic training data containing the unseen relation categories. Concretely, we utilize the relation labels and incorporate virtual tokens sourced from the relation semantics to construct a structured prompt template for generating synthetic training instances. Moreover, to further enrich and supplement prior knowledge, we draw on an ontology schema based on external knowledge bases to enhance the capability of semantic representation in the prompt template. To address the problem of knowledge heterogeneity, we synergistically optimize these embedding representations by way of collective training. In addition, we carefully design a Multiple Triplets Decoding (MTD) algorithm to overcome the limitation of extracting multiple relation triplets from a sentence, and our proposed model is model-agnostic and can be orthogon…
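As a rough illustration of the structured-prompt idea, the sketch below builds a prompt from an unseen relation label plus ontology-style type hints and asks an off-the-shelf generator to synthesize a training sentence. The template wording, entity types, relation label, and the use of GPT-2 via Hugging Face are assumptions for illustration, not KBPT's exact format or backbone.

```python
# Illustrative structured-prompt sketch for synthesizing zero-shot RTE training data.
from transformers import pipeline

def build_prompt(relation_label: str, head_type: str, tail_type: str) -> str:
    # Virtual tokens derived from the relation semantics plus ontology-schema hints
    # (entity types) are spliced into a fixed, structured text template.
    return (f"Relation: {relation_label}. "
            f"Head entity is a {head_type}; tail entity is a {tail_type}. "
            f"Example sentence expressing this relation:")

generator = pipeline("text-generation", model="gpt2")  # any generative PLM would do

# An unseen relation label, i.e. one never present in the training set.
prompt = build_prompt("headquartered in", "organization", "city")
outputs = generator(prompt, max_new_tokens=30, num_return_sequences=1)
print(outputs[0]["generated_text"])  # synthetic sentence usable as RTE training data
```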
In this paper, we investigate several classes of permutation pentanomials over $${{\mathbb {F}}}_{2^{2m}}$$ of the form $$f(x)=x^t+x^{r_1(q-1)+t}+x^{r_2(q-1)+t}+x^{r_3(q-1)+t}+x^{r_4(q-1)+t}$$, where $$q=2^m$$ and $$ 1\le r_i\le t$$ for $$i\in [1,4]$$. A new technique is presented to describe a sufficient condition for f(x) to be a permutation by investigating two kinds of irreducible factors, called polynomials of nonzero trace and of zero trace, of certain polynomials over $${{\mathbb {F}}}_{2}$$. We resolve the open problem left by the authors in Zhang et al. (Finite Fields Appl 98:102468, 2024). Numerical results suggest that the results in this paper contain all permutation pentanomials of this form with $$\textrm{gcd}(x^{r_4}+x^{r_3}+x^{r_2}+x^{r_1}+1,x^t+x^{t-r_1}+x^{t-r_2}+x^{t-r_3}+x^{t-r_4})=1$$ for $$t>23$$, and that the conditions presented in Theorems 3.1, 3.7, 3.10 and 3.14 of this paper are also necessary.
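For readers who want to experiment, the following self-contained sketch brute-force checks whether a pentanomial of the above form permutes a small field, here GF(2^4) (m = 2) built on the irreducible polynomial x^4 + x + 1. The exponents t and r_i used below are placeholders chosen only to satisfy 1 <= r_i <= t, not values established by the paper's theorems.

```python
# Brute-force permutation check over GF(2^(2m)) for small m (illustrative only).
M = 2                          # so the field is GF(2^(2m)) = GF(16)
N = 2 * M
Q = 2 ** M                     # q = 2^m
IRRED = 0b10011                # x^4 + x + 1, irreducible over GF(2)

def gf_mul(a, b):
    # Carry-less multiplication modulo the irreducible polynomial.
    res = 0
    while b:
        if b & 1:
            res ^= a
        b >>= 1
        a <<= 1
        if a & (1 << N):
            a ^= IRRED
    return res

def gf_pow(a, e):
    # Exponentiation by repeated squaring in GF(2^N).
    result, base = 1, a
    while e:
        if e & 1:
            result = gf_mul(result, base)
        base = gf_mul(base, base)
        e >>= 1
    return result

def f(x, t, rs):
    # f(x) = x^t + sum_i x^{r_i (q-1) + t}; addition is XOR in characteristic 2.
    val = gf_pow(x, t)
    for r in rs:
        val ^= gf_pow(x, r * (Q - 1) + t)
    return val

def is_permutation(t, rs):
    # f permutes the field iff its image hits all 2^N elements.
    return len({f(x, t, rs) for x in range(2 ** N)}) == 2 ** N

# Placeholder exponents satisfying 1 <= r_i <= t; the output simply reports whether
# this particular pentanomial permutes GF(16).
print(is_permutation(t=5, rs=(1, 2, 3, 4)))
```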
Gone are the days when agriculture was said to be a profession of uneducated people who used only basic mechanical tools to survive. Use of technology in agriculture has transformed it into a high-tech, smart and effi...
The field of clinical natural language processing (NLP) can extract useful information from clinical text. Since 2017, the NLP field has shifted towards using pre-trained language models (PLMs), improving performance ...
The issue of building evacuation in the event of a fire is a significant concern in urban planning and architecture. In the absence of appropriate measures, an emergency situation can potentially result in disastrous ...
Dynamic graphs (DG) describe dynamic interactions between entities in many practical scenarios. Most existing DG representation learning models combine graph convolutional network and sequence neural network, which mo...
ISBN (digital): 9798350375107
ISBN (print): 9798350375114
Multi-hop Knowledge Reasoning is a task that involves generating an answer given a query and a knowledge graph. Existing sequence-to-sequence reasoning models use the Transformer to encode and decode sequences, but these models have some flaws, such as the inability to effectively handle long-sequence reasoning and susceptibility to exposure bias. To address these issues, we propose a sequence-to-sequence reasoning model named STSR, which is based on the Retentive Network. Aiming to improve training efficiency through parallel training, the model leverages the advantages of the Retentive Network, reduces time overhead, and enhances reasoning efficiency through iterative reasoning. It effectively mitigates the problems of high spatial overhead and low efficiency in long-sequence reasoning faced by the Transformer. Moreover, it retains and updates historical information during the encoding and decoding process, enhancing the model's memory and generalization capabilities. Additionally, the Scheduled Sampling method is adopted to alleviate the exposure bias introduced during reasoning: during training, the model's own output is used as the input for the next step with a certain probability, instead of the true label. We conduct experiments on six public datasets, and the results show that the proposed model outperforms existing baseline models in terms of precision and generation quality. This paper provides a new solution for the task of sequence-to-sequence multi-hop knowledge reasoning and also demonstrates the potential of the Retentive Network in natural language processing.
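The sketch below illustrates the Scheduled Sampling mechanism in an autoregressive decoding loop: with some probability the decoder consumes its own previous prediction rather than the gold token. The GRU cell stands in for the Retentive decoder, and the vocabulary size, dimensions, and sampling probability are assumptions for illustration only, not STSR's actual configuration.

```python
# Scheduled Sampling sketch in PyTorch (illustrative stand-in decoder).
import random
import torch
import torch.nn as nn

vocab_size, hidden = 100, 32
embed = nn.Embedding(vocab_size, hidden)
decoder_cell = nn.GRUCell(hidden, hidden)       # stand-in for the Retentive decoder
out_proj = nn.Linear(hidden, vocab_size)

def decode_with_scheduled_sampling(gold_tokens, p_model):
    """gold_tokens: (seq_len,) LongTensor of target ids; p_model: probability of
    feeding the model's own prediction instead of the gold token."""
    h = torch.zeros(1, hidden)
    inp = gold_tokens[0].view(1)                # start from the first gold token
    step_logits = []
    for step in range(1, gold_tokens.size(0)):
        h = decoder_cell(embed(inp), h)
        logits = out_proj(h)
        step_logits.append(logits)
        pred = logits.argmax(dim=-1)
        # Scheduled Sampling: mix gold inputs and the model's own predictions.
        inp = pred if random.random() < p_model else gold_tokens[step].view(1)
    return torch.cat(step_logits, dim=0)

gold = torch.randint(0, vocab_size, (10,))
logits = decode_with_scheduled_sampling(gold, p_model=0.25)
loss = nn.functional.cross_entropy(logits, gold[1:])   # targets are shifted by one
loss.backward()
```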
Graph Transformers, which incorporate self-attention and positional encoding, have recently emerged as a powerful architecture for various graph learning tasks. Despite their impressive performance, the complex non-convex interactions across layers and the recursive graph structure have made it challenging to establish a theoretical foundation for learning and generalization. This study introduces the first theoretical investigation of a shallow Graph Transformer for semi-supervised node classification, comprising a self-attention layer with relative positional encoding and a two-layer perceptron. Focusing on a graph data model with discriminative nodes that determine node labels and non-discriminative nodes that are class-irrelevant, we characterize the sample complexity required to achieve a desirable generalization error by training with stochastic gradient descent (SGD). This paper provides a quantitative characterization of the sample complexity and the number of iterations required for convergence as functions of the fraction of discriminative nodes, the dominant patterns, and the initial model errors. Furthermore, we demonstrate that self-attention and positional encoding enhance generalization by making the attention map sparse and by promoting the core neighborhood during training, which explains the superior feature representation of Graph Transformers. Our theoretical results are supported by empirical experiments on synthetic and real-world benchmarks.
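The following compact sketch mirrors the analyzed architecture: one self-attention layer whose scores are biased by a relative positional encoding derived from the adjacency matrix, followed by a two-layer perceptron producing per-node logits. The particular encoding choice, dimensions, and toy data are illustrative assumptions rather than the paper's exact graph data model.

```python
# Shallow Graph Transformer sketch for semi-supervised node classification.
import torch
import torch.nn as nn

class ShallowGraphTransformer(nn.Module):
    def __init__(self, in_dim, hid_dim, num_classes):
        super().__init__()
        self.q = nn.Linear(in_dim, hid_dim)
        self.k = nn.Linear(in_dim, hid_dim)
        self.v = nn.Linear(in_dim, hid_dim)
        self.hid_dim = hid_dim
        self.pos_scale = nn.Parameter(torch.tensor(1.0))   # weight on the positional bias
        self.mlp = nn.Sequential(nn.Linear(hid_dim, hid_dim), nn.ReLU(),
                                 nn.Linear(hid_dim, num_classes))

    def forward(self, x, adj):
        # Self-attention scores biased by a relative positional encoding, taken here
        # to be the adjacency matrix, so neighboring nodes attend more strongly.
        scores = (self.q(x) @ self.k(x).T) / self.hid_dim ** 0.5
        scores = scores + self.pos_scale * adj
        attn = torch.softmax(scores, dim=-1)
        h = attn @ self.v(x)
        return self.mlp(h)                                  # per-node class logits

# Toy semi-supervised usage: only a few nodes are labeled, trained with SGD as in the analysis.
n, d = 8, 16
x = torch.randn(n, d)
adj = (torch.rand(n, n) > 0.7).float()
labels = torch.randint(0, 2, (n,))
labeled = torch.tensor([0, 1, 2])                           # indices of labeled nodes

model = ShallowGraphTransformer(d, 32, num_classes=2)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
logits = model(x, adj)
loss = nn.functional.cross_entropy(logits[labeled], labels[labeled])
loss.backward()
opt.step()
```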
High-dimensional and incomplete (HDI) matrix contains many complex interactions between numerous nodes. A stochastic gradient descent (SGD)-based latent factor analysis (LFA) model is remarkably effective in extractin...