检索结果-内蒙古大学图书馆

arXiv 2022年

作者： Rammal, Mohamad Rida Achille, Alessandro Golatkar, Aditya Diggavi, Suhas Soatto, Stefano

We derive information theoretic generalization bounds for supervised learning algorithms based on a new measure of leave-one-out conditional mutual information (loo-CMI). Contrary to other CMI bounds, which are black-box bounds that do not exploit the structure of the problem and may be hard to evaluate in practice, our loo-CMI bounds can be computed easily and can be interpreted in connection to other notions such as classical leave-one-out cross-validation, stability of the optimization algorithm, and the geometry of the loss-landscape. It applies both to the output of training algorithms as well as their predictions. We empirically validate the quality of the bound by evaluating its predicted generalization gap in scenarios for deep learning. In particular, our bounds are non-vacuous on large-scale image-classification tasks. © 2022, CC BY.

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

T4pdm: A Deep Neural Network Based on the Transformer Architecture for Fault Diagnosis of Rotating Machinery

SSRN

引用

SSRN 2022年

作者： Sperandio Nascimento, Erick Giovani Liang, Julian Santana Figueiredo, Ilan Sousa Guarieiro, Lílian Lefol Nani Surrey Institute for People-Centred AI Faculty of Engineering and Physical Sciences University of Surrey GuildfordGU2 7XH United Kingdom Computational Modeling Department SENAI CIMATEC Bahia Salvador41650-010 Brazil

Deep learning algorithms have become widely used in industrial applications to optimize several tasks in many complex systems, particularly for diagnosing and prognosing machinery health, which have leveraged predictive maintenance (PdM) to be more accurate and reliable in decision making of machinery maintenance. Recently, Transformer Neural Networks have gained notoriety and have been increasingly the favorite choice for Natural Language Processing (NLP) tasks. Thus, motivated by their recent major achievements in NLP, this paper proposes the development and evaluation of an automatic fault classifier model for PdM based on a modified version of the Transformer architecture, namely T4PdM, to identify multiple types of faults in rotating machinery. Experimental results were developed and presented for the MaFaulDa and CWRU public databases. T4PdM was able to achieve an overall accuracy of 99.98% and 98% for both datasets, respectively. In addition, its performance was compared to other previously state of the art works, demonstrating its superiority in detecting and classifying faults in rotating industrial machinery. Therefore, the proposed model can improve the current state of the art performance of machinery fault analysis and diagnostic processes, helping to leverage companies to a new era of the Industry 4.0. Furthermore, this methodology can be adapted to any other task of time series classification. © 2022, The Authors. All rights reserved.

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

DESIGNING NOVEL PROTEIN STRUCTURES USING SEQUENCE GENERATOR AND ALPHAFOLD2

arXiv

引用

arXiv 2022年

作者： Agha, Xeerak Fu, Nihang Hu, Jianjun Department of Computer Science and Engineering University of South Carolina ColumbiaSC29201 United States

Protein structures and functions are determined by a contiguous arrangement of amino acid sequences. Designing novel protein sequences and structures with desired geometry and functions is a complex task with large state spaces. Here we develop a novel protein design pipeline consisting of two deep learning algorithms, ProteinSolver and AlphaFold2. ProteinSolver is a deep graph neural network that generates amino acid sequences such that the forces between interacting amino acids are favorable and compatible with the fold while AlphaFold2 is a deep learning algorithm that predicts the protein structures from protein sequences. We present forty de novo designed binding sites of the PTP1B and P53 proteins with high precision, out of which thirty proteins are novel. Using ProteinSolver and AlphaFold2 in conjunction, we can trim the exploration of the large protein conformation space, thus expanding the ability to find novel and diverse de novo protein designs. © 2022, CC BY-NC-SA.

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

A Double-Strategy-Check Active learning Algorithm for Hyperspectral Image Classification

引用

PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING 2019年第11期85卷 841-851页

作者： Cui, Ying Ji, Xiaowei Xu, Kai Wang, Liguo Harbin Engn Univ Coll Informat & Commun Engn Harbin 150001 Heilongjiang Peoples R China

Applying limited labeled samples to improve classification results is a challenge in hyperspectral images. Active learning (AL) and Semisupervised learning (SSL) are two promising techniques to achieve this challenge. Combining AL with SSL is an excellent idea for hyperspectral image classification. The traditional method, such as the Collaborative Active and Semisupervised learning algorithm (CASSL), may introduce many incorrect pseudolabels and shows premature convergence. To overcome these drawbacks, a novel framework named Double-Strategy-Check Collaborative Active and Semisupervised learning (DSC-CASSL) is proposed in this paper. This framework combines two different AL algorithms and SSL in a collaborative mode. The double-strategy verification can gradually improve the pseudolabeling accuracy and facilitate SSL. We evaluate the performance of DSC-CASSL on four hyperspectral data sets and compare it with that of four hyperspectral image classification methods. Our results suggest that DSC-CASSL leads to consistent improvement for hyperspectral image classification.

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

Semi-supervised CRF Chinese Word Segmentation based on Neural Network 21

Semi-supervised CRF Chinese Word Segmentation based on Neura...

引用

21st Chinese National Conference on Computational Linguistic, CCL 2022

作者： Luo, Zhiyong Zhang, Mingming Han, Yujiao Zhao, Zhilin Beijing Language and Culture University China

Chinese word segmentation (CWS) is a fundamental task of natural language processing. Currently, CWS model using fully supervised learning technology has achieved good results in the common domain. However, it has the problem of relying on the large-scale annotated corpus and poor domain migration capability, especially the cross-domain OOV word recognition is not effective. In order to alleviate these problems, this paper proposes a semi-supervised CWS framework that uses relatively easy-to-obtain unlabeled texts in the target domain to achieve cross-domain transfer. We design a semi-supervised model based on word memory network and sequence conditional entropy. Our model based on this framework achieves significant improvements in F-scores and ROOV on several datasets,some of them are *** maximum F-value and ROOV improvements are 2.35% and 12.12%. © 2022 China National Conference on Computational Linguistics Published under Creative Commons Attribution 4.0 International License.

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

Scene recognition of opencast coal mine areas based on scene sub-region multi-label learning

引用

National Remote Sensing Bulletin 2022年第9期26卷 1849-1858页

作者： Zhao, Yindi Wei, Hongyu Dong, Jihong Dong, Chang School of Environment Science and Spatial Informatics China University of Mining and Technology Xuzhou221116 China

Objective: With the development of remote sensing technology, high-resolution remote sensing images become available to scene recognition of opencast coal mine areas, which is conducive to the supervision of opencast coal mine areas for environmental governance. The scene is divided into multiple sub-regions for feature learning and recognition. Aiming at the poor performance of sub-region recognition based on single-label learning, this paper combines a multi-label learning strategy with the first law of geography to propose a scene recognition algorithm based on scene sub-region multi-label learning. Method: In order to distinguish the scene of opencast coal mine areas from its surrounding scene, 6 types of mining tags and 7 non-mining tags are set. The sub-regions, cropped from the scene of opencast coal mine areas and its surrounding scene, are labeled with 13 types of tags to form a multi-label dataset. Train the dataset with the Inception_v3 based on multi-label learning. The input remote sensing images are divided into sub-regions of the same size, and multi-label classification is performed on the sub-regions with the trained model. In order to recognize the sub-regions belonging to the scene of the opencast coal mine areas according to the multi-label classification results, a scene sub-region determination algorithm is introduced. Using the label correlation and the label integrity of the mining tags to determine whether the sub-region, containing the mining tags, belongs to the scene of the opencast coal mine areas. And the recognized sub-regions constitute the scene of the opencast coal mine areas. Result: The results show that, in scene sub-region recognition of opencast coal mine areas, compared with single-label learning algorithms, the F1-score of the proposed method, 0.857, is increased by up to 8 percentage points. On the remote sensing image of the study area, the recognition results of proposed method in scene recognition of opencast coal mine ar

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

Desperately Searching for Something

SSRN

引用

SSRN 2022年

作者： Grindrod, Peter Bowman, Clive E. Mathematical Institute University of Oxford OxfordOX2 6GG United Kingdom

There is a growing interest in novelty search : that is, in sampling a parameter space to search for radical or unexpected behaviour(s), occurring as a consequence of parameter choice, being input to some downstream complex system, process, or service that will not yield to analysis, without imposing any specific pre-ordained objective function, or fitness function to be optimised. We mean "parameter" in the widest sense, including system learnables, non-autonomous forcing, sequencing and all *** upon the nature of the underlying parameter space of interest one may adopt a rather wide range of search algorithms. We do consider that this search activity has meta-objectives , though: one is of achieving diversity (efficiently reaching out across the space in some way);and one is of achieving some minimum density (not leaving out large unexplored holes). These are in tension. In general, the computational costs of both of these qualities become restrictive as the di- mension of the parameter spaces increase;and consequently their balance is harder to maintain. We may also wish for a substantial random element of search to provide some luck in discovery and to avoid any naive preset sampling *** consider archive-based methods within a range of spaces: finite discrete spaces, where the problem is straightforward (provided we are patient with the random element);Euclidean spaces, of increasing dimension, that become very lonely places;and infinite dimensional spaces. Our aim is to discuss a raft of distinctive search concepts, that respond to identified challenges, and rely on a rather diverse range of mathematical ideas. This arms practitioners with a range of highly practical *** applications requiring novelty search arise, one should avoid rushing to code-up a standard evolving search algorithm and instead give some thought to the nature and requirements of the search: there is a range of effective options available. We give some

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

The Use of Artificial Intelligence for Automating or Semi-Automating Biomedical Literature Analyses: A Scoping Review

SSRN

引用

SSRN 2022年

作者： Oliveira dos Santos, Álisson Sergio da Silva, Eduardo Machado Couto, Letícia Valadares Labanca Reis, Gustavo Silva Belo, Vinícius Federal University of São João del-Rei Campus Centro-Oeste Dona Lindu Minas Gerais Divinópolis Brazil Federal University of Ouro Preto School of Medicine Campus Morro do Cruzeiro Minas Gerais Ouro Preto Brazil

Objective: Evidence-based medicine (EBM) is a decision-making process based on the conscious and judicious use of the best available scientific evidence. However, the exponential increase in the amount of information currently available likely exceeds the capacity of human-only analysis. In this context, artificial intelligence (AI) and its branches such as machine learning (ML) can be used to facilitate human efforts in analyzing the literature to foster EBM. The present scoping review aimed to examine the use of AI in the automation of biomedical literature survey and analysis with a view to establishing the state-of-the-art and identifying knowledge *** and methods: Comprehensive searches of the main databases were performed for articles published up to June 2022 and studies were selected according to inclusion and exclusion criteria. Data were extracted from the included articles and the findings ***: The total number of records retrieved from the databases was 12,145, of which 273 were included in the review. Classification of the studies according to the use of AI in evaluating the biomedical literature revealed three main application groups, namely assembly of scientific evidence (n=127;47%), mining the biomedical literature (n=112;41%) and quality analysis (n=34;12%). Most studies addressed the preparation of systematic reviews, while articles focusing on the development of guidelines and evidence synthesis were the least frequent. The biggest knowledge gap was identified within the quality analysis group, particularly regarding methods and tools that assess the strength of recommendation and consistency of ***: Our review shows that, despite significant progress in the automation of biomedical literature surveys and analyses in recent years, intense research is needed to fill knowledge gaps on more difficult aspects of ML, deep learning and natural language processing, and to consolidate the use of automation by en

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

A Proximal Algorithm for Sampling from Non-convex Potentials

arXiv

引用

arXiv 2022年

作者： Liang, Jiaming Chen, Yongxin School of Industrial and Systems Engineering Georgia Institute of Technology AtlantaGA30332 United States School of Aerospace Engineering Georgia Institute of Technology AtlantaGA30332 United States

We study sampling problems associated with non-convex potentials that meanwhile lack smoothness. In particular, we consider target distributions that satisfy either logarithmic-Sobolev inequality or Poincaré inequality. Rather than smooth, the potentials are assumed to be semi-smooth or the summation of multiple semi-smooth functions. We develop a sampling algorithm that resembles proximal algorithms in optimization for this challenging sampling task. Our algorithm is based on a special case of Gibbs sampling known as the alternating sampling framework (ASF). The key contribution of this work is a practical realization of the ASF based on rejection sampling in the non-convex and semi-smooth setting. This work extends the recent algorithm in [24, 25] for non-smooth/semi-smooth log-concave distribution to the setting with non-convex potentials. In almost all the cases of sampling considered in this work, our proximal sampling algorithm achieves better complexity than all existing methods. © 2022, CC BY.

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

Towards Adversarial Evaluations for Inexact Machine Unlearning

arXiv

引用

arXiv 2022年

作者： Goel, Shashwat Prabhu, Ameya Sanyal, Amartya Lim, Ser-Nam Torr, Philip Kumaraguru, Ponnurangam IIIT Hyderabad India University of Oxford United Kingdom ETH Zurich Switzerland MPI-IS Germany Meta AI United States

Machine learning models face increased concerns regarding the storage of personal user data and adverse impacts of corrupted data like backdoors or systematic bias. Machine Unlearning can address these by allowing post-hoc deletion of affected training data from a learned model. Achieving this task exactly is computationally expensive;consequently, recent works have proposed inexact unlearning algorithms to solve this approximately as well as evaluation methods to test the effectiveness of these algorithms. In this work, we first outline some necessary criteria for evaluation methods and show no existing evaluation satisfies them all. Then, we design a stronger black-box evaluation method called the Interclass Confusion (IC) test which adversarially manipulates data during training to detect the insufficiency of unlearning procedures. We also propose two analytically motivated baseline methods (EU-k and CF-k) which outperform several popular inexact unlearning methods. Overall, we demonstrate how adversarial evaluation strategies can help in analyzing various unlearning phenomena which can guide the development of stronger unlearning algorithms. © 2022, CC BY.

关键词： learning algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：