检索结果-内蒙古大学图书馆

arXiv 2021年

作者： Zhang, Tianren Guo, Shangqi Tan, Tian Hu, Xiaolin Chen, Feng The Department of Automation Tsinghua University Beijing100086 China The Beijing Innovation Center for Future Chip Beijing100086 China The LSBDPA Beijing Key Laboratory Beijing100084 China The Department of Civil and Environmental Engineering Stanford University Stanford CA94305 United States The Department of Computer Science and Technology Institute for Artificial Intelligence Beijing National Research Center for Information Science and Technology State Key Laboratory of Intelligent Technology and Systems Tsinghua University Beijing100084 China

Goal-conditioned Hierarchical Reinforcement Learning (HRL) is a promising approach for scaling up reinforcement learning (RL) techniques. However, it often suffers from training inefficiency as the action space of the high-level, i.e., the goal space, is large. Searching in a large goal space poses difficulty for both high-level subgoal generation and low-level policy learning. In this paper, we show that this problem can be effectively alleviated by restricting the high-level action space from the whole goal space to a k-step adjacent region of the current state using an adjacency constraint. We theoretically prove that in a deterministic Markov Decision Process (MDP), the proposed adjacency constraint preserves the optimal hierarchical policy, while in a stochastic MDP the adjacency constraint induces a bounded state-value suboptimality determined by the MDP’s transition structure. We further show that this constraint can be practically implemented by training an adjacency network that can discriminate between adjacent and non-adjacent subgoals. Experimental results on discrete and continuous control tasks including challenging simulated robot locomotion and manipulation tasks show that incorporating the adjacency constraint significantly boosts the performance of state-of-the-art goal-conditioned HRL approaches. Copyright © 2021, The Authors. All rights reserved.

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Predicting extreme events from data using deep machine learning: when and where

arXiv

引用

arXiv 2022年

作者： Jiang, Junjie Huang, Zi-Gang Grebogi, Celso Lai, Ying-Cheng The Key Laboratory of Biomedical Information Engineering of Ministry of Education Institute of Health and Rehabilitation Science School of Life Science and Technology Research Center for Brain-inspired Intelligence Xi’an Jiaotong University The Key Laboratory of Neuro-informatics & Rehabilitation Engineering of Ministry of Civil Affairs Shaanxi Xi’an China School of Electrical Computer and Energy Engineering Arizona State University TempeAZ85287 United States Institute for Complex Systems and Mathematical Biology School of Natural and Computing Sciences King’s College University of Aberdeen AB24 3UE United Kingdom Department of Physics Arizona State University TempeAZ85287 United States

We develop a deep convolutional neural network (DCNN) based framework for model-free prediction of the occurrence of extreme events both in time ("when") and in space ("where") in nonlinear physical systems of spatial dimension two. The measurements or data are a set of two-dimensional snapshots or images. For a desired time horizon of prediction, a proper labeling scheme can be designated to enable successful training of the DCNN and subsequent prediction of extreme events in time. Given that an extreme event has been predicted to occur within the time horizon, a space-based labeling scheme can be applied to predict, within certain resolution, the location at which the event will occur. We use synthetic data from the 2D complex Ginzburg-Landau equation and empirical wind speed data of the North Atlantic ocean to demonstrate and validate our machine-learning based prediction framework. The trade-offs among the prediction horizon, spatial resolution, and accuracy are illustrated, and the detrimental effect of spatially biased occurrence of extreme event on prediction accuracy is discussed. The deep learning framework is viable for predicting extreme events in the real world. © 2022, CC BY.

关键词： Forecasting

来源：评论

学校读者我要写书评

暂无评论

An Enhanced Fault Diagnosis Method with Uncertainty Quantification Using Bayesian Convolutional Neural Network

An Enhanced Fault Diagnosis Method with Uncertainty Quantifi...

引用

IEEE International Conference on Automation Science and Engineering (CASE)

作者： Qihang Fang Gang Xiong Xiuqin Shang Sheng Liu Bin Hu Zhen Shen State Key Laboratory for Management and Control of Complex Systems Institute of Automation Chinese Academy of Sciences School of Artificial Intelligence University of Chinese Academy of Sciences Beijing Engineering Research Center of Intelligent Systems and Technology Institute of Automation Guangdong Engineering Research Center of 3D Printing and Intelligent Manufacturing Cloud Computing Center Chinese Academy of Sciences China

ISBN: (数字)9781728169040

ISBN: (纸本)9781728169057

Fault diagnosis is a vital technique to pinpoint the machine malfunctions in manufacturing systems. In recent years, the deep learning techniques greatly improve the fault detection accuracy, but there still remain some problems. If one fault is absent in the training data or the fault signal is disturbed by severe noise interference, the fault classifier may misjudge the health state. This problem limits the reliability of the fault diagnosis in real applications. In this paper, we enhance the fault diagnosis method by using Bayesian Convolutional Neural Network (BCNN). A Shannon entropy-based method is presented to quantify the prediction uncertainty. The BCNN turns the deterministic predictions to probabilistic distributions and enhances the robustness of the fault diagnosis. The uncertainty quantification method helps to indicate the wrong predictions, detect unknown faults, and discover the strong disturbances. Then, a fine-tuning strategy is applied to enhance the model performance further. The potential usability of the proposed method in monitoring the motors of 3D printers is studied. And the experiment is conducted on a motor bearing dataset provided by Case Western Reserve University. The proposed BCNN achieves 99.82% fault classification accuracy over nine health conditions. Its robustness is verified by comparing the testing accuracy with three other methods on the noisy datasets. And the uncertainty quantification method successfully detects the outlier inputs.

关键词： Fault diagnosis Uncertainty Bayes methods Circuit faults Robustness Predictive models Noise measurement

来源：评论

学校读者我要写书评

暂无评论

ACG-Engine: An Inference Accelerator for Content Generative Neural Networks

ACG-Engine: An Inference Accelerator for Content Generative ...

引用

IEEE International Conference on computer-Aided Design

作者： Haobo Xu Ying Wang Yujie Wang Jiajun Li Bosheng Liu Yinhe Han University of Chinese Academy of Sciences State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences Research Center for Intelligent Computing Systems Institute of Computing Technology Chinese Academy of Sciences

The technological breakthrough in Generative Adversarial Networks (GAN) has propelled the advancement of content generative applications such as AI-based paintings, style transfer, and music composition. However, in contrast to previous deep learning models for prediction and categorization, generative networks generally rely on instance normalization (IN) layer for better feature distribution, which performs significantly better than batch normalization(BN) in image style-transfer, image to image translation, etc. Unlike batch or group normalization that can be fused into convolutional layers and ignored during the network inference stage, an instance normalization layer induces intensive computation and memory access. However, prior deep learning accelerator designs for traditional Neural Network and Generative Adversarial Networks mostly focus on the acceleration of convolution and deconvolution layer but lack of support for IN operations, which could become a performance bottleneck on edge devices with insufficient computational power. To address this problem, we propose an inference accelerator for content generation (ACG-Engine) aimed to support the fundamental operations of generative networks, including convolution layers, deconvolution layers, specifically instance normalization layer. We performed a hardware-aware mathematical transformation of the IN operation for less computation complexity and memory-friendliness, so that it can be efficiently mapped to the classic 2D processing element array. Owing to the proposed optimization techniques, ACG-Engine achieves 4.56X speedup and improve power efficiency up to 29X compared to prior baseline acceleration scheme in generative network acceleration. In addition, ACG-Engine can achieve performance comparable to the classic CNN-specific accelerators with negligible power consumption and area overhead.

关键词： Performance evaluation Deep learning Deconvolution Convolution Neural networks Generative adversarial networks Acceleration

来源：评论

学校读者我要写书评

暂无评论

VALID: A Comprehensive Virtual Aerial Image Dataset

VALID: A Comprehensive Virtual Aerial Image Dataset

引用

IEEE International Conference on Robotics and Automation (ICRA)

作者： Lyujie Chen Feng Liu Yan Zhao Wufan Wang Xiaming Yuan Jihong Zhu Department of Computer Science and Technology State Key Laboratory of Intelligent Technology and Systems Beijing National Research Center for Information Science and Technology Tsinghua University Beijing Department of Precision Instrument Tsinghua University Beijing

ISBN: (数字)9781728173955

ISBN: (纸本)9781728173962

Aerial imagery plays an important role in land-use planning, population analysis, precision agriculture, and unmanned aerial vehicle tasks. However, existing aerial image datasets generally suffer from the problem of inaccurate labeling, single ground truth type, and few category numbers. In this work, we implement a simulator that can simultaneously acquire diverse visual ground truth data in the virtual environment. Based on that, we collect a comprehensive Virtual AeriaL Image Dataset named VALID, consisting of 6690 high-resolution images, all annotated with panoptic segmentation on 30 categories, object detection with oriented bounding box, and binocular depth maps, collected in 6 different virtual scenes and 5 various ambient conditions (sunny, dusk, night, snow and fog). To our knowledge, VALID is the first aerial image dataset that can provide panoptic level segmentation and complete dense depth maps. We analyze the characteristics of VALID and evaluate state-of-the-art methods for multiple tasks to provide reference baselines. The experiment results demonstrate that VALID is well presented and challenging. The dataset is available at https://***/view/valid-dataset/.

关键词： Image segmentation Semantics Task analysis Object detection Image color analysis Benchmark testing Labeling

来源：评论

学校读者我要写书评

暂无评论

Neural quality estimation with multiple hypotheses for grammatical error correction

arXiv

引用

arXiv 2021年

作者： Liu, Zhenghao Yi, Xiaoyuan Sun, Maosong Yang, Liner Chua, Tat-Seng Department of Computer Science and Technology Tsinghua University Beijing China Institute for Artificial Intelligence Tsinghua University Beijing China Beijing National Research Center for Information Science and Technology State Key Lab on Intelligent Technology and Systems Tsinghua University Beijing China Beijing Academy of Artificial Intelligence Beijing Language and Culture University Beijing China School of Computing National University of Singapore Singapore

Grammatical Error Correction (GEC) aims to correct writing errors and help language learners improve their writing skills. However, existing GEC models tend to produce spurious corrections or fail to detect lots of errors. The quality estimation model is necessary to ensure learners get accurate GEC results and avoid misleading from poorly corrected sentences. Well-trained GEC models can generate several high-quality hypotheses through decoding, such as beam search, which provide valuable GEC evidence and can be used to evaluate GEC quality. However, existing models neglect the possible GEC evidence from different hypotheses. This paper presents the Neural Verification Network (VERNet) for GEC quality estimation with multiple hypotheses. VERNet establishes interactions among hypotheses with a reasoning graph and conducts two kinds of attention mechanisms to propagate GEC evidence to verify the quality of generated hypotheses. Our experiments on four GEC datasets show that VERNet achieves state-of-the-art grammatical error detection performance, achieves the best quality estimation results, and significantly improves GEC performance by reranking hypotheses. All data and source codes are available at https://***/thunlp/VERNet. Copyright © 2021, The Authors. All rights reserved.

关键词： Error correction

来源：评论

学校读者我要写书评

暂无评论

SAMamba: Adaptive state space modeling with hierarchical vision for infrared small target detection

引用

Information Fusion 2025年 124卷

作者： Xu, Wenhao Zheng, Shuchen Wang, Changwei Zhang, Zherui Ren, Chuan Xu, Rongtao Xu, Shibiao School of Artificial Intelligence Beijing University of Posts and Telecommunications Beijing100876 China Jinan250014 China The State Key Laboratory of Multimodal Artificial Intelligence Systems Institute of Automation Chinese Academy of Sciences Beijing100190 China School of Artificial Intelligence University of Chinese Academy of Sciences Beijing100190 China Shandong Provincial Key Laboratory of Computing Power Internet and Service Computing Shandong Fundamental Research Center for Computer Science Jinan250014 China School of Software Microelectronics Peking University Beijing100084 China

Infrared small target detection (ISTD) is vital for long-range surveillance systems, particularly in military defense, maritime monitoring, and early warning applications. Despite its strategic importance, ISTD remains challenging due to two fundamental limitations: targets typically occupy less than 0.15% of the image area and exhibit low distinguishability from complex backgrounds. While recent advances in deep learning have shown promise, existing methods struggle with information loss during downsampling and inefficient modeling of global context. This paper presents SAMamba, a novel framework that synergistically integrates SAM2's hierarchical feature learning with Mamba's selective sequence modeling to address these challenges. Our key innovations include: (1) Feature Selection Adapter (FS-Adapter) that enables efficient domain adaptation from natural to infrared imagery by employing a dual-stage selection mechanism, which includes token-level selection via a learnable task embedding and channel-wise refinement through adaptive transformations;(2) Cross-Channel state-Space Interaction (CSI) module that achieves efficient global context modeling through selective state space modeling with linear complexity;and (3) Detail-Preserving Contextual Fusion (DPCF) module that adaptively combines multi-scale features through learnable fusion strategies, utilizing a gating mechanism to balance contributions from high-resolution and low-resolution features. SAMamba effectively addresses the core challenges of ISTD by bridging the domain gap, maintaining fine-grained target details, and efficiently modeling long-range dependencies. Extensive experiments on NUAA-SIRST, IRSTD-1k and NUDT-SIRST datasets demonstrate that SAMamba significantly outperforms state-of-the-art methods, particularly in challenging scenarios with heterogeneous backgrounds and varying target scales. Code is available at https://***/zhengshuchen/SAMamba. © 2025 Elsevier B.V.

关键词： Feature Selection

来源：评论

学校读者我要写书评

暂无评论

反向连边在大型分层网络中的影响（英文）

引用

Engineering 2024年第5期 240-249页

作者：曹浩森胡斌斌莫小雨陈都鑫高建喜袁烨陈关荣 Tamás Vicsek 管晓宏张海涛 MOE Engineering Research Center of Autonomous Intelligent Unmanned Systems State Key Laboratory of Intelligent Manufacturing Equipment and Technology School of Artificial Intelligence and Automation Huazhong University of Science and Technology School of Mechanical and Aerospace Engineering Nanyang Technological University Jiangsu Key Laboratory of Networked Collective Intelligence School of Mathematics Southeast University Department of Computer Science Rensselaer Polytechnic Institute Department of Electronic Engineering City University of Hong Kong Department of Biological Physics E?tv?s University MOE Key Laboratory for Intelligent Networks and Network Security School of Automation Science and Engineering Xi'an Jiaotong University

Hierarchical networks are frequently encountered in animal groups, gene networks, and artificial engineering systems such as multiple robots, unmanned vehicle systems, smart grids, wind farm networks,and so forth. The structure of a large directed hierarchical network is often strongly influenced by reverse edges from lower-to higher-level nodes, such as lagging birds' howl in a flock or the opinions of lowerlevel individuals feeding back to higher-level ones in a social group. This study reveals that, for most large-scale real hierarchical networks, the majority of the reverse edges do not affect the synchronization process of the entire network; the synchronization process is influenced only by a small part of these reverse edges along specific paths. More surprisingly, a single effective reverse edge can slow down the synchronization of a huge hierarchical network by over 60%. The effect of such edges depends not on the network size but only on the average in-degree of the involved subnetwork. The overwhelming majority of active reverse edges turn out to have some kind of ‘‘bunchingo effect on the information flows of hierarchical networks, which slows down synchronization processes. This finding refines the current understanding of the role of reverse edges in many natural, social, and engineering hierarchical networks,which might be beneficial for precisely tuning the synchronization rhythms of these networks. Our study also proposes an effective way to attack a hierarchical network by adding a malicious reverse edge to it and provides some guidance for protecting a network by screening out the specific small proportion of vulnerable nodes.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Delving deeper into the decoder for video captioning

arXiv

引用

arXiv 2020年

作者： Chen, Haoran Li, Jianmin Hu, Xiaolin State Key Laboratory of Intelligent Technology and Systems Institute for Artificial Intelligence Beijing National Research Center for Information Science and Technology THBI Department of Computer Science and Technology Tsinghua University Beijing100084 China

Video captioning is an advanced multi-modal task which aims to describe a video clip using a natural language sentence. The encoder-decoder framework is the most popular paradigm for this task in recent years. However, there exist some problems in the decoder of a video captioning model. We make a thorough investigation into the decoder and adopt three techniques to improve the performance of the model. First of all, a combination of variational dropout and layer normalization is embedded into a recurrent unit to alleviate the problem of overfitting. Secondly, a new online method is proposed to evaluate the performance of a model on a validation set so as to select the best checkpoint for testing. Finally, a new training strategy called professional learning is proposed which uses the strengths of a captioning model and bypasses its weaknesses. It is demonstrated in the experiments on Microsoft research Video Description Corpus (MSVD) and MSR-Video to Text (MSR-VTT) datasets that our model has achieved the best results evaluated by BLEU, CIDEr, METEOR and ROUGE-L metrics with significant gains of up to 18% on MSVD and 3.5% on MSR-VTT compared with the previous state-of-the-art *** Codes 68T45, 68T50. Copyright © 2020, The Authors. All rights reserved.

关键词： Decoding

来源：评论

学校读者我要写书评

暂无评论

Author Correction: Discovering CRISPR-Cas system with self-processing pre-crRNA capability by foundation models

引用

Nature communications 2025年第1期16卷 535页

作者： Wenhui Li Xianyue Jiang Wuke Wang Liya Hou Runze Cai Yongqian Li Qiuxi Gu Qinchang Chen Peixiang Ma Jin Tang Menghao Guo Guohui Chuai Xingxu Huang Jun Zhang Qi Liu State Key Laboratory of Cardiology and Medical Innovation Center Shanghai East Hospital Frontier Science Center for Stem Cell Research Bioinformatics Department School of Life Sciences and Technology Tongji University Shanghai China. Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University) Ministry of Education Orthopaedic Department of Tongji Hospital Frontier Science Center for Stem Cell Research Bioinformatics Department School of Life Sciences and Technology Tongji University Shanghai China. Research Center for Life Sciences Computing Zhejiang Lab Hangzhou Zhejiang China. State Key Laboratory of Reproductive Medicine and Offspring Health Women's Hospital of Nanjing Medical University Nanjing Maternity and Child Health Care Hospital Nanjing Medical University Nanjing China. Shanghai Key Laboratory of Orthopedic Implants Department of Orthopedic Surgery Shanghai Ninth People's Hospital Shanghai Jiao Tong University School of Medicine Shanghai China. State Key Laboratory of Cardiology and Medical Innovation Center Shanghai East Hospital Frontier Science Center for Stem Cell Research Bioinformatics Department School of Life Sciences and Technology Tongji University Shanghai China. 18alexanderm117@***. Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University) Ministry of Education Orthopaedic Department of Tongji Hospital Frontier Science Center for Stem Cell Research Bioinformatics Department School of Life Sciences and Technology Tongji University Shanghai China. 18alexanderm117@***. National Key Laboratory of Autonomous Intelligent Unmanned Systems Frontiers Science Center for Intelligent Autonomous Systems Ministry of Education Shanghai Research Institute for Intelligent Autonomous Systems Shanghai China. 18alexanderm117@***. Research Center for Life Sciences Computing Zhejiang Lab Hangzhou Zhejiang China. xingxuhuang@zju.edu.cn. The Key Laboratory of Pancreatic Diseases of Zhejiang Province

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：