检索结果-内蒙古大学图书馆

42nd Chinese Control Conference, CCC 2023

作者： Ma, Waner Qu, Qingyu Liu, Kexin Beihang University Automation Science and Electrical Engineering Beijing100191 China

ISBN: (纸本)9789887581543

This paper studies the distributed flexible flow shop scheduling problem (DFFSP), where the transportation time between different factories needs to be considered and each machine has a different startup time. A mixed integer linear programming (MILP) model of DFFSP is proposed, and a smart algorithm based on imitation learning for branch-and-bound (B&B) is used to find the scheduling plan that minimizes the total processing time. The graph convolutional neural network model is trained using imitation learning from strong branch expert rules. Finally, we demonstrate the efficiency of our algorithm with simulation experiments. The results indicate that our algorithm demonstrates the most efficient search performance with respect to both the number of nodes explored and search time compared to the four traditional B&B strategies. © 2023 Technical Committee on Control Theory, Chinese Association of Automation.

关键词： neural network models

来源：评论

学校读者我要写书评

暂无评论

Online Meta-Learning via Learning with Layer-distributed Memory 35

Online Meta-Learning via Learning with Layer-Distributed Mem...

引用

35th Annual Conference on neural Information processing Systems (NeurIPS)

作者： Babu, Sudarshan Savarese, Pedro Maire, Michael TTI C Chicago IL 60637 USA Univ Chicago Chicago IL 60637 USA

ISBN: (纸本)9781713845393

We demonstrate that efficient meta-learning can be achieved via end-to-end training of deep neural networks with memory distributed across layers. The persistent state of this memory assumes the entire burden of guiding task adaptation. Moreover, its distributed nature is instrumental in orchestrating adaptation. Ablation experiments demonstrate that providing relevant feedback to memory units distributed across the depth of the network enables them to guide adaptation throughout the entire network. Our results show that this is a successful strategy for simplifying meta-learning - often cast as a bi-level optimization problem - to standard end-to-end training, while outperforming gradient-based, prototype-based, and other memory-based meta-learning strategies. Additionally, our adaptation strategy naturally handles online learning scenarios with a significant delay between observing a sample and its corresponding label - a setting in which other approaches struggle. Adaptation via distributed memory is effective across a wide range of learning tasks, ranging from classification to online few-shot semantic segmentation.

关键词： Memory architecture

来源：评论

学校读者我要写书评

暂无评论

Training on Polar Image Transformations Improves Biomedical Image Segmentation

引用

IEEE ACCESS 2021年 9卷 133365-133375页

作者： Bencevic, Marin Galic, Irena Habijan, Marija Babin, Danilo JJ Strossmayer Univ Osijek Fac Elect Engn Comp Sci & Informat Technol Osijek 31000 Croatia Univ Ghent Fac Engn & Architecture imec TELINIPI B-9000 Ghent Belgium

A key step in medical image-based diagnosis is image segmentation. A common use case for medical image segmentation is the identification of single structures of an elliptical shape. Most organs like the heart and kidneys fall into this category, as well as skin lesions, polyps, and other types of abnormalities. neural networks have dramatically improved medical image segmentation results, but still require large amounts of training data and long training times to converge. In this paper, we propose a general way to improve neural network segmentation performance and data efficiency on medical imaging segmentation tasks where the goal is to segment a single roughly elliptically distributed object. We propose training a neural network on polar transformations of the original dataset, such that the polar origin for the transformation is the center point of the object. This results in a reduction of dimensionality as well as a separation of segmentation and localization tasks, allowing the network to more easily converge. Additionally, we propose two different approaches to obtaining an optimal polar origin: (1) estimation via a segmentation trained on non-polar images and (2) estimation via a model trained to predict the optimal origin. We evaluate our method on the tasks of liver, polyp, skin lesion, and epicardial adipose tissue segmentation. We show that our method produces state-of-the-art results for lesion, liver, and polyp segmentation and performs better than most common neural network architectures for biomedical image segmentation. Additionally, when used as a pre-processing step, our method generally improves data efficiency across datasets and neural network architectures.

关键词： Image segmentation neural networks Biomedical imaging Training Task analysis Medical diagnostic imaging Lesions Convolutional neural network medical image processing medical image segmentation semantic segmentation

来源：评论

学校读者我要写书评

暂无评论

Optimization Design of English Translation Error Recognition Algorithm Based on Particle Swarm Optimization neural network

Optimization Design of English Translation Error Recognition...

引用

2023 International Conference on Power, Electrical Engineering, Electronics and Control, PEEEC 2023

作者： Gao, Fei Wenhua College Faculty of Foreign Languages Hubei Wuhan430000 China

ISBN: (纸本)9798350329124

Nowadays, there are a large number of bilingual translated texts on the Internet. It is a crucial problem to build a practical bilingual corpus through the processing of translated texts. Based on NN(neural network) encoder, the source language sequence with start and end marks is input, which is converted into distributed word vector data, and then transmitted to NN, including the background vector of source language information, which is input to decoder NN, and the target language sequence is calculated and input with NN as the carrier. This directly leads to the fact that when English translation robots translate English audio, it is difficult to get the same English audio data as the original text, thus reducing translation efficiency or causing translation errors. Improve the ability to eliminate translation errors. In this paper, PSO (particle swarm optimization) algorithm is put forward to optimize it. Simulation results show that the recognition speed of this algorithm is relatively high, and PSO algorithm is used to automatically optimize it, which realizes the elimination of English translation errors. Finally, the experimental results are tested and analyzed, and a conclusion is drawn, which verifies the feasibility of this algorithm. © 2023 IEEE.

关键词： Errors

来源：评论

学校读者我要写书评

暂无评论

Prototyping a Biologically Plausible Neuron Model on a Multi-FPGA System

Prototyping a Biologically Plausible Neuron Model on a Multi...

引用

IEEE 3rd Colombian BioCAS Workshop (ColBioCAS)

作者： Salazar-Garcia, Carlos Chacon-Rodriguez, Alfonso Rimolo-Donadio, Renato Garcia-Ramirez, Ronny Strydis, Christos Costa Rica Inst Technol Elect Cartago Costa Rica Costa Rica Inst Technol Mechatron Engn Cartago Costa Rica Erasmus MC Dept Neurosci Rotterdam Netherlands

ISBN: (纸本)9798350306132

A hardware-based computational-efficient biophysical model of a neural network that considers the inferior olivary nucleus is presented. The implementation uses a multi-FPGA system based on the PlasticNet interconnection framework, which enables the implementation of a cost-effective and scalable model able to handle over ten thousand neurons with five FPGA evaluation boards. With the same setup, it was possible to simulate one thousand neurons in real time.

关键词： Custom FPGA networks HLS inter-FPGA communication multi-FPGA distributed processing

来源：评论

学校读者我要写书评

暂无评论

Data processing Centre's Cyberattack Protection Directions on the Base of neural network Algorithms 4

Data Processing Centre's Cyberattack Protection Directions o...

引用

4th International Scientific Conference "Information Technology and Implementation", IT and I 2022

作者： Shestak, Yanina Toliupa, Serhii Shevchenko, Anatolii Torchylo, Anna Onyigwang, Ogbu James Taras Shevchenko National University of Kyiv 24 B. Havrylyshyna Str. Kyiv04116 Ukraine University of Ibadan Oyo State Ibadan200284 Nigeria

This paper describes the methods of organization of the data center protection strategy, presented as a network distributed infrastructure, against potential external threats. This work indicates the advantages of using neural network algorithms and deep learning neural network architecture in the specified field. In accordance with the set of quantitative target indicators, mathematical modeling of the evaluation of the effectiveness of the selection of cyber attack software code was carried out. Based on the proposed mathematical apparatus, an evaluation of the protection of the infrastructure of the data center against cyber attacks was carried out. In particular, this article analyses using a neural network architecture such as an autoencoder, a multi-layer autoencoder, a deep belief network, a convolutional neural network, a recurrent neural network, a recursive neural network with the inclusion of algorithms based on a restricted Boltzmann machine and a long-chain scheme of short-term memory. According to a set of factors that correspond to the effectiveness of the application of neural network algorithms in solving the task of organizing a data center infrastructure protection strategy, objective functions were proposed. Besides, the determination of global extrema of these functions provides an opportunity to solve the problem of optimizing the machine code analysis system for the presence of a cyber attack. © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).

关键词： Recurrent neural networks

来源：评论

学校读者我要写书评

暂无评论

Reliable Resource Matching in 6G Computing Power network for Digital Twin Service 24

Reliable Resource Matching in 6G Computing Power Network for...

引用

24th IEEE International Conference on Communication Technology, ICCT 2024

作者： Zhao, Shilong Zhang, Junye Fang, Honglin Tan, Can Yu, Peng Li, Wenjing Beijing University of Posts and Telecommunications Beijing China

ISBN: (纸本)9798350363760

Due to the complex and dynamic service demands involved in digital twin services, traditional resource matching methods often fail to meet the requirements. To address this challenge, we leverage Graph neural networks (GNN) for task graph matching to enhance the deployment of computing resources. The crux of our approach is an adaptive service decomposition process that breaks down complex services into smaller, more manageable subtasks. These subtasks are then strategically deployed across various nodes within the computing power network (CPN), utilizing distributed resources to maximize efficiency. Moreover, we employ dynamic task scheduling based on Service Level Agreement (SLA) constraints to prioritize tasks, thereby enhancing the quality of computing resource matching. The deployment strategy is grounded in GNN-facilitated low-complexity deep graph matching techniques, aimed at effectively integrating network functionalities with service demands. Our proposed service matching method based on GNN considering dynamic task scheduling (DTSM) has been validated through simulation experiments, with results indicating significant improvements in load balance and overall service processing delay. © 2024 IEEE.

关键词： Computing power

来源：评论

学校读者我要写书评

暂无评论

Sharper Convergence Guarantees for Asynchronous SGD for distributed and Federated Learning 36

Sharper Convergence Guarantees for Asynchronous SGD for Dist...

引用

36th Conference on neural Information processing Systems (NeurIPS)

作者： Koloskova, Anastasia Stich, Sebastian U. Jaggi, Martin Ecole Polytech Fed Lausanne Lausanne Switzerland CISPA Helmholtz Ctr Informat Secur Saarbrucken Germany

ISBN: (纸本)9781713871088

We study the asynchronous stochastic gradient descent algorithm for distributed training over n workers which have varying computation and communication frequency over time. In this algorithm, workers compute stochastic gradients in parallel at their own pace and return those to the server without any synchronization. Existing convergence rates for this algorithm for non-convex smooth objectives depend on the maximum gradient delay tau(max) and show that an epsilon-stationary point is reached after O (sigma(2) epsilon(-2) + tau(max) epsilon(-1) ) iterations, where sigma denotes the variance of stochastic gradients. In this work we obtain (i) a tighter convergence rate of O( sigma(2) epsilon(-2) + root tau(max) tau(avg) epsilon(-1) ) without any change in the algorithm, where tau(avg) is the average delay, which can be significantly smaller than tau(max). We also provide (ii) a simple delay-adaptive learning rate scheme, under which asynchronous SGD achieves a convergence rate of O (sigma(2) epsilon(-2) + tau(avg) epsilon(-1) ), and does not require any extra hyperparameter tuning nor extra communications. Our result allows to show for the first time that asynchronous SGD is always faster than mini-batch SGD. In addition, (iii) we consider the case of heterogeneous functions motivated by federated learning applications and improve the convergence rate by proving a weaker dependence on the maximum delay compared to prior works. In particular, we show that the heterogeneity term in convergence rate is only affected by the average delay within each worker.

关键词： Stochastic systems

来源：评论

学校读者我要写书评

暂无评论

Preprocessing Pipeline Optimization for Scientific Deep Learning Workloads 36

Preprocessing Pipeline Optimization for Scientific Deep Lear...

引用

36th IEEE International Parallel and distributed processing Symposium (IEEE IPDPS)

作者： Ibrahim, Khaled Z. Oliker, Leonid Lawrence Berkeley Natl Lab Appl Math & Computat Res Div One Cyclotron Rd Berkeley CA 94720 USA

ISBN: (纸本)9781665481069

Newly developed machine learning technology is promising to profoundly impact high-performance computing, with the potential to significantly accelerate scientific discoveries. However, scientific machine learning performance is often constrained by data movement overheads, particularly on existing and emerging hardware-accelerated systems. In this work, we focus on optimizing the data movement across storage and memory systems, by developing domain-specific data encoder/decoders. These plugins have the dual benefit of significantly reducing communication while enabling efficient decoding on the accelerated hardware. We explore detailed performance analysis for two important scientific learning workloads from cosmology and climate analytics, CosmoFlow and DeepCAM, on the GPU-enabled Summit and Cori supercomputers. Results demonstrate that our optimizations can significantly improve overall performance by up to 10x compared with the default baseline, while preserving convergence behavior. Overall, this methodology can be applied to various machine learning domains and emerging AI technologies.

关键词： Deep neural network Pytorch TensorFlow Preprocessing Compression CosmoFlow DeepCAM

来源：评论

学校读者我要写书评

暂无评论

Laser Phase Noise Mitigation based on Autoencoder for End-to-end Learning of CO-OFDM Systems 6

Laser Phase Noise Mitigation based on Autoencoder for End-to...

引用

6th International Conference on Signal processing and Information Security, ICSPIS 2023

作者： Alnaseri, Omar Al-Saedi, Ibtesam R. K. Al-Asadi, Ahmed Baden-Wuerttemberg Cooperative State University Department of Electronic Engineering Friedrichshafen Germany University of Louisville Department of Electrical and Computer Engineering LouisvilleKY United States University of Technology Department of Communication Engineering Baghdad Iraq University of Technology Baghdad Iraq

ISBN: (纸本)9798350329599

This paper proposes an end-to-end learning approach for coherent optical orthogonal frequency-division multiplexing (CO-OFDM) fiber communication transmission to mitigate laser phase noise. The approach is based on the autoencoder (AE) concept, which is a type of deep neural network designed to learn how to reconstruct input data at its output. In the proposed approach, the encoder component of the autoencoder generates robust symbol sequence representations for incoming data, ensuring resilience to laser phase noise impairments. The proposed approach exhibits impressive tolerance to laser phase noise of low-cost distributed feedback (DFB) lasers (a linewidth of above 1 MHz), making it effective in compensating for the impact of inter-carrier interference (ICI) phase noise within an OFDM symbol. © 2023 IEEE.

关键词： distributed feedback lasers

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：