检索结果-内蒙古大学图书馆

DOSA: Organic Compilation for Neural Network Inference on distributed FPGAs 7

DOSA: Organic Compilation for Neural Network Inference on Di...

7th IEEE International Conference on Edge Computing and Communications (IEEE EDGE) / IEEE World Congress on Services (SERVICES)

作者： Ringlein, Burkhard Abel, Francois Diamantopoulos, Dionysios Weiss, Beat Hagleitner, Christoph Fey, Dietmar IBM Res Europe Ruschlikon Switzerland Friedrich Alexander Univ Erlangen Nurnberg Erlangen Germany

ISBN: (纸本)9798350304831

The computational requirements of artificial intelligence workloads are growing exponentially. In addition, more and more compute is moved towards the edge due to latency or localization constraints. At the same time, Dennard scaling has ended and Moore's law is winding down. These trends created an opportunity for specialized accelerators including field-programmable gate arrays (FPGAs), but the poor support and usability of today's tools prevents FPGAs from being deployed at scale for deep neural network (DNN) inference applications. In this work, we propose an organic compiler - DOSA - that drastically lowers the barrier for deploying FPGAs. DOSA builds on the operation set architecture concept and integrates the DNN accelerator components generated by existing DNN-to-FPGA frameworks to produce an overall efficient solution. DOSA starts from DNNs represented in the community standard ONNX and automatically implements model- and data-parallelism, based on the performance targets and resource footprints provided by the user. Deploying a DNN using DOSA on 9 FPGAs exhibits a speedup of up to 52 times compared to a CPU and 18 times compared to a GPU.

关键词： MLSys Reconfigurable hardware Domain-specific architectures Compilers distributed artificial intelligence Design Tools and Techniques

来源：评论

学校读者我要写书评

暂无评论

A practical approach for multiagent manufacturing system based on agent computing nodes (Mar, 10.1177/0954406220908626, 2019)

引用

PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE 2022年第4期236卷 2070-2073页

作者： Zhang, Z. Tang, D. Zhu, H. College of Mechanical and Electrical Engineering Nanjing University of Aeronautics and Astronautics Nanjing China

With the increasing requirement for personalized customization service, discrete manufacturing workshop, as the parts processing unit in manufacturing system, is expected for more agile and fast adaptation to environment changes, dynamically handling production tasks according to resource conditions. Simultaneously, distributed artificial intelligence system (e.g. multiagent manufacturing system and the holonic manufacturing system) has been considered as an important approach for developing industrial applications to solve the problems of complexity, uncertainty, and dynamic in the modern manufacturing environment. But the lack of universality and the difficulty in deployment have restricted the use of distributed artificial intelligence in actual industrial sites. For this issue, a new concept of agent computing node is proposed in this paper to enable the realization of multiagent manufacturing system. Adaptation layer, information development layer, and intelligent analysis layer are investigated for standardizing the configuration mode of agent computing node. Cooperating agent computing node with the radio frequency identification-based dynamic recognition technology for workpiece machining process is presented in this paper, and a practical approach for multiagent manufacturing system is considered, which can apply the functions regarding to deployment of dynamic scheduling and plug-and-play. A laboratory discrete manufacturing workshop system is used as a case study to prove the feasibility of this approach. In addition, a verification in industry is carried out, and the result proves the universality of this approach.

关键词： distributed artificial intelligence multiagent manufacturing system discrete manufacturing workshop radio frequency identification plug-and-play

来源：评论

学校读者我要写书评

暂无评论

artificial Neural Network Modeling for Airline Disruption Management

引用

JOURNAL OF AEROSPACE INFORMATION SYSTEMS 2022年第5期19卷 382-393页

作者： Ogunsina, Kolawole Okolo, Wendy A. Purdue Univ Sch Aeronaut & Astronaut 701 Stadium Avenue W Lafayette IN 47907 USA NASA Ames Res Ctr Intelligent Syst Div Moffett Field CA 94035 USA

Since the 1970s, most airlines have incorporated computerized support for managing disruptions during flight schedule execution. However, existing platforms for airline disruption management (ADM) employ monolithic system design methods that rely on the creation of specific rules and requirements through explicit optimization routines, before a system that meets the specifications is designed. Thus, current platforms for ADM are unable to readily accommodate additional system complexities resulting from the introduction of new capabilities, such as the introduction of unmanned aerial systems, operations, and infrastructure, to the system. To this end, historical data on airline scheduling and operations recovery are used to develop a system of artificial neural networks (ANNs), which describe a predictive transfer function model (PTFM) for promptly estimating the recovery impact of disruption resolutions at separate phases of flight schedule execution during ADM. Furthermore, this paper provides a modular approach for assessing and executing the PTFM by employing a parallel ensemble method to develop generative routines that amalgamate the system of ANNs. Our modular approach ensures that current industry standards for tardiness in flight schedule execution during ADM are satisfied, while accurately estimating appropriate time-based performance metrics for the separate phases of flight schedule execution.

关键词： Neural Networks Unmanned Aircraft System Collaborative Decision Making Air Transportation distributed artificial intelligence False Positive Rate Airline Operations Gaussian Process Federal Aviation Administration Aircraft Operations

来源：评论

学校读者我要写书评

暂无评论

vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training

引用

IEEE TRANSACTIONS ON PARALLEL AND distributed SYSTEMS 2022年第3期33卷 489-506页

作者： Zhao, Shixiong Li, Fanxin Chen, Xusheng Guan, Xiuxian Jiang, Jianyu Huang, Dong Qing, Yuhao Wang, Sen Wang, Peng Zhang, Gong Li, Cheng Luo, Ping Cui, Heming Univ Hong Kong Dept Comp Comp Sci Hong Kong 999077 Peoples R China Huawei Technoloies Co Ltd Theory Lab 2012 Labs Shenzhen 518129 Peoples R China Univ Sci & Technol China Sch Comp Sci & Technol Hefei 230052 Anhui Peoples R China

The increasing computational complexity of DNNs achieved unprecedented successes in various areas such as machine vision and natural language processing (NLP), e.g., the recent advanced Transformer has billions of parameters. However, as large-scale DNNs significantly exceed GPU's physical memory limit, they cannot be trained by conventional methods such as data parallelism. Pipeline parallelism that partitions a large DNN into small subnets and trains them on different GPUs is a plausible solution. Unfortunately, the layer partitioning and memory management in existing pipeline parallel systems are fixed during training, making them easily impeded by out-of-memory errors and the GPU under-utilization. These drawbacks amplify when performing neural architecture search (NAS) such as the evolved Transformer, where different network architectures of Transformer needed to be trained repeatedly. vPipe is the first system that transparently provides dynamic layer partitioning and memory management for pipeline parallelism. vPipe has two unique contributions, including (1) an online algorithm for searching a near-optimal layer partitioning and memory management plan, and (2) a live layer migration protocol for re-balancing the layer distribution across a training pipeline. vPipe improved the training throughput of two notable baselines (Pipedream and GPipe) by 61.4-463.4 percent and 24.8-291.3 percent on various large DNNs and training settings.

关键词： Pipelines Training Graphics processing units Throughput Memory management Parallel processing Tensors Machine learning distributed systems distributed artificial intelligence pipeline parallel systems memory management

来源：评论

学校读者我要写书评

暂无评论

Model-Agnostic Federated Learning for Privacy-Preserving Systems

Model-Agnostic Federated Learning for Privacy-Preserving Sys...

引用

IEEE Secure Development Conference (SecDev)

作者： Almohri, Hussain M. J. Watson, Layne T. Kuwait Univ Dept Comp Sci Kuwait Kuwait Virginia Polytech Inst & State Univ Dept Comp Sci Math & Aerosp Blacksburg VA 24061 USA Virginia Polytech Inst State Univ Dept Ocean Engn Blacksburg VA 24061 USA

ISBN: (纸本)9798350331325

This study presents an innovative aggregation scheme for model-agnostic, local, heterogeneous data models within the domain of Federated Learning. The proposed approach imposes minimal constraints on local models, only necessitating local model parameters and distances from local data centroids for a particular query. These requirements facilitate the design of privacy-preserving learning systems. We introduce a system architecture based on federated interpolation to operationalize the proposed scheme. The accuracy of our proposed scheme is evaluated using two distinct real-world datasets. We compare our results to the extreme case of a single-client scenario having complete access to all data points. Our findings indicate that, on average, federated interpolation maintains robust accuracy, experiencing a slight reduction of less than 10% compared to the single-client model with full data access.

关键词： distributed artificial intelligence Security and Privacy Protection Interpolation

来源：评论

学校读者我要写书评

暂无评论

Wireless Federated Learning (WFL) for 6G Networks-Part I: Research Challenges and Future Trends

引用

IEEE COMMUNICATIONS LETTERS 2022年第1期26卷 3-7页

作者： Bouzinis, Pavlos S. Diamantoulakis, Panagiotis D. Karagiannidis, George K. Aristotle Univ Thessaloniki Wireless Commun & Informat Proc Grp WCIP Dept Elect & Comp Engn Thessaloniki 54124 Greece

Conventional machine learning techniques are conducted in a centralized manner. Recently, the massive volume of generated wireless data, the privacy concerns and the increasing computing capabilities of wireless end-devices have led to the emergence of a promising decentralized solution, termed as Wireless Federated Learning (WFL). In this first of the two parts letter, we present the application of WFL in the sixth generation of wireless networks (6G), which is envisioned to be an integrated communication and computing platform. After analyzing the key concepts of WFL, we discuss the core challenges of WFL imposed by the wireless (or mobile communication) environment. Finally, we shed light to the future directions of WFL, aiming to compose a constructive integration of FL into the future wireless networks.

关键词： Wireless federated learning distributed artificial intelligence 6G networks

来源：评论

学校读者我要写书评

暂无评论

From intelligent agents to trustworthy human-centred multiagent systems

引用

AI COMMUNICATIONS 2022年第4期35卷 443-457页

作者： Soorati, Mohammad Divband Gerding, Enrico H. Marchioni, Enrico Naumov, Pavel Norman, Timothy J. Ramchurn, Sarvapali D. Rastegari, Bahar Sobey, Adam Stein, Sebastian Tarpore, Danesh Yazdanpanah, Vahid Zhang, Jie Univ Southampton Sch Elect & Comp Sci Agents Interact & Complex Res Grp Southampton Hants England

The Agents, Interaction and Complexity research group at the University of Southampton has a long track record of research in multiagent systems (MAS). We have made substantial scientific contributions across learning in MAS, game-theoretic techniques for coordinating agent systems, and formal methods for representation and reasoning. We highlight key results achieved by the group and elaborate on recent work and open research challenges in developing trustworthy autonomous systems and deploying human-centred AI systems that aim to support societal good.

关键词： Multiagent systems intelligent agents distributed artificial intelligence trustworthy autonomous systems

来源：评论

学校读者我要写书评

暂无评论

A Dynamic MAS to Manage a Daily Planning for a Team Operating Theater 2nd

A Dynamic MAS to Manage a Daily Planning for a Team Operatin...

引用

3rd International Conference on Digital Technologies and Applications

作者： Soualfi, Oumaima Hajji Soualfi, Abderrahim Hajji Chmali, Khalid Elmrini, Abdelmajid Elbarkany, Abdellah Harras, Bilal Sidi Mohamed Ben Abdellah Univ Fac Sci & Tech Mech Engn Lab Fes Morocco Moulay Ismail Univ Fac Sci & Tech Dept Sci Comp Errachidia Morocco Sidi Mohamed Ben Abdellah Univ Hassan II Univ Hosp Fac Med & Pharm Fez Dept Orthoped Surg B4 Fes Morocco

ISBN: (纸本)9783031298561;9783031298578

Surgical planning is a preponderant step in the management of operating theaters, which becomes more and more solicited;considering the multiplicity and the complexity of the human and material components, which intervene there;in order to face the various disturbances hindering the normal course of the surgical activity. It is a subject widely discussed in the literature with the realization of several solutions and applications, but which remain globally incompatible with the realities of the surgical process. For this reason, we propose a daily surgical planning realized with a multi-agent system (MAS) based on distributed artificial intelligence (DAI). We describe some basic architectural entities of MAS in relation with the surgical planning, before presenting their application on a real case of the orthopedic surgery department B4 of the CHU Hassan II of Fez-Morocco. The objective of this work is to elaborate a daily, dynamic and real time surgical program answering the various possible and frequent disturbances altering the process of the operating theater.

关键词： multi-agent system distributed artificial intelligence operating theater planning

来源：评论

学校读者我要写书评

暂无评论

Efficient Adaptive Federated Learning in Resource-Constrained IoT Environments

Efficient Adaptive Federated Learning in Resource-Constraine...

引用

IEEE Conference on Global Communications (IEEE GLOBECOM) - Intelligent Communications for Shared Prosperity

作者： Chen, Zunming Cui, Hongyan Luan, Qiuji Xi, Yu Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing 100876 Peoples R China

ISBN: (纸本)9798350310900

Federated Learning (FL) has emerged as a privacy-preserving distributed learning framework which enables IoT devices to collaboratively train machine learning models via sharing model parameters. However, inefficiency due to frequent parameters transmissions significantly reduces FL performance. Existing acceleration algorithms for speeding up FL training consist of two main types including local update and parameter compression which consider the trade-offs between communication and computation/precision respectively. Jointly considering these two trade-offs and adaptively balancing their impacts on convergence have remained unresolved. To solve the problem, we propose an efficient adaptive federated optimization (EAFO) algorithm to improve the efficiency of FL in resource-constrained IoT environments, which minimizes the learning error by the joint consideration of two variables consisting of the local update and parameter compression. The EAFO enables FL to adaptively adjust two variables and balance trade-offs among computation, communication, and precision. The experiment results illustrate the high effectiveness of the proposed EAFO algorithm, which can achieve higher accuracies faster compared with the state-of-the-art algorithms.

关键词： IoT Federated Learning distributed artificial intelligence Communication Trade-offs

来源：评论

学校读者我要写书评

暂无评论

CLONE: Collaborative Learning on the Edges

引用

IEEE INTERNET OF THINGS JOURNAL 2021年第13期8卷 10222-10236页

作者： Lu, Sidi Yao, Yongtao Shi, Weisong Wayne State Univ Dept Comp Sci Detroit MI 48202 USA

The proliferation of edge computing technologies has boosted the development of new applications for a plethora of edge devices. However, many applications face privacy issues and bandwidth limitations. To solve these limitations, we propose a collaborative learning framework on the edges, named CLONE, which is steered by the real-world data sets collected from a large electric vehicle (EV) company and a grocery store of a shopping mall, respectively. We categorize two application scenarios for CLONE, i.e., CLONE in the training stage (CLONE_training) and CLONE in the inference stage (CLONE_inference). As to CLONE_training, we choose the failure prediction of EV battery and associated components as the first use case. While as for CLONE_inference, customer tracking in a grocery store is selected as another case study. In this work, the goal of the CLONE is to support real-time training and inference for connected vehicles and marketing intelligence services. Our experimental results on the EV data show that CLONE is able to reduce model training time without sacrificing algorithm performance. Furthermore, the experimental results on the video data from the grocery store reveal that CLONE is a useful approach to solve the multitarget multicamera tracking problem in a collaborative fashion.

关键词： Cloning Training Batteries Cloud computing Hardware Collaboration Measurement Collaborative learning distributed artificial intelligence edge computing electric vehicle (EV) battery failure prediction multitarget multicamera tracking (MTMCT)

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：