检索结果-内蒙古大学图书馆

IEEE International Conference on computer-Aided Design

作者： Ying Wang Mengdi Wang Bing Li Huawei Li Xiaowei Li State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences Beijing P.R. China Capital Normal University Beijing P.R. China

ISBN: (数字)9781665423243

Deep Reinforcement Learning (DRL) is substantially resource-consuming, and it requires large-scale distributed computing-nodes to learn complicated tasks, like video-game and Go play. This work attempts to down-scale a distributed DRL system into a specialized many-core chip and achieve energy-efficient on-chip DRL. With the customized Network-on-Chip that handles the communication of on-chip data and control-signals, we proposed a Synchronous Asynchronous RL architecture (SARLA) and the according many-core chip that completely avoids the unnecessary data duplication and synchronization activities in multi-node RL systems. In evaluation, the SARLA system achieves considerable energy-efficiency boost over the GPU-based implementations for typical DRL workloads built with OpenAI-gym.

关键词： system-on-chip computer architecture Reinforcement learning Neural networks Servers Parallel processing Training

来源：评论

学校读者我要写书评

暂无评论

ExploreBP: A Simulation Tool for Mobile Browser Energy Optimization 11th

ExploreBP: A Simulation Tool for Mobile Browser Energy Optim...

引用

11th EAI International Conference on Simulation Tools and Techniques, SIMUTools 2019

作者： Zhang, Jin Wei, Xin Liu, Zhen Liu, Fangxin Li, Tao Lu, Tingjuan Gong, Xiaoli Nankai University Tianjin China IT Department Chinese PLA 117 Hospital Hangzhou China State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences Beijing China

ISBN: (纸本)9783030322151

The browser is one of the most commonly used applications. Users tend to pursue a good user experience and care more about the performance of the browser, while ignoring the power consumption of the browser. This paper proposes a method to reduce the energy consumption of web browsing. In order to better quantify the user experience, this paper uses the first screen load time as the evaluation metric of user experience. First, according to the relationship between the network speed and the first screen load time, find the most suitable primary frequency at a specific network speed, and define the point as the balance point. When the primary frequency is greater than the primary frequency corresponding to the balance point, the first screen load time will almost never change. The balance points of different web pages are also different. Then adjust the CPU frequency according to the balance point of the webpage and the network speed, which can reduce the browser energy consumption and reduce the impact on the user experience. At the same time, this paper proposes a simulation tool ExploreBP, which is used to simulate the working state of the network speed and different web pages to find the optimal energy consumption configuration. © ICST institute for computer Sciences, Social Informatics and Telecommunications Engineering 2019.

关键词： Frequency modulation

来源：评论

学校读者我要写书评

暂无评论

SCIENTISTS RECONSIDER LOW-ENERGY NUCLEAR REACTIONS It's absolutely, definitely, seriously not cold fusion

引用

IEEE SPECTRUM 2018年第12期55卷 10-11页

作者： Koziol, Michael State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences

It's been a big year for low-energy nuclear reactions. LENRs, as they're known, are a fringe research topic that some physicists think could explain the results of an infamous experiment nearly 30 years ago that formed the basis for the idea of cold fusion. That idea didn't hold up, and only a handful of researchers around the world have continued trying to understand the mysterious nature of the inconsistent, heat-generating reactions that had spurred those claims.

关键词：

来源：评论

学校读者我要写书评

暂无评论

AI-oriented medical workload allocation for hierarchical cloud/edge/device computing

arXiv

引用

arXiv 2020年

作者： Hao, Tianshu Zhan, Jianfeng Hwang, Kai Gao, Wanling Wen, Xu State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences Chinese University of Hong Kong Shenzhen China University of Chinese Academy of Sciences Shenzhen Institute of Artificial Intelligence and Robotics for Society

In a hierarchically-structured cloud/edge/device computing environment, workload allocation can greatly affect the overall system performance. This paper deals with AI-oriented medical workload generated in emergency rooms (ER) or intensive care units (ICU) in metropolitan areas. The goal is to optimize AI-workload allocation to cloud clusters, edge servers, and end devices so that minimum response time can be achieved in life-saving emergency applications. In particular, we developed a new workload allocation method for the AI workload in distributed cloud/edge/device computing systems. An efficient scheduling and allocation strategy is developed in order to reduce the overall response time to satisfy multi-patient demands. We apply several ICU AI workloads from a comprehensive edge computing benchmark Edge AIBench. The healthcare AI applications involved are short-of-breath alerts, patient phenotype classification, and life-death threats. Our experimental results demonstrate the high efficiency and effectiveness in real-life health-care and emergency applications. Copyright © 2020, The Authors. All rights reserved.

关键词： Edge computing

来源：评论

学校读者我要写书评

暂无评论

Soft Error Mitigation for Deep Convolution Neural Network on FPGA Accelerators

Soft Error Mitigation for Deep Convolution Neural Network on...

引用

IEEE International Conference on Artificial Intelligence Circuits and systems (AICAS)

作者： Wenshuo Li Guangjun Ge Kaiyuan Guo Xiaoming Chen Qi Wei Zhen Gao Yu Wang Huazhong Yang Department of Electronic Engineering BNRist Tsinghua University Beijing China State Key Laboratory of Computer Architecture Institute of Computing Technology CAS Beijing China Tianjin International Engineering Institute Tianjin University Tianjin China

ISBN: (数字)9781728149226

ISBN: (纸本)9781728149233

Convolution neural networks (CNNs) have been widely used in many applications. Field-Programmable Gate Array (FPGA) based accelerator is an ideal solution for CNNs in embedded systems. However, the single event upset (SEU) effect in FPGA device may have a significant influence on the performance of CNNs. In this paper, we analyze the sensibility of CNNs to SEU and present a fault-tolerant design for CNN accelerators. First, we find that SEU in processing elements (PEs) has the worst effects on CNNs since it produces proportional errors and will not get refreshed. Furthermore, it is indicated that the large positive perturbation contributes almost all of the performance loss. Based on such observations, we propose an error detecting scheme to locate incorrect PEs and give an error masking method to achieve fault-tolerance. Experiments demonstrate that the proposed method achieves similar fault-tolerant performance with the triple modular redundancy (TMR) scheme while the overhead is much lower than it.

关键词： Table lookup Adders Field programmable gate arrays Single event upsets Neural networks Fault tolerance

来源：评论

学校读者我要写书评

暂无评论

An Enhanced Handover Scheme for Cellular-Connected UAVs

An Enhanced Handover Scheme for Cellular-Connected UAVs

引用

IEEE International Conference on Communications in China (ICCC)

作者： Wenbin Dong Xinhong Mao Ronghui Hou Xixiang Lv Hui Li School of Cyber Engineering Xidian University P. R. China State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences Institute of Telecommunication Satellite China Academy of Space Technology Beijing China

ISBN: (数字)9781728173276

ISBN: (纸本)9781728173283

In this paper, we propose an enhanced handover scheme for cellular-connected UAVs. Specifically, our handover scheme considers the following characteristics: 1) UAV can detect multiple cells with the comparable RSRP levels which may cause many unnecessary handovers. The handover event trigger parameters in our scheme are dynamically adjusted to avoid a UAV to handover from a cell to another cell with the comparable RSRP level; 2)In the process of taking off, the UAV would fly through the null space of antenna lobes many times, while the time duration is normally very short. The RSRP during the UAV taking off varies quickly, so that the measurement reports may not provide an accurate channel information for the UAV. In this case, when the link quality between the UAV and the BS is below a threshold, the BS allows the link being maintained for a while with the hope that the link quality would get better again. We implement our proposed handover scheme on the NS3 platform, and compare with the current LTE handover scheme and the sojourn time estimation-based handover algorithm. Our simulation results demonstrate that our proposed scheme can significantly reduce the number of unnecessary handovers. Moreover, the network throughput of our scheme is improved, since the the communication resources taken by the unnecessary handovers is utilized by the UAV for transmitting data.

关键词： Navigation Simulation Null space Handover Throughput Long Term Evolution Antennas

来源：评论

学校读者我要写书评

暂无评论

Beam Management for Cellular-Connected UAVs: A Fast Link Recovery Approach

Beam Management for Cellular-Connected UAVs: A Fast Link Rec...

引用

IEEE International Conference on Communications in China (ICCC)

作者： Jinli Wu Xinhong Mao Ronghui Hou Xixiang Lv Hui Li School of Cyber Engineering Xidian University P. R. China State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences Institute of Telecommunication Satellite China Academy of Space Technology Beijing China

ISBN: (数字)9781728173276

ISBN: (纸本)9781728173283

Due to the short wavelength of millimeter wave (mmWave) and high directional beamforming, the massive MIMO systems are highly vulnerable to link blockage. Beam switching to unblocked direction is an effective solution to overcome blockage and restore communication links. To this end, a set of candidate beams for beam switching should be selected before the beam is blocked. However, due to the high speed movement of the UAV, identifying the appropriate beam for an UAV with any position is not trivial. In this work, a fast link recovery approach is proposed. Specifically, our proposed beam selection method considers the spatial correlation, estimated reliability probability of the beams and signal quality. The simulation results show that the proposed method can efficiently recover the interrupted link, and the outage probability is almost reduced to 0% in the scene where the UAV moves at high speed.

关键词： Simulation Switches Reinforcement learning Probability Power system reliability Reliability Millimeter wave communication

来源：评论

学校读者我要写书评

暂无评论

Communication Lower Bound in Convolution Accelerators

Communication Lower Bound in Convolution Accelerators

引用

IEEE Symposium on High-Performance computer architecture

作者： Xiaoming Chen Yinhe Han Yu Wang Center for Intelligent Computing Systems State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences Beijing China University of Chinese Academy of Sciences Beijing China Department of Electronic Engineering Tsinghua University Beijing China

ISBN: (数字)9781728161495

ISBN: (纸本)9781728161501

In current convolutional neural network (CNN) accelerators, communication (i.e., memory access) dominates the energy consumption. This work provides comprehensive analysis and methodologies to minimize the communication for CNN accelerators. For the off-chip communication, we derive the theoretical lower bound for any convolutional layer and propose a dataflow to reach the lower bound. This fundamental problem has never been solved by prior studies. The on-chip communication is minimized based on an elaborate workload and storage mapping scheme. We in addition design a communication-optimal CNN accelerator architecture. Evaluations based on the 65nm technology demonstrate that the proposed architecture nearly reaches the theoretical minimum communication in a three-level memory hierarchy and it is computation dominant. The gap between the energy efficiency of our accelerator and the theoretical best value is only 37-87%.

关键词： system-on-chip Convolution Random access memory Convolutional codes Memory management Microsoft Windows Kernel

来源：评论

学校读者我要写书评

暂无评论

OpenClinicalAI: enabling AI to diagnose diseases in real-world clinical settings

arXiv

引用

arXiv 2021年

作者： Huang, Yunyou Wang, Nana Tang, Suqin Ma, Li Hao, Tianshu Jiang, Zihan Zhang, Fan Kang, Guoxin Miao, Xiuxia Guan, Xianglong Zhang, Ruchang Zhang, Zhifei Zhan, Jianfeng Guangxi Key Lab of Multi-Source Information Mining & Security School of Computer Science and Engineering School of Software Guangxi Normal University Guilin China State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences Beijing China Guilin Medical University Guilin China Department of Physiology and Pathophysiology Capital Medical University Beijing China University of Chinese Academy of Sciences China International Open Benchmark Council

This paper quantitatively reveals the state-of-the-art and state-of-the-practice AI systems only achieve acceptable performance on the stringent conditions that all categories of subjects are known, which we call closed clinical settings, but fail to work in real-world clinical settings. Compared to the diagnosis task in the closed setting, real-world clinical settings pose severe challenges, and we must treat them differently. We build a clinical AI benchmark named Clinical AIBench to set up real-world clinical settings to facilitate researches. We propose an open, dynamic machine learning framework and develop an AI system named OpenClinicalAI to diagnose diseases in real-world clinical settings. The first versions of Clinical AIBench and OpenClinicalAI target Alzheimer’s disease. In the real-world clinical setting, OpenClinicalAI significantly outperforms the state-of-the-art AI system. In addition, OpenClinicalAI develops personalized diagnosis strategies to avoid unnecessary testing and seamlessly collaborates with clinicians. It is promising to be embedded in the current medical systems to improve medical services. © 2021, CC BY.

关键词： Diagnosis

来源：评论

学校读者我要写书评

暂无评论

To talk or to work: Flexible communication compression for energy efficient federated learning over heterogeneous mobile edge devices

arXiv

引用

arXiv 2020年

作者： Li, Liang Shi, Dian Hou, Ronghui Li, Hui Pan, Miao Han, Zhu School of Cyber Engineering Xidian University Xi’an China State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences China Department of Electrical and Computer Engineering University of Houston HoustonTX United States

Recent advances in machine learning, wireless communication, and mobile hardware technologies promisingly enable federated learning (FL) over massive mobile edge devices, which opens new horizons for numerous intelligent mobile applications. Despite the potential benefits, FL imposes huge communication and computation burdens on participating devices due to periodical global synchronization and continuous local training, raising great challenges to battery constrained mobile devices. In this work, we target at improving the energy efficiency of FL over mobile edge networks to accommodate heterogeneous participating devices without sacrificing the learning performance. To this end, we develop a convergence-guaranteed FL algorithm enabling flexible communication compression. Guided by the derived convergence bound, we design a compression control scheme to balance the energy consumption of local computing (i.e., "working") and wireless communication (i.e., "talking") from the long-term learning perspective. In particular, the compression parameters are elaborately chosen for FL participants adapting to their computing and communication environments. Extensive simulations are conducted using various datasets to validate our theoretical analysis, and the results also demonstrate the efficacy of the proposed scheme in energy saving. © 2020, CC BY.

关键词： Wireless networks

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：