检索结果-内蒙古大学图书馆

10th IEEE International Conference on High Performance and Smart computing (IEEE HPSC)

作者： Chen, Bo Zhu, Yongxin Guo, Yu Xu, Shiyuan Chinese Acad Sci Shanghai Adv Res Inst Shanghai Peoples R China Shanghai Nucl Engn Res & Design Inst Co LTD Shanghai Peoples R China Univ Chinese Acad Sci Beijing Peoples R China

ISBN: (纸本)9798350389463;9798350389470

This paper proposes one approach with federated learning technique to address practical challenges faced by the emerging green energy industries, i.e., wind turbines in terms of Predictive Health Management (PHM). Not as many federated learning applications being used in the scenarios only for simulation, the application of federated learning in this paper is focused on the real industrial problems with raw data collected from the fields. Huge amount of real data was collected by sensors on more than ten wind turbines across different areas in China and transmitted to the storage for in-time processing. The framework proposed in this paper called TurboFed, can handle the raw data and achieves good prediction performance in the practical wind generated power systems. The framework showed its help on improving the efficiency of the wind turbines. The paper has brought three novel results. First, as far as known, the framework here is the first federated learning framework addressing position adjustment of wind turbines in the real environment. Second, it deploys customized recurrent neural computing models to the wind turbines which are considered the client devices under the federated learning paradigm. Finally, it incorporates new customized aggregation algorithms on the sever side.

关键词： customized computing Edge computing Federated Learning Machine Learning Green Energy Wind Energy

来源：评论

学校读者我要写书评

暂无评论

AutoDSE: Enabling Software Programmers to Design Efficient FPGA Accelerators

引用

ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS 2022年第4期27卷 32-32页

作者： Sohrabizadeh, Atefeh Yu, Cody Hao Gao, Min Cong, Jason Univ Calif Los Angeles Dept Comp Sci Los Angeles CA 90095 USA Falcon Comp Inc 3979 Freedom Circle Suite 530 Santa Clara CA 95054 USA

Adopting FPGA as an accelerator in datacenters is becoming mainstream for customized computing, but the fact that FPGAs are hard to program creates a steep learning curve for software programmers. Even with the help of high-level synthesis (HLS), accelerator designers still have to manually perform code reconstruction and cumbersome parameter tuning to achieve optimal performance. While many learning models have been leveraged by existing work to automate the design of efficient accelerators, the unpredictability of modern HLS tools becomes a major obstacle for them to maintain high accuracy. To address this problem, we propose an automated DSE framework-AutoDSE-that leverages a bottleneck-guided coordinate optimizer to systematically find a better design point. AutoDSE detects the bottleneck of the design in each step and focuses on high-impact parameters to overcome it. The experimental results show that AutoDSE is able to identify the design point that achieves, on the geometric mean, 19.9x speedup over one CPU core for MachSuite and Rodinia benchmarks. Compared to the manually optimized HLS vision kernels in Xilinx Vitis libraries, AutoDSE can reduce their optimization pragmas by 26.38x while achieving similar performance. With less than one optimization pragma per design on average, we are making progress towards democratizing customizable computing by enabling software programmers to design efficient FPGA accelerators.

关键词： Bottleneck optimizer customized computing HLS Merlin Compiler

来源：评论

学校读者我要写书评

暂无评论

AutoDSE: Enabling Software Programmers Design Efficient FPGA Accelerators 21

AutoDSE: Enabling Software Programmers Design Efficient FPGA...

引用

The 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays

作者： Atefeh Sohrabizadeh Cody Hao Yu Min Gao Jason Cong University of California Los Angeles Los Angeles CA USA Falcon Computing Inc. Los Angeles CA USA University of California Los Angeles & Falcon Computing Inc. Los Angeles CA USA

ISBN: (纸本)9781450382182

Adopting FPGA as an accelerator in datacenters is becoming mainstream for customized computing, but the fact that FPGAs are hard to program creates a steep learning curve for software programmers. Even with the help of high-level synthesis (HLS), accelerator designers still must manually perform code reconstruction and cumbersome parameter tuning to achieve the optimal performance. While many learning models have been leveraged by existing work to automate the design of efficient accelerators, the unpredictability of modern HLS tools becomes a major obstacle for them to maintain high accuracy. We address this problem by incorporating an automated DSE framework - AutoDSE - that leverages bottleneck-guided gradient optimizer to systematically find a better design point. AutoDSE finds the bottleneck of the design in each step and focuses on high-impact parameters to overcome that, which is like the approach an expert would take. The experimental results show that AutoDSE is able to find the design point that achieves, on the geometric mean, 19.9x speedup over one CPU core for Machsuite and Rodinia benchmarks and 1.04x over the manually designed HLS accelerated vision kernels in Xilinx Vitis libraries yet with 26x reduction of their optimization pragmas.

关键词： high-level synthesis customized computing design automation

来源：评论

学校读者我要写书评

暂无评论

ERDSE: efficient reinforcement learning based design space exploration method for CNN accelerator on resource limited platform

引用

Graphics and Visual computing 2021年 4卷

作者： Kaijie Feng Xiaoya Fan Jianfeng An Xiping Wang Kaiyue Di Jiangfei Li Minghao Lu Chuxi Li School of Computer Science Engineering and Research Center of Embedded Systems Integration (Ministry of Education) Northwestern Polytechnical University Xi’an 710129 China

Convolutional Neural Network (CNN) accelerator design on resource limited platform faces the challenge of lacking efficient design space exploration (DSE) method because of its huge and irregular design space. Numerous parameters belong to accelerator architecture and dataflow mode jointly construct a huge design space while power and resource constrains make the design space become quite irregular. Under such circumstances, traditional DSE methods based on exhaustive search is infeasible for the non-trivial design space and methods based on general optimization algorithms will also be inefficient because of the irregular distribution of design points. In this paper, we provide an efficient DSE method named ERDSE for CNN accelerator design on resource limited platform. ERDSE is based on reinforcement learning algorithm REINFORCE but refines it to adapt the complex design space. ERDSE implements off-policy strategy to decouple sampling and learning phase, then separately refines them to further improve exploration ability and samples utilization. We implement ERDSE to optimize the computing latency of CNN accelerator for VGG-16 and MobileNet-V3. Under the tightest constraints, ERDSE achieves 1.2x-1.7x (on VGG-16) and 2.3-4.9x (on MobileNet-V3) latency improvement compared with other DSE methods, which demonstrates the efficiency of ERDSE.

关键词： CNN accelerator customized computing Design space exploration Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Software Infrastructure for Enabling FPGA-Based Accelerations in Data Centers 16

Software Infrastructure for Enabling FPGA-Based Acceleration...

引用

21st IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)

作者： Cong, Jason Huang, Muhuan Pan, Peichen Wu, Di Zhang, Peng Falcon Comp Solut Inc Los Angeles CA 90002 USA Univ Calif Los Angeles Dept Comp Sci Los Angeles CA 90024 USA

ISBN: (纸本)9781450341851

This paper focuses on the development of an infrastructure to enable FPGA-based acceleration in data centers. We present an initial version of an integrated solution that includes automated compilation for accelerator generation, runtime accelerator resource scheduling and management, and acceleration libraries for FPGA-based customized computing for big data applications. The solution can help overcome some of the main challenges with FPGA-based accelerated computing. It has the potential to bring significant performance and energy efficiency improvement for data center applications.

关键词： FPGA acceleration customized computing heterogeneous data center

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：