检索结果-内蒙古大学图书馆

A benchmark of ocular disease intelligent recognition: One shot for Multi-disease detection

学校读者我要写书评

暂无评论

arXiv 2021年

作者： Li, Ning Li, Tao Hu, Chunyu Wang, Kai Kang, Hong College of Computer Science Nankai university Tianjin300350 China State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Science Beijing100190 China Beijing Shanggong Medical Technology Co. Ltd Beijing100176 China

In ophthalmology, early fundus screening is an economic and effective way to prevent blindness caused by ophthalmic diseases. Clinically, due to the lack of medical resources, manual diagnosis is time-consuming and may delay the condition. With the development of deep learning, some researches on ophthalmic diseases have achieved good results, however, most of them are just based on one disease. During fundus screening, ophthalmologists usually give diagnoses of multi-disease on binocular fundus image, so we release a dataset with 8 diseases to meet the real medical scene, which contains 10,000 fundus images from both eyes of 5,000 patients. We did some benchmark experiments on it through some state-of-the-art deep neural networks. We found simply increasing the scale of network cannot bring good results for multi-disease classification, and a well-structured feature fusion method combines characteristics of multi-disease is needed. Through this work, we hope to advance the research of related fields. Copyright © 2021, The Authors. All rights reserved.

关键词： computer aided diagnosis

Pinpointing the Memory Behaviors of DNN Training

学校读者我要写书评

暂无评论

Pinpointing the Memory Behaviors of DNN Training

IEEE International Symposium on Performance Analysis of systems and Software

作者： Jiansong Li Xiao Dong Guangli Li Peng Zhao Xueying Wang Xiaobing Chen Xianzhi Yu Yongxin Yang Zihan Jiang Wei Cao Lei Liu Xiaobing Feng University of Chinese Academy of Sciences Beijing China Youtu Lab Tencent Shanghai China Huawei Technology Co. Ltd Beijing China State Key Laboratory of Computer Architecture Institute of Computing Technology CAS Beijing China

The training of deep neural networks (DNNs) is usually memory-hungry due to the limited device memory capacity of DNN accelerators. Characterizing the memory behaviors of DNN training is critical to optimize the device memory pressures. In this work, we pinpoint the memory behaviors of each device memory block of GPU during training by instrumenting the memory allocators of the runtime system. Our results show that the memory access patterns of device memory blocks are stable and follow an iterative fashion. These observations are useful for the future optimization of memory-efficient training from the perspective of raw memory access patterns.

关键词： Training Runtime Instruments Neural networks Graphics processing units Software Performance analysis

Pinpointing the memory behaviors of DNN training

学校读者我要写书评

暂无评论

arXiv 2021年

作者： Li, Jiansong Dong, Xiao Li, Guangli Zhao, Peng Wang, Xueying Chen, Xiaobing Yu, Xianzhi Yang, Yongxin Jiang, Zihan Cao, Wei Liu, Lei Feng, Xiaobing State Key Laboratory of Computer Architecture Institute of Computing Technology CAS Beijing China University of Chinese Academy of Sciences Beijing China Youtu Lab Tencent Shanghai China Huawei Technology Co. Ltd Beijing China

关键词： Deep neural networks

Understanding the Runtime Overheads of Deep Learning Inference on Edge Devices

学校读者我要写书评

暂无评论

Understanding the Runtime Overheads of Deep Learning Inferen...

IEEE International Conference on Big Data and Cloud computing (BdCloud)

作者： Xiu Ma Guangli Li Lei Liu Huaxiao Liu Xiaobing Feng College of Computer Science and Technology Jilin University China Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education Jilin University China SKL of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences China University of Chinese Academy of Sciences China

With the growing ubiquity of the Internet of Things, in-the-edge inference of deep neural network models has been a major driver for promoting the widespread use of intelligent applications. As model inference characteristics are crucial for optimizing and deploying deep neural networks on hardware platforms, many studies focus on analyzing the performance of neural networks such as latency, accuracy, throughput, and energy consumption. However, few existing works have ever discussed the runtime overheads hidden in neural network inference, despite the overheads are non-negligible for edge applications. The lack of in-depth analysis of the overheads hinders the understanding of how hardware designs and model structures impact on-device inference performance. In this paper, we characterize the runtime overheads of deep learning inference on representative edge devices by leveraging state-of-the-art neural network models, performing a systematical analysis from the perspective of end-to-end performance, hardware platforms, memory bandwidth, and neural network model structures. Based on experimental results, the crucial insights are offered to facilitate the design and configure of resource-efficient networks and pick appropriate models on the specific platform, which provides a comprehensive view of runtime overheads of in-the- edge neural network inference for architects and developers.

关键词： Deep learning Performance evaluation Analytical models Runtime Roads Neural networks Throughput

Prediction of Register Instance Usage and Time-sharing Register for Extended Register Reuse Scheme

学校读者我要写书评

暂无评论

Prediction of Register Instance Usage and Time-sharing Regis...

Asia and South Pacific Design Automation Conference

作者： Shuxin Zhou Huandong Wang Dong Tong State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences Beijing China University of Chinese Academy of Sciences Beijing China Loongson Corporation Beijing China Peking University Beijing China

ISBN: (数字)9781450379991

ISBN: (纸本)9781728180571

Register renaming is the key for the performance of out-of-order processors. However, the release mechanism of the physical register may cause a waste from time dimension. The register reuse technique is the earliest solution to release a physical register at renaming stage, which takes the advantage of those register instances with only one time use. However, the range of possible reuse mined by this scheme is not high, and the physical structure of the register have to be modified. Aiming at these two problems, we propose an extended register reuse scheme. Our work presents: 1) prediction of the use times of the register instance, so as to reuse the physical registers at the end of the last use, to expand the range of possible reuse. 2) A design of time-sharing register file with little overheads which is implemented by Backup Registers, avoiding to modify the physical register structure. Compared with the original register reuse technique, this work achieves 8.5% performance improvement, alternatively, 9.6% decrease of the number of physical registers with minor hardware overhead.

关键词： Out of order Program processors Design automation Asia Hardware Registers

A Survey on Indoor Visible Light Positioning systems: Fundamentals, Applications, and Challenges

学校读者我要写书评

暂无评论

arXiv 2024年

作者： Zhu, Zhiyu Yang, Yang Chen, Mingzhe Guo, Caili Cheng, Julian Cui, Shuguang The Beijing Key Laboratory of Network System Architecture and Convergence School of Information and Communication Engineering Beijing University of Posts and Telecommunications Beijing100876 China The Department of Electrical and Computer Engineering Institute for Data Science and Computing University of Miami Coral GablesFL33146 United States The Beijing Laboratory of Advanced Information Networks School of Information and Communication Engineering Beijing University of Posts and Telecommunications Beijing100876 China The Faculty of Applied Science School of Engineering The University of British Columbia KelownaBCV1V 1V7 Canada The Chinese University of Hong Kong Shenzhen518172 China

The growing demand for location-based services in areas like virtual reality, robot control, and navigation has intensified the focus on indoor localization. Visible light positioning (VLP), leveraging visible light communications (VLC), becomes a promising indoor positioning technology due to its high accuracy and low cost. This paper provides a comprehensive survey of VLP systems. In particular, since VLC lays the foundation for VLP, we first present a detailed overview of the principles of VLC. Then, we provide an in-depth overview of VLP algorithms. The performance of each positioning algorithm is also compared in terms of various metrics such as accuracy, coverage, and orientation limitation. Beyond the physical layer studies, the network design for a VLP system is also investigated, including multi-access technologies, resource allocation, and light-emitting diode (LED) placements. Next, the applications of the VLP systems are overviewed. Finally, this paper outlines open issues, challenges, and opportunities for the research field. In a nutshell, this paper constitutes the first holistic survey on VLP from state-of-the-art studies to practical uses. Copyright © 2024, The Authors. All rights reserved.

关键词： Resource allocation

Modeling and analysis of three properties of mobile interactive systems based on variable Petri nets

学校读者我要写书评

暂无评论

arXiv 2021年

作者： Yang, Ru Ding, Zhijun Jiang, Changjun Zhou, MengChu The Key Laboratory of Embedded System and Service Computing Ministry of Education Tongji University The Department of Computer Science and Technology Tongji University Shanghai201804 China The Institute of Systems Engineering Macau University of Science and Technology 999078 China The Department of Electrical and Computer Engineering New Jersey Institute of Technology NewarkNJ07102 United States

Due to the mobility and frequent disconnections, the correctness of mobile interaction systems, such as mobile robot systems and mobile payment systems, are often difficult to analyze. This paper introduces three critical properties of systems, called system connectivity, interaction soundness and data validity, and presents a related modeling and analysis method, based on a kind of Petri nets called VPN. For a given system, a model including component nets and interaction structure nets is constructed by using VPNs. The component net describes the internal process of each component, while the interaction structure net reflects the dynamic interaction between components. Based on this model, three properties are defined and analyzed. The case study of a practical mobile payment system shows the effectiveness of the proposed method. Copyright © 2021, The Authors. All rights reserved.

关键词： Petri nets

Dental Detection and Classification of YOLOv3-SPP based on Convolutional Block Attention Module

学校读者我要写书评

暂无评论

Dental Detection and Classification of YOLOv3-SPP based on C...

International Conference on computer and Communications (ICCC)

作者： Ning Li Ruifeng Guo Xiaozhou Liu Lin Wu Hongliang Wang Shenyang Institute of Computing Technology Chinese Academy of Sciences Shenyang China University of Chinese Academy of Sciences Shenyang China Liaoning Provience Human-Computer Interaction System Engineering Research Center Based on Digital Twin Shenyang China Liaoning Provincial Key Laboratory of Oral Diseases School and Hospital of Stomatology China Medical University Shenyang China

The purpose of this research paper is to implement the tooth target detection task by deep convolutional neural networks. In order to solve the problems of low accuracy of target detection due to the high similarity between teeth and complex tooth textures, an improved tooth detection method with YOLOv3-SPP model is proposed in this paper. This method incorporates the convolutional block attention module (CBAM) in the YOLOv3-SPP algorithm framework, and improves the performance of the network features by adding the channel attention mechanism and spatial attention mechanism to the feature extraction network to enhance the saliency of the tooth target region in the image. Secondly, CIoU border regression loss is introduced to improve the localization accuracy. In addition, a Non-maximum suppression (NMS) method is used to solve the candidate frame overlap problem. Through experiments, it is shown that the mAP of the modified YOLOv3-SPP target detection model is improved to 86.8 percent, which indicates that the improved model can be applied to the detection of teeth.

关键词： Location awareness Training Teeth Object detection Feature extraction Dentistry Classification algorithms

RCGA-Net: An Improved Multi-hybrid Attention Mechanism Network in Biomedical Image Segmentation

学校读者我要写书评

暂无评论

RCGA-Net: An Improved Multi-hybrid Attention Mechanism Netwo...

2021 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2021

作者： Xiao, Feng Shen, Cong Chen, Yu Yang, Tian Chen, Shengyong Liao, Zhijun Tang, Jijun Tianjin University of Technology School of Computer Science and Engineering Tianjin China Engineering Research Center of Learning-Based Intelligent System Ministry of Education Tianjin China School of Basic Medical Sciences Fujian Medical University Fujian Fuzhou China College of Intelligence and Computing Tianjin University Tianjin China Key Laboratory of Systems Bioengineering Ministry of Education Tianjin China Shenzhen Institute of Advanced Technology Chinese Academy of Sciences Guangdong Shenzhen China Department of Computer Science and Engineering University of South Carolina ColumbiaSC United States

ISBN: (纸本)9781665401265

Drawing support from an effective Medical Image Segmentation (MIS) is conducive to a substantial diagnostic basis for the physicians to identify the focus lesion in the patient body and give the subsequent clinical assessment of the patient status. Although various works have tried the challenging quantitative analysis problem, it is still difficult to conduct precise automatic segmentation, especially the soft tissue organs. In this decade, with the increased amount of available datasets, deep learning-based networks have achieved remarkable performance in image processing. Inspired by the state-of-the-art deep learning works, in this paper, we propose an end-to-end multi-layer network named RCGA-Net. It consists of an encoder-decoder backbone that integrates a coordinate attention mechanism based on space and channel and a global context extraction module to highlight more valuable information. To evaluate the performance of RCGA-Net, we apply it to different kinds of clinical and experimental MIS tasks to testify its generalization ability. Extensive experiments represent that our schema has taken the outperform or compatible results among the comparison methods group. Specifically, the numeric result of RCGA-Net on the pulmonary dataset has achieved a 99.12% optimum F1-score. © 2021 IEEE.

关键词： Diagnosis