Search Results - Inner Mongolia University Library

arXiv, 2023

Authors: Gong, Cheng; Lu, Ye; Dai, Surong; Qian, Deng; Du, Chenkun; Li, Tao. Affiliations: College of Software, Nankai University, Tianjin 300350, China; College of Computer Science, Nankai University, Tianjin 300350, China; State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China

Exploring the expected quantizing scheme with suitable mixed-precision policy is the key point to compress deep neural networks (DNNs) in high efficiency and accuracy. This exploration implies heavy workloads for domain experts, and an automatic compression method is needed. However, the huge search space of the automatic method introduces plenty of computing budgets that make the automatic process challenging to be applied in real scenarios. In this paper, we propose an end-to-end framework named AutoQNN, for automatically quantizing different layers utilizing different schemes and bitwidths without any human labor. AutoQNN can seek desirable quantizing schemes and mixed-precision policies for mainstream DNN models efficiently by involving three techniques: quantizing scheme search (QSS), quantizing precision learning (QPL), and quantized architecture generation (QAG). QSS introduces five quantizing schemes and defines three new schemes as a candidate set for scheme search, and then uses the differentiable neural architecture search (DNAS) algorithm to seek the layer- or model-desired scheme from the set. QPL is the first method to learn mixed-precision policies by reparameterizing the bitwidths of quantizing schemes, to the best of our knowledge. QPL optimizes both classification loss and precision loss of DNNs efficiently and obtains the relatively optimal mixed-precision model within limited model size and memory footprint. QAG is designed to convert arbitrary architectures into corresponding quantized ones without manual intervention, to facilitate end-to-end neural network quantization. We have implemented AutoQNN and integrated it into Keras. Extensive experiments demonstrate that AutoQNN can consistently outperform state-of-the-art quantization. For 2-bit weight and activation of AlexNet and ResNet18, AutoQNN can achieve the accuracy results of 59.75% and 68.86%, respectively, and obtain accuracy improvements by up to 1.65% and 1.74%, respectively, compared

Keywords: Deep neural networks
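
As a rough illustration of what a per-layer mixed-precision policy means in the abstract above, the sketch below applies plain symmetric uniform quantization with a different bitwidth per layer. This is not AutoQNN's QSS, QPL, or QAG; the function names, the layer names, the weight values, and the `policy` dictionary are all hypothetical and chosen only for the example.

```python
def quantize_uniform(values, bits):
    """Symmetric uniform quantization to a signed `bits`-bit grid, then dequantize."""
    if bits < 2:
        raise ValueError("bits must be >= 2 for a signed symmetric grid")
    max_abs = max(abs(v) for v in values)
    if max_abs == 0:
        return list(values)
    qmax = 2 ** (bits - 1) - 1          # e.g. 1 for 2 bits, 127 for 8 bits
    scale = max_abs / qmax              # step size between adjacent levels
    # Round to the nearest level, clamp to the grid, and map back to floats.
    return [max(-qmax, min(qmax, round(v / scale))) * scale for v in values]


def mean_squared_error(a, b):
    """Average squared difference between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


# Hypothetical weights and a hypothetical mixed-precision policy (bits per layer).
layers = {"conv1": [0.9, -0.4, 0.1, 0.05], "fc": [0.3, -0.3, 0.2, -0.1]}
policy = {"conv1": 4, "fc": 2}

# Quantization error per layer under the chosen bitwidths.
errors = {
    name: mean_squared_error(weights, quantize_uniform(weights, policy[name]))
    for name, weights in layers.items()
}
```

A policy search would trade these per-layer errors (or, more realistically, the task loss) against model size; here the policy is fixed by hand.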

BuildEnVR: An Immersive Analysis System for Environmental Field

International Conference on Parallel and Distributed Systems (ICPADS)

Authors: Zhenghan Zhou; Kebin Liu; Yantong Xie; Hangxu Jin; Shi Liu; Ruiqing Wang; Haitian Zhao; Borong Lin; Xiaofang Mu; Hui Qi. Affiliations: Global Innovation Exchange, Tsinghua University; Fuzhou Fuyao Institute for Advanced Study; School of Architecture, Carnegie Mellon University; School of Architecture, Tsinghua University; College of Computer Science and Technology, Taiyuan Normal University; Shanxi Key Laboratory of Intelligent Optimization Computing and Blockchain Technology

ISBN (digital): 9798331515966

ISBN (print): 9798331515973

Amidst global warming and escalating extreme weather events, indoor environmental quality’s impact on human health and public hygiene gains prominence. Environmental parameters exist essentially as fields, which are characterized by high dimensionality, density and complexity, and contain massive amounts of information in space. To facilitate visualization and analysis of indoor environmental field, we design and implement BuildEnVR, an immersive analysis system by virtual reality, enabling remote analysis of real-time and historical environmental field data. Grounded in user needs and cognitive psychology, three visualization modes emerge: the Virtual Sensor mode enables users to access perceptual data in real-time at any 3D coordinates in ambient space, the 4D Heatmap mode visualizes spatial variations and trends over time in environmental field data, and the Synaesthesia mode realizes the fusion display of multi-dimensional environmental field data, allowing users to quickly understand the overall condition of the indoor environment with a low cognitive load. Extensive user surveys validate BuildEnVR’s intuitiveness and precision, and it is suitable for both experts and general users.

Keywords: Wireless communication; Wireless sensor networks; Three-dimensional displays; Buildings; Psychology; Data visualization; Cognitive load; Real-time systems; Space heating; Meteorology; Climate change; Global warming; Environmental factors; Immersive learning

TNANet: A Temporal-Noise-Aware Neural Network for Suicidal Ideation Prediction with Noisy Physiological Data

arXiv, 2024

Authors: Liu, Niqi; Liu, Fang; Ji, Wenqi; Du, Xinxin; Liu, Xu; Zhao, Guozhen; Mu, Wenting; Liu, Yong-Jin. Affiliations: BNRist, Department of Computer Science and Technology, MOE-Key Laboratory of Pervasive Computing, Tsinghua University, China; State Key Laboratory of Media Convergence and Communication, Communication University of China, China; Multimodal Sensing and Computing Laboratory, Beijing, China; CAS Key Laboratory of Behavioral Science, Institute of Psychology, China; Department of Psychology, Tsinghua University, China

The robust generalization of deep learning models in the presence of inherent noise remains a significant challenge, especially when labels are subjective and noise is indiscernible in natural settings. This problem is particularly pronounced in many practical applications. In this paper, we address a special and important scenario of monitoring suicidal ideation, where time-series data, such as photoplethysmography (PPG), is susceptible to such noise. Current methods predominantly focus on image and text data or address artificially introduced noise, neglecting the complexities of natural noise in time-series analysis. To tackle this, we introduce a novel neural network model tailored for analyzing noisy physiological time-series data, named TNANet, which merges advanced encoding techniques with confidence learning, enhancing prediction accuracy. Another contribution of our work is the collection of a specialized dataset of PPG signals derived from real-world environments for suicidal ideation prediction. Employing this dataset, our TNANet achieves the prediction accuracy of 63.33% in a binary classification task, outperforming state-of-the-art models. Furthermore, comprehensive evaluations were conducted on three other well-known public datasets with artificially introduced noise to rigorously test the TNANet's capabilities. These tests consistently demonstrated TNANet's superior performance by achieving an accuracy improvement of more than 10% compared to baseline methods. Copyright © 2024, The Authors. All rights reserved.

Keywords: Forecasting

Energy-efficient NTT Design with One-bank SRAM and 2-D PE Array

Design, Automation and Test in Europe Conference and Exhibition

Authors: Jianan Mu; Huajie Tan; Jiawen Wu; Haotian Lu; Chip-Hong Chang; Shuai Chen; Shengwen Liang; Jing Ye; Huawei Li; Xiaowei Li. Affiliations: State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences; University of Chinese Academy of Sciences; CASTEST; Tianjin University; Nanyang Technological University; Rock-Solid Security Lab Fiberhome

In Number Theoretic Transform (NTT) operation, more than half of the active energy consumption stems from memory accesses. Here, we propose a generalized design method to improve the energy efficiency of NTT operation by considering the effect of processing element (PE) geometry and memory organization on the data flow between PEs and memory. To decrease the number of data bits that are required to be accessed from the memory, a two-dimensional (2-D) PE array architecture is used. A pair of ping-pong buffers are proposed to transposed swap the coefficients to enable a single bank of memory to be used with the 2-D PE array to reduce the average memory bit access energy without compromising the throughput. Our experimental results show that this design method can produce NTT accelerators with up to 69.8% saving in average energy consumption compared with the existing designs based on multi-bank SRAM and one-bank SRAM with one-dimensional PE array with the same number of PEs and total memory size.
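
For readers unfamiliar with the operation this abstract accelerates, the following is a minimal software sketch of a radix-2 NTT and its inverse. It shows only the butterfly arithmetic whose coefficient reads and writes dominate memory traffic; it does not model the paper's 2-D PE array, ping-pong buffers, or SRAM organization. The modulus 17 and root 2 are toy parameters assumed for the example (2 is a primitive 8th root of unity modulo 17), and the function names are hypothetical.

```python
def ntt(a, root, mod):
    """Iterative radix-2 NTT of a power-of-two-length list; returns a new list."""
    a = list(a)
    n = len(a)
    # Bit-reversal permutation so the butterflies can run in place.
    j = 0
    for i in range(1, n):
        bit = n >> 1
        while j & bit:
            j ^= bit
            bit >>= 1
        j |= bit
        if i < j:
            a[i], a[j] = a[j], a[i]
    # log2(n) stages of butterflies; each stage touches every coefficient once.
    length = 2
    while length <= n:
        w_len = pow(root, n // length, mod)   # twiddle step for this stage
        half = length // 2
        for start in range(0, n, length):
            w = 1
            for k in range(half):
                u = a[start + k]
                v = a[start + k + half] * w % mod
                a[start + k] = (u + v) % mod
                a[start + k + half] = (u - v) % mod
                w = w * w_len % mod
        length *= 2
    return a


def intt(a, root, mod):
    """Inverse NTT for a prime modulus, using Fermat inverses."""
    inv_root = pow(root, mod - 2, mod)
    inv_n = pow(len(a), mod - 2, mod)
    return [x * inv_n % mod for x in ntt(a, inv_root, mod)]


MOD, ROOT = 17, 2                      # toy parameters: 2 has order 8 modulo 17
coeffs = [1, 2, 3, 4, 0, 0, 0, 0]
round_trip = intt(ntt(coeffs, ROOT, MOD), ROOT, MOD)
```

Each of the log2(n) stages reads and writes all n coefficients, which is why the way coefficients are laid out in memory relative to the processing elements matters for energy.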

Memory Access Optimization of High-Order CFD Stencil Computations on GPU

21st International Conference on Parallel and Distributed computing, Applications, and Technologies, PDCAT 2020

Authors: Wang, Shengxiang; Li, Zhuoqian; Che, Yonggang. Affiliation: Institute for Quantum Information and State Key Laboratory of High Performance Computing, College of Computer, National University of Defense Technology, Changsha, China

ISBN (digital): 9783030692445

ISBN (print): 9783030692438

Stencil computations are a class of computations commonly found in scientific and engineering applications. They have relatively low arithmetic intensity, so their performance is greatly affected by memory access. This paper studies memory access optimization for the key stencil computations of a high-order CFD program on NVIDIA GPUs. Two methods are used to optimize the performance. First, we use registers to cache the data used by the stencil computations in the kernel. We use the CUDA warp shuffle functions to exchange data between neighboring grid points, and adjust the thread computation granularity to increase data reuse. Second, we use the shared memory to buffer the grid data used by the stencil computations in the kernel, and utilize loop tiling to reduce redundant accesses to the global memory. Performance evaluation is done on an NVIDIA Tesla K80 GPU. The results show that, compared to the original implementation that only uses the global memory, the optimized implementation that utilizes the registers achieves maximum speedups of 2.59 and 2.79 for the 15M and 60M grids, respectively, and the optimized implementation that utilizes the shared memory achieves maximum speedups of 3.51 and 3.36 for the 15M and 60M grids, respectively. © 2021, Springer Nature Switzerland AG.

Keywords: Graphics processing unit

Radio Modulation Recognition Based on Attention-based Convolutional Neural Network and Time-frequency Analysis

2nd International Seminar on Artificial Intelligence, Networking and Information technology, AINIT 2021

Authors: Li, Xinlong; Huang, Da; Yang, Wenjing; Li, Xueqiong. Affiliation: Institute for Quantum Information and State Key Laboratory of High Performance Computing, College of Computer Science and Technology, National University of Defense Technology, Changsha, China

ISBN (print): 9781665412964

Radio modulation recognition is the key link of modern electronic warfare. This paper applies the idea of deep learning to radio modulation recognition. Since the modulation type is the most important information about the subtle features of the radio signal, we use short time Fourier transform (STFT) to convert the radio signal into a time-frequency distribution map to form a joint distribution of frequency and time. The attention-based convolutional neural network is used to classify and recognize the modulation types of time-frequency distribution maps. The simulation results show that time-frequency analysis can improve the accuracy of recognition, and adding attention can effectively improve the performance of the network. © 2021 IEEE.

Keywords: Convolutional neural networks
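
The time-frequency step mentioned in the abstract above can be sketched as follows: a short-time Fourier transform slices the signal into overlapping windowed frames and takes the magnitude spectrum of each, giving a 2-D map that an image classifier could consume. This is a minimal pure-Python sketch, not the authors' pipeline; the Hann window, the frame length of 16, the hop of 8, and the test tone are assumptions made for the example.

```python
import cmath
import math


def stft_magnitude(signal, frame_len, hop):
    """Return a list of magnitude spectra (bins 0..frame_len//2), one per frame."""
    # Periodic Hann window to reduce spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        spectrum = []
        for k in range(frame_len // 2 + 1):
            # Direct DFT of the windowed frame at frequency bin k.
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames


# Hypothetical test tone: exactly 4 cycles per 16-sample frame.
tone = [math.cos(2 * math.pi * 4 * n / 16) for n in range(64)]
tf_map = stft_magnitude(tone, frame_len=16, hop=8)
peak_bins = [max(range(len(s)), key=s.__getitem__) for s in tf_map]
```

For a modulated signal the peak bin would move from frame to frame, and that movement is the joint time-frequency structure the classifier is meant to pick up. A production version would use an FFT rather than the direct DFT loop shown here.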

DSD-MatchingNet: Deformable sparse-to-dense feature matching for learning accurate correspondences

Virtual Reality & Intelligent Hardware, 2022, Vol. 4, No. 5, pp. 432-443

Authors: Yicheng ZHAO; Han ZHANG; Ping LU; Ping LI; Enhua WU; Bin SHENG. Affiliations: Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai 200240, China; ZTE Corporation, Shenzhen 518057, China; State Key Laboratory of Mobile Network and Mobile Multimedia Technology, Shenzhen 518057, China; Department of Computing, The Hong Kong Polytechnic University, Hong Kong 999077, China; School of Design, The Hong Kong Polytechnic University, Hong Kong 999077, China; State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; Faculty of Science and Technology, University of Macao, Macao 999078, China

Background Exploring correspondences across multiview images is the basis of various computer vision ***, most existing methods have limited accuracy under challenging *** To learn more robust and accurate correspondences, we propose DSD-MatchingNet for local feature matching in this ***, we develop a deformable feature extraction module to obtain multilevel feature maps, which harvest contextual information from dynamic receptive *** dynamic receptive fields provided by the deformable convolution network ensure that our method obtains dense and robust ***, we utilize sparse-to-dense matching with symmetry of correspondence to implement accurate pixel-level matching, which enables our method to produce more accurate *** Experiments show that our proposed DSD-MatchingNet achieves a better performance on the image matching benchmark, as well as on the visual localization ***, our method achieved 91.3% mean matching accuracy on the HPatches dataset and 99.3% visual localization recalls on the Aachen Day-Night dataset.

Keywords: Image matching; Deformable convolution network; Sparse-to-dense matching

Automatic architecture design for distributed quantum computing

Chinese Physics B, 2024, No. 12, pp. 62-77

Authors: 骆挺宇 (Luo Tingyu); 郑宇真 (Zheng Yuzhen); 付祥 (Fu Xiang); 邓玉欣 (Deng Yuxin). Affiliations: Shanghai Key Laboratory of Trustworthy Computing, East China Normal University; Institute for Quantum Information & State Key Laboratory of High Performance Computing, College of Computer, National University of Defense Technology; Tianjin Institute of Advanced Technology

In distributed quantum computing (DQC), quantum hardware design mainly focuses on providing as many high-quality inter-chip connections as possible. Meanwhile, quantum software tries its best to reduce the required number of remote quantum gates between chips. However, this “hardware first, software follows” methodology may not fully exploit the potential of DQC. Inspired by classical software–hardware co-design, this paper explores the design space of application-specific DQC architectures. More specifically, we propose AutoArch, an automated quantum chip network (QCN) structure design tool. With qubit grouping followed by a customized QCN design, AutoArch can generate a near-optimal DQC architecture suitable for target quantum algorithms. Experimental results show that the DQC architecture generated by AutoArch can outperform other general QCN architectures when executing target quantum algorithms.

A Segmentation-aware Synergy Network for Single Particle Recognition in Cryo-EM

2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022

Authors: Li, Shuo; Li, Hongjia; Zhang, Chi; Zhang, Fa; Wan, Xiaohua. Affiliations: Institute of Computing Technology, Chinese Academy of Sciences, High Performance Computer Research Center, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; Zhejiang University, State Key Laboratory of CAD&CG, Hangzhou, China; Beijing Institute of Technology, Beijing, China

ISBN (print): 9781665468190

Cryo-electron microscopy (cryo-EM) single particle analysis (SPA) has been an indispensable technology to reconstruct three-dimensional (3D) structures of biomolecules at near-atomic resolution. Tens of thousands of particles are required to obtain high-resolution 3D reconstructions; nevertheless, particle selection is rather challenging due to the extremely noisy microscopy images and the diversity of particles. Recently, while deep learning-based methods have been devoted to the improvement of particle feature extraction and location estimation, most of them are plagued by vulnerable feature representation and inexact supervised ground truth. Furthermore, these DL methods usually adopt denoising and particle picking as two-stage operations in the existing pipeline, which is inadequate to achieve accurate location estimation. In this paper, we propose a segmentation-aware synergy framework to automatically select particles, in which two tightly coupled networks are designed: a multiple-output convolution subnet for denoising, which jointly learns strong object representation and pixel representation, and a deep convolution subnet for particle location. Furthermore, joint learning of the two networks can effectively enhance the synergy between denoising and downstream recognition, thus leading to accurate and reliable location estimations for SPA. When applied to various real-world EMPIAR datasets, our model improves the performance of particle detection and extraction, especially on the intersection over union metric, and this strength has important implications for the subsequent 2D alignment, 2D classification averaging, and high-resolution 3D refinement steps in SPA. © 2022 IEEE.

Keywords: Convolution
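
The intersection over union (IoU) metric named in the abstract above can be computed for two axis-aligned boxes as sketched below. The `(x_min, y_min, x_max, y_max)` box convention and the function name are assumptions for the example; the paper may evaluate IoU on segmentation masks or with a different box format.

```python
def box_iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x_min, y_min, x_max, y_max)."""
    # Width and height of the overlap region, clamped at zero when disjoint.
    inter_w = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    inter_h = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    inter = inter_w * inter_h
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Hypothetical picked particle box versus a ground-truth box.
predicted = (0, 0, 2, 2)
ground_truth = (1, 1, 3, 3)
score = box_iou(predicted, ground_truth)
```

A picked particle is typically counted as correct when its IoU with a ground-truth box exceeds a chosen threshold, so tighter localization raises the metric even when the number of detections is unchanged.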

Practical security of twin-field quantum key distribution with optical phase-locked loop under wavelength-switching attack

arXiv, 2024

Authors: Peng, Qingquan; Chen, Jiu-Peng; Xing, Tianyi; Wang, Dongyang; Wang, Yizhi; Liu, Yang; Huang, Anqi. Affiliations: Institute for Quantum Information, State Key Laboratory of High Performance Computing, College of Computer Science and Technology, National University of Defense Technology, Hunan, Changsha 410073, China; Jinan Institute of Quantum Technology, Hefei National Laboratory Jinan Branch, Shandong, Jinan 250101, China

The twin-field class quantum key distribution (TF-class QKD) has experimentally demonstrated the ability to surpass the fundamental rate-distance limit without requiring a quantum repeater, as a revolutional milestone. In TF-class QKD implementation, an optical phase-locked loop (OPLL) structure is commonly employed to generate a reference light with correlated phase, ensuring coherence of optical fields between Alice and Bob. In this configuration, the reference light, typically located in the untrusted station Charlie, solely provides wavelength reference for OPLL and does not participate in quantum-state encoding. However, the reference light may open a door for Eve to enter the source stations that are supposed to be well protected. Here, by identifying vulnerabilities of an acousto-optic modulator (AOM) in the OPLL scheme, we propose and demonstrate a wavelength-switching attack on a TF-class QKD system. This attack involves Eve deliberately manipulating the wavelength of the reference light to increase mean photon number of prepared quantum states, while maintaining stable interference between Alice and Bob as required by TF-class QKD protocols. The maximum observed increase in mean photon number is 8.7%, which has been theoretically proven to compromise the security of a TF-class QKD system. Moreover, we have shown that with well calibration of the modulators, the attack can be eliminated. Through this study, we highlight the importance of system calibration in the practical security in TF-class QKD *** Codes 81P94 Copyright © 2024, The Authors. All rights reserved.

Keywords: Modulators
