检索结果-内蒙古大学图书馆

A Generic, High-Performance, Compression-Aware Framework for Data Parallel DNN Training

IEEE Transactions on Parallel and Distributed Systems 2023年第08期36卷 1-20页

作者： Wu, Hao Wang, Shiyi Bai, Youhui Li, Cheng Zhou, Quan Yi, Jun Yan, Feng Chen, Ruichuan Xu, Yinlong Department of Computer Science and Technology University of Science and Technology of China Hefei China Computer Science Department and Electrical and Computer Engineering Department University of Houston USA Nokia Bell Labs

Gradient compression is a promising approach to alleviating the communication bottleneck in data parallel deep neural network (DNN) training by significantly reducing the data volume of gradients for synchronization. While gradient compression is being actively adopted by the industry (e.g., Facebook and AWS), our study reveals that there are two critical but often overlooked challenges: 1) inefficient coordination between compression and communication during gradient synchronization incurs substantial overheads, and 2) developing, optimizing, and integrating gradient compression algorithms into DNN systems imposes heavy burdens on DNN practitioners, and ad-hoc compression implementations often yield surprisingly poor system performance. In this paper, we propose a compression-aware gradient synchronization architecture, CaSync, which relies on flexible composition of basic computing and communication primitives. It is general and compatible with any gradient compression algorithms and gradient synchronization strategies and enables high-performance computation-communication pipelining. We further introduce a gradient compression toolkit, CompLL, to enable efficient development and automated integration of on-GPU compression algorithms into DNN systems with little programming burden. Lastly, we build a compression-aware DNN training framework HiPress with CaSync and CompLL. HiPress is open-sourced and runs on mainstream DNN systems such as MXNet, TensorFlow, and PyTorch. Evaluation via a 16-node cluster with 128 NVIDIA V100 GPUs and a 100 Gbps network shows that HiPress improves the training speed over current compression-enabled systems (e.g., BytePS-onebit, Ring-DGC and PyTorch-PowerSGD) by 9.8%-69.5% across six popular DNN models. IEEE

关键词： Synchronization

来源：评论

学校读者我要写书评

暂无评论

Broadband angular spectrum analog processors based on all-dielectric metasurfaces

引用

science China(Physics,Mechanics & Astronomy) 2024年第7期67卷 188-190页

作者： Lin Deng Yongmin Liu Department of Electrical and Computer Engineering Northeastern UniversityBoston 02115USA Department of Mechanical and Industrial Engineering Northeastern UniversityBoston 02115USA

Digital signal processors are extensively used to execute mathematical operations and advanced computational tasks on digital ***,they suffer from several inherent limitations,including low speed,high energy consumption,and large memory requirements,because of the hardware bottleneck and the imperative conversion between digital and analogue signals.

关键词： dielectric spectrum hardware

来源：评论

学校读者我要写书评

暂无评论

Manifold Clustering Based Nonlinear Model Reduction with Application to Nonlinear Convection 63

Manifold Clustering Based Nonlinear Model Reduction with App...

引用

63rd IEEE Conference on Decision and Control, CDC 2024

作者： Wu, T. Wilson, D. Djouadi, S.M. Department of Electrical Engineering and Computer Science United States

ISBN: (纸本)9798350316339

This paper proposes a new cluster method combined with Dynamic Mode Decomposition with Control (DMDc), and the Proper Orthogonal Decomposition (POD) to construct more accurate reduced order models. DMDc and POD are popular data-driven techniques that extract loworder models from high-dimensional complex dynamic systems. However, these methods are inherently linear, i.e., the data is assumed to belong to linear manifolds. However, this may lead to inaccuracies in the reduced models commensurate with the presence of nonlinearities. To capture the nonlinear behavior, manifold clustering is introduced to group the snapshots obtained by experiments or numerical simulation into several sub-regions based on the underlying non-linear structure. Manifold clustering is a powerful approach for exploratory data analysis, allowing the discovery of patterns and structures that are not apparent in raw high-dimensional data. It does not require knowing the number of clusters and the intrinsic manifold dimensions in advance. Manifold clustering is combined with DMDc and POD to construct the local reduced-order models. Time clustering is applied to the snapshots generated by a nonlinear convective flow governed by the 2D Burgers' equations with boundary actuation. The manifold cluster reduced order model outperforms standard and other cluster-based (K-means) reduced order models. © 2024 IEEE.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Leveraging feature fusion ensemble of VGG16 and ResNet-50 for automated potato leaf abnormality detection in precision agriculture

引用

Soft Computing 2025年第4期29卷 2263-2277页

作者： Trivedi, Amit Kumar Mahajan, Tripti Maheshwari, Tanmay Mehta, Rajesh Tiwari, Shailendra Department of Computer Science and Engineering Thapar Institute of Engineering and Technology Punjab Patiala147001 India

In the era of advancement in technology and modern agriculture, early disease detection of potato leaves will improve crop yield. Various researchers have focussed on disease due to different types of microbial infection in potato leaves using computer vision and machine learning approaches. In this paper, a data science approach for multiclass classification of potato normal and abnormal leaves due to fungal infection like early blight and late blight is performed using the ensembling of deep learning (DL) CNN models. Firstly, the performance of classification on potato disease is verified separately on VGG16 and ResNet-50 CNN models after pre-processing of the leaf dataset. The pre-processing includes noise removal and normalization. Further improvement in classification accuracy is achieved by the ensembling of VGG16 and ResNet-50 CNN models. The ensembling of CNN models is performed on the feature level by fusing features extracted using VGG16 and ResNet-50. From the experimental results, performed on publicly available datasets consisting of 2152 number of normal and abnormal images it is observed that the average classification accuracy of 98.22%, 96.16% and 95.68% is achieved using the proposed ensemble, VGG16 and ResNet-50 models respectively. The efficacy of the proposed approach (ensemble technique at feature level fusion) is verified in comparison with recently reported DL model-based approaches. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2025.

关键词： Classification (of information)

来源：评论

学校读者我要写书评

暂无评论

The superalignment of superhuman intelligence with large language models

引用

science China(Information sciences) 2025年第6期68卷 101-111页

作者： Minlie HUANG Yingkang WANG Shiyao CUI Pei KE Jie TANG The CoAI Group Department of Computer Science and Technology Tsinghua University Laboratory of Intelligent Collaborative Computing University of Electronic Science and Technology of China Knowledge Engineering Group Department of Computer Science and Technology Tsinghua University

We have witnessed the emergence of superhuman intelligence thanks to the fast development of large language models(LLMs) and multimodal language models. As the application of such superhuman models becomes increasingly popular, a critical question arises: how can we ensure they still remain safe, reliable, and aligned well with human values encompassing moral values, Schwartz's Values, ethics, and many more? In this position paper, we discuss the concept of superalignment from a learning perspective to answer this question by outlining the learning paradigm shift from large-scale pretraining and supervised fine-tuning, to alignment training. We define superalignment as designing effective and efficient alignment algorithms to learn from noisy-labeled data(point-wise samples or pair-wise preference data) in a scalable way when the task is very complex for human experts to annotate and when the model is stronger than human experts. We highlight some key research problems in superalignment, namely, weak-to-strong generalization, scalable oversight, and evaluation. We then present a conceptual framework for superalignment, which comprises three modules: an attacker which generates the adversary queries trying to expose the weaknesses of a learner model, a learner which refines itself by learning from scalable feedbacks generated by a critic model with minimal human experts, and a critic which generates critics or explanations for a given query-response pair, with a target of improving the learner by criticizing. We discuss some important research problems in each component of this framework and highlight some interesting research ideas that are closely related to our proposed framework, for instance, self-alignment, self-play, self-refinement, and more. Last, we highlight some future research directions for superalignment, including the identification of new emergent risks and multi-dimensional alignment.

关键词： superalignment superhuman intelligence large language models scalable feedback weak-to-strong generalization

来源：评论

学校读者我要写书评

暂无评论

Federated Learning With Meta-Layers Training for Privacy-Preserving in Vehicular Consumer Electronics

引用

IEEE Transactions on Consumer Electronics 2024年第1期71卷 621-629页

作者： Shen, Xiaoyang Li, Haibin Li, Yaqian Zhang, Wenming Alenazi, Mohammed J.F. Agarwal, Kadambri Yanshan University College of Electrical Engineering Hebei Qinhuangdao066000 China Key Laboratory of Industrial Computer Control Engineering of Hebei Province Hebei Qinhuangdao066000 China King Saud University College of Computer and Information Sciences Department of Computer Engineering Riyadh11451 Saudi Arabia ABES Engineering College Department of Computer Science and Engineering Ghaziabad201009 India

Vehicular consumer electronics, such as autonomous vehicles (AVs), need collecting large amounts of private user information, which face the risk of privacy leakage. To protect the privacy of consumers, researchers have proposed to apply federated learning (FL) to privacy-preserving vehicular consumer electronics, that is, leveraging FL for collaborative training of a decision model without exchanging the sensitive information generated by AVs. However, FL generally needs to exchange the gradients of client models periodically with the central server, which can be attacked by adversaries to infer user information. Thereby, it may still face the risk of privacy leakage. To solve that challenge, we put forward a novel FL framework, called FL with Meta-Layers Training (FL-MLT). Instead of exchanging the gradients of the client models, it exchanges, meta-layers with the central servers. Since meta-layers are only a slice of the client models, exchanging them helps protect privacy. On the other hand, they contain meta-knowledge to help the FL training process. In the experiments, we conduct extensive visual classification simulation to evaluate FL-MLT, and the experimental results demonstrate the superior performance of FL-MLT. © 1975-2011 IEEE.

关键词： Federated learning

来源：评论

学校读者我要写书评

暂无评论

On building automation system security

引用

High-Confidence Computing 2024年第3期4卷 103-122页

作者： Christopher Morales-Gonzalez Matthew Harper Michael Cash Lan Luo Zhen Ling Qun Z.Sun Xinwen Fu Department of Computer Science University of Massachusetts LowellLowell 01854USA Department of Electrical and Computer Engineering University of Central FloridaOrlando 32816USA School of Computer Science and Technology Southeast UniversityMa’anshan 243032China School of Computer Science and Engineering Anhui University of TechnologyNanjing 211189China

Building Automation Systems(BASs)are seeing increased usage in modern society due to the plethora of benefits they provide such as automation for climate control,HVAC systems,entry systems,and lighting *** BASs in use are outdated and suffer from numerous vulnerabilities that stem from the design of the underlying BAS *** this paper,we provide a comprehensive,up-to-date survey on BASs and attacks against seven BAS protocols including BACnet,EnOcean,KNX,LonWorks,Modbus,ZigBee,and *** studies of secure BAS protocols are also presented,covering BACnet Secure Connect,KNX Data Secure,KNX/IP Secure,ModBus/TCP Security,EnOcean High Security and Z-Wave *** and ZigBee do not have security *** point out how these security protocols improve the security of the BAS and what issues remain.A case study is provided which describes a real-world BAS and showcases its vulnerabilities as well as recommendations for improving the security of *** seek to raise awareness to those in academia and industry as well as highlight open problems within BAS security.

关键词： Building automation system BAS protocols Security Attack

来源：评论

学校读者我要写书评

暂无评论

Ill-condition enhancement for BC speech using RMC method

引用

International Journal of Speech Technology 2024年第4期27卷 1085-1092页

作者： Ohidujjaman Hasan, Mahmudul Zhang, Shiming Huda, Mohammad Nurul Uddin, Mohammad Shorif Computer Science and Engineering Daffodil International University Dhaka1216 Bangladesh Computer Science and Engineering Comilla University Comilla3506 Bangladesh School of Electrical and Information Northeast Agricultural University Harbin150030 China Computer Science and Engineering United International University Dhaka1212 Bangladesh Computer Science and Engineering Green University of Bangladesh Kanchon1460 Bangladesh Computer Science and Engineering Jahangirnagar University Savar1342 Bangladesh

This paper improves the ill-condition of bone-conducted (BC) speech signal by reducing the eigenvalue expansion. BC speech commonly contains a large spectral dynamic range that causes ill-condition for the classical linear prediction (LP) methods. In the field of numerical analysis, we often face the situation where an ill-conditioned case occurs in finding the solution. Principally, eigenvalue expansion causes ill-condition in numerical analysis. To mitigate this problem, the regularized least squares (RLS) technique is commonly used. Motivated by the RLS concept, we derive the regularized modified covariance (RMC) method for BC speech analysis in this study. The RMC method reduces eigenvalue expansion by compressing the spectral dynamic range of the speech signal. Thus, the RMC method resolves the ill-conditioned problem of LP. In experiments, we show that the RMC method provides compressed eigenvalue expansion than the conventional methods for BC speech where synthetic and real BC speeches are considered. The performance of the RMC method is affected by the setting of the regularization parameter. In this paper, the regularization parameter in practice is iteratively and rule-based derived. The RMC method with such a setting provides the best performance for BC speech analysis. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Iterative methods

来源：评论

学校读者我要写书评

暂无评论

Harnessing Blockchain to Address Plasma Donation Network Challenges

引用

computers, Materials & Continua 2023年第7期76卷 631-646页

作者： Shivani Batra Mohammad Zubair Khan Gatish Priyadarshi Ayman Noor Talal H.Noor Namrata Sukhija Prakash Srivastava Department of Computer Science and Engineering SRM UniversityDelhi-NCR1310029India Department of Computer Science and Information Taibah UniversityMedina42353Saudi Arabia Department of Computer Science and Engineering KIET Group of InstitutionsDelhi-NCRGhaziabad201206India College of Computer Science and Engineering Taibah UniversityMadinahSaudi Arabia College of Computer Science and Engineering Taibah UniversityYanbuMadinahSaudi Arabia Department of Computer Science and Engineering Banasthali VidyapithRajasthan304022India Department of Computer Science and Engineering Graphic Era(Deemed to Be University)Dehradun248002India

Plasma therapy is an extensively used treatment for critically unwell *** this procedure,a legitimate plasma donor who can continue to supply plasma after healing is ***,significant dangers are associated with supply management,such as the ambiguous provenance of plasma and the spread of infected or subpar blood into medicinal ***,from an ideological standpoint,less powerful people may be exploited throughout the contribution ***,there is a danger to the logistics system because there are now just some plasma *** research intends to investigate the blockchain-based solution for blood plasma to facilitate authentic plasma *** parameters,including electronic identification,chain code,and certified ledgers,have the potential to exert a substantial,profound influence on the distribution and implementation process of blood *** understand the practical ramifications of blockchain,the current study provides a proof of concept approach that aims to simulate the procedural code of modern plasma distribution ecosystems using a blockchain-based *** agent-based modeling used in the testing and evaluation mimics the supply chain to assess the blockchain’s feasibility,advantages,and constraints for the plasma.

关键词： Blockchain hyperledger fabric information visibility plasma donation network plasma quality

来源：评论

学校读者我要写书评

暂无评论

QoE-Aware Volumetric Video Caching and Rendering for Mobile Extended Reality Services

引用

IEEE Internet of Things Journal 2025年第12期12卷 21852-21865页

作者： Pei, Yingying Li, Mushu Huang, Xinyu Shen, Xuemin University of Waterloo Department of Electrical and Computer Engineering WaterlooONN2L 3G1 Canada Lehigh University Department of Computer Science and Engineering BethlehemPA18015 United States

In this article, we propose a novel volumetric video caching and rendering approach for an edge-assisted extended reality (XR) system to enhance user Quality of Experience (QoE). Particularly, user QoE consists of visual quality and quality variation. Different quality of volumetric videos are required to be cached, rendered, and delivered to XR devices for different viewing distances within a time latency. Given the limited caching, computing, and communication resources on the edge server, we formulate a long-term user QoE maximization problem to jointly optimize video caching and rendering by considering user locations and viewing distances. To solve this problem, we first design an online optimization algorithm in which caching decisions are obtained using a regularization technique. We then develop a low-complexity binary search algorithm to determine optimal rendering quality. Extensive simulations are conducted to demonstrate that our proposed approach outperforms benchmark schemes by an average 46% improvement in terms of long-term user QoE. © 2014 IEEE.

关键词： Video streaming

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：