With the emergence of the Transformer architecture, the accuracy of deep learning within the domain of facial emotion recognition has seen further enhancement. However, Transformer comes with increased training comple...
Data race is one of the most important concurrent anomalies in multi-threaded programs. Recently, constraint-based techniques have been leveraged in race detection, and they are able to find all the races that can be found by any other sound race detector. However, this constraint-based approach has serious limitations in helping programmers analyze and understand data races. First, it may report a large number of false positives due to the unrecognized dataflow propagation of the program. Second, it recommends a wide range of thread context switches to schedule the reported race (including the false ones) whenever this race is exposed during the constraint-solving process. This ad hoc recommendation imposes too many context switches, which complicates the data race analysis. To address these two limitations in state-of-the-art constraint-based race detection, this paper proposes DFTracker, an improved constraint-based race detector that recommends each data race with minimal thread context switches. Specifically, we reduce the false positives by analyzing and tracking the dataflow in the program; by this means, DFTracker reduces the unnecessary analysis of false race schedules. We further propose a novel algorithm to recommend an effective race schedule with minimal thread context switches for each data race. The experimental results on real applications demonstrate that 1) without removing any true data race, DFTracker effectively prunes false positives by 68% in comparison with the state-of-the-art constraint-based race detector; and 2) DFTracker recommends as few as 2.6-8.3 (4.7 on average) thread context switches per data race in the real world, which is 81.6% fewer context switches per data race than the state-of-the-art constraint-based race detector. In summary, DFTracker can be used as an effective tool to help programmers understand data races.
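The minimal-context-switch idea above can be illustrated with a toy scheduler. This is a hedged sketch, not DFTracker's actual algorithm: `prefixes`, `minimal_switch_schedule`, and the event representation are invented for illustration. The trick is to keep each thread's prefix contiguous and to run the first racing thread's prefix last, so its racy access needs no extra switch.

```python
def count_context_switches(schedule):
    """Number of thread switches in an interleaving, given as a list of thread ids."""
    return sum(1 for a, b in zip(schedule, schedule[1:]) if a != b)

def minimal_switch_schedule(prefixes, racy):
    """Toy scheduler (illustrative only): run each thread's prefix as one
    contiguous block, ordering the first racing thread last, then append
    the two racy accesses back to back.

    prefixes: {thread_id: [events before its racy access]}
    racy: (thread of first racy access, thread of second racy access)
    """
    first, second = racy
    # Unrelated threads first, then the second racer, then the first racer,
    # so the first racy access continues its own thread without a switch.
    order = [t for t in prefixes if t not in racy] + [second, first]
    schedule = [t for t in order for _ in prefixes[t]]
    schedule += [first, second]  # the racy pair, scheduled adjacently
    return schedule
```

Here the total switch count is just (number of non-empty prefix blocks - 1) plus the single switch between the two racy accesses.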
Image inpainting aims to restore a realistic image from a damaged or incomplete version. Although Transformer-based methods have achieved impressive results by modeling long-range dependencies, the inherent quadratic ...
Edge-assisted mobile crowdsensing (EMCS) has gained significant attention as a data collection paradigm. However, existing incentive mechanisms in EMCS systems rely on centralized platforms, making them impractical for the decentralized nature of EMCS. To address this limitation, we propose CHASER, an incentive mechanism designed for blockchain-based EMCS (BEMCS) systems. CHASER can attract more participants by satisfying the incentive requirements of budget balance, double-side truthfulness, double-side individual rationality, and high social welfare. Moreover, the proposed BEMCS system with CHASER in smart contracts guarantees data confidentiality by utilizing an asymmetric encryption scheme, and the anonymity of participants by applying the zero-knowledge succinct non-interactive argument of knowledge (zk-SNARK). This also restrains the malicious behaviors of participants. Furthermore, most simulations show that the social welfare of CHASER exceeds that of the state-of-the-art baselines. CHASER achieves a competitive ratio of approximately 0.8 and a high task completion rate of over 0.8 in large-scale scenarios. These findings highlight the robustness and desirable performance of CHASER as an incentive mechanism within the BEMCS system.
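The incentive properties CHASER targets (double-side truthfulness, individual rationality, weak budget balance) are classically achieved by trade-reduction double auctions. The sketch below shows that standard textbook mechanism, not CHASER's own design; the function names are ours.

```python
def trade_reduction(bids, asks):
    """Trade-reduction double auction: sort buyers' bids descending and
    sellers' asks ascending, find the number k of breakeven pairs, then
    trade only the first k-1 pairs at prices set by the excluded k-th pair.
    Excluding one pair makes truthful reporting a dominant strategy and
    guarantees no deficit (buyer price >= seller price).

    Returns (number of trades, price each buyer pays, price each seller gets).
    """
    b = sorted(bids, reverse=True)  # buyers, highest bid first
    s = sorted(asks)                # sellers, lowest ask first
    k = 0
    while k < min(len(b), len(s)) and b[k] >= s[k]:
        k += 1                      # k = number of breakeven pairs
    if k < 2:
        return 0, None, None        # fewer than two pairs: nobody trades
    return k - 1, b[k - 1], s[k - 1]
```

For bids [10, 8, 6, 2] and asks [1, 3, 7, 9], two pairs break even, so one pair trades: the buyer pays 8 and the seller receives 3, leaving a non-negative surplus of 5.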
Hybrid memory systems composed of dynamic random access memory (DRAM) and non-volatile memory (NVM) often exploit page migration technologies to fully take advantage of the different memory media. However, previous proposals usually migrate data at a granularity of 4 KB pages, and thus waste memory bandwidth and DRAM space. In this paper, we propose Mocha, a non-hierarchical architecture that organizes DRAM and NVM in a flat address space physically, but manages them in a cache/memory hierarchy. Since the commercial NVM device, Intel Optane DC Persistent Memory Modules (DCPMM), actually accesses the physical media at a granularity of 256 bytes (an Optane block), we manage the DRAM cache at the 256-byte size to adapt to this feature of Optane. This design not only enables fine-grained data migration and management for the DRAM cache, but also avoids write amplification for Intel Optane. We also create an Indirect Address Cache (IAC) in the Hybrid Memory Controller (HMC) and propose a reverse address mapping table in the DRAM to speed up address translation and cache replacement. Moreover, we exploit a utility-based caching mechanism to filter cold blocks in the NVM, further improving the efficiency of the DRAM cache. We implement Mocha in an architectural simulator. Experimental results show that Mocha can improve application performance by 8.2% on average (up to 24.6%), and reduce energy consumption by 6.9% and data migration traffic by 25.9% on average, compared with a typical hybrid memory architecture, HSCC.
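Managing a DRAM cache at Optane's 256-byte block granularity amounts to indexing the cache by block rather than by 4 KB page. A minimal sketch, assuming a direct-mapped cache; the mapping and names below are illustrative, not Mocha's actual IAC layout.

```python
BLOCK = 256  # Optane's internal access granularity, in bytes

def cache_index(nvm_addr, num_sets):
    """Map an NVM physical address to a direct-mapped DRAM-cache slot at
    256-byte granularity: the block number selects the set, and the
    remaining high bits form the tag stored alongside the cached block."""
    block = nvm_addr // BLOCK
    return block % num_sets, block // num_sets  # (set index, tag)
```

With 4 KB pages the same address range would map sixteen 256-byte blocks to one slot, which is exactly the bandwidth and space waste that block-grained management avoids.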
Domain adaptive semantic segmentation enables robust pixel-wise understanding in real-world driving scenes. Source-free domain adaptation, as a more practical technique, addresses the concerns of data privacy and storage limitations in typical unsupervised domain adaptation methods, making it especially relevant in the context of intelligent vehicles. It utilizes a well-trained source model and unlabeled target data to achieve adaptation in the target domain. However, in the absence of source data and target labels, current solutions cannot sufficiently reduce the impact of domain shift and fully leverage the information from the target data. In this paper, we propose an end-to-end source-free domain adaptation semantic segmentation method via Importance-Aware and Prototype-Contrast (IAPC) learning. The proposed IAPC framework effectively extracts domain-invariant knowledge from the well-trained source model and learns domain-specific knowledge from the unlabeled target domain. Specifically, considering the problem of domain shift in the prediction of the target domain by the source model, we put forward an importance-aware mechanism for the biased target prediction probability distribution to extract domain-invariant knowledge from the source model. We further introduce a prototype-contrast strategy, which includes a prototype-symmetric cross-entropy loss and a prototype-enhanced cross-entropy loss, to learn target intra-domain knowledge without relying on labels. Comprehensive experiments on two domain adaptive semantic segmentation benchmarks demonstrate that the proposed end-to-end IAPC solution outperforms existing state-of-the-art methods. The source code is publicly available at https://***/yihong-97/Source-free-IAPC.
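The prototype-symmetric cross-entropy loss combines cross-entropy in both directions between the model's prediction and a prototype-derived soft label. A minimal sketch of the symmetric-CE computation, assuming both inputs are normalized distributions; the function names are ours, not the paper's.

```python
import math

def cross_entropy(p, q, eps=1e-8):
    """CE(p, q) = -sum_i p_i * log(q_i), with eps for numerical safety."""
    return -sum(pi * math.log(qi + eps) for pi, qi in zip(p, q))

def prototype_symmetric_ce(pred, proto_label):
    """Symmetric cross-entropy CE(pred, proto) + CE(proto, pred).

    pred:        per-class softmax output of the segmentation model
    proto_label: soft label derived from class prototypes (illustrative)
    The symmetric form penalizes disagreement in both directions, which
    makes the loss more tolerant of noise in either distribution.
    """
    return cross_entropy(pred, proto_label) + cross_entropy(proto_label, pred)
```

Note the symmetry: swapping the two arguments leaves the loss unchanged, unlike plain cross-entropy.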
A large mode area multi-core orbital angular momentum (OAM) transmission fiber is designed and optimized by a neural network combined with optimization algorithms. A neural network model is first established to predict the optical properties of multi-core OAM transmission fibers with high accuracy and speed, including mode area, nonlinear coefficient, purity, dispersion, and effective index difference. Then the trained neural network model is combined with different particle swarm optimization (PSO) algorithms for automatic iterative optimization of multi-core structures. Owing to the structural advantages of multi-core fiber and the automatic optimization process, we designed a number of multi-core structures with high OAM mode purity (>95%) and an ultra-large mode area (>3000 µm²), which is more than an order of magnitude larger than that of conventional ring-core OAM transmission fibers.
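The optimization loop pairs the trained surrogate with PSO. Below is a plain PSO sketch in which `f` stands in for the neural-network predictor scoring a candidate fiber structure; the hyperparameters and names are illustrative, not the paper's configuration.

```python
import random

def pso_minimize(f, bounds, n_particles=20, iters=50, w=0.7, c1=1.5, c2=1.5):
    """Minimize f over box bounds with a basic particle swarm.
    In the paper's setting, f would be the trained neural-network surrogate
    and each particle a vector of fiber structure parameters."""
    dim = len(bounds)
    pos = [[random.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]                  # per-particle best positions
    pbest_f = [f(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_f[i])
    gbest, gbest_f = pbest[g][:], pbest_f[g]     # swarm-wide best
    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = random.random(), random.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                lo, hi = bounds[d]
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            fi = f(pos[i])
            if fi < pbest_f[i]:
                pbest[i], pbest_f[i] = pos[i][:], fi
                if fi < gbest_f:
                    gbest, gbest_f = pos[i][:], fi
    return gbest, gbest_f
```

Because each evaluation of `f` is a fast surrogate call rather than a full-wave simulation, thousands of candidate structures can be screened per optimization run.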
The effectiveness of facial expression recognition (FER) algorithms hinges on the model's quality and the availability of a substantial amount of labeled expression data. However, labeling large datasets demands significant human, time, and financial resources. Although active learning methods have mitigated the dependency on extensive labeled data, a cold-start problem persists in small to medium-sized expression recognition datasets. This issue arises because the initial labeled data often fail to represent the full spectrum of facial expression categories. This paper introduces an active learning approach that integrates uncertainty estimation, aiming to improve the precision of facial expression recognition regardless of dataset scale. The method is divided into two primary phases. First, the model undergoes self-supervised pre-training using contrastive learning and uncertainty estimation to bolster its feature extraction capabilities. Second, the model is fine-tuned using the prior knowledge obtained from the pre-training phase to significantly improve recognition accuracy. In the pre-training phase, the model employs contrastive learning to extract fundamental feature representations from the complete unlabeled dataset. These features are then weighted through a self-attention mechanism with rank regularization. Subsequently, data from the low-weighted set are relabeled to further refine the model's feature extraction capability. The pre-trained model is then utilized in active learning to select and label information-rich samples more efficiently. Experimental results demonstrate that the proposed method significantly outperforms existing approaches, achieving improvements in recognition accuracy of 5.09% and 3.82% over the best existing active learning methods, Margin and Least Confidence, respectively, and a 1.61% improvement compared to the conventional segmented active learning method.
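The baselines mentioned (Margin, Least Confidence) rank unlabeled samples by predictive uncertainty and send the most uncertain ones to the annotator. A minimal sketch of least-confidence acquisition with hypothetical names; the paper's own pipeline adds contrastive pre-training on top of this kind of selection.

```python
def least_confidence(probs):
    """Uncertainty score per sample: 1 - (probability of the predicted class).
    probs: list of per-class probability rows, one row per unlabeled sample."""
    return [1.0 - max(row) for row in probs]

def select_for_labeling(probs, budget):
    """Return the indices of the `budget` most uncertain samples,
    i.e. those where the model's top prediction is least confident."""
    scores = least_confidence(probs)
    ranked = sorted(range(len(scores)), key=lambda i: -scores[i])
    return ranked[:budget]
```

The Margin baseline differs only in the score: it uses the gap between the top two class probabilities instead of `1 - max`.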
Digital image processing (DIP) is the ability to manipulate digital photographs via algorithms for pattern detection, segmentation, enhancement, and noise reduction. In addition, the Internet of Things (IoT) acts as t...
Today's deep learning models face an increasing demand to handle dynamic shape tensors and computation whose shape information remains unknown at compile time and varies in a nearly infinite range at runtime. This shape dynamism brings tremendous challenges for existing compilation pipelines designed for static models which optimize tensor programs relying on exact shape values. This paper presents TSCompiler, an end-to-end compilation framework for dynamic shape models. TSCompiler first proposes a symbolic shape propagation algorithm to recover symbolic shape information at compile time to enable subsequent optimizations. TSCompiler then partitions the shape-annotated computation graph into multiple subgraphs and fine-tunes the backbone operators from the subgraph within a hardware-aligned search space to find a collection of high-performance schedules. TSCompiler can propagate the explored backbone schedule to other fusion groups within the same subgraph to generate a set of parameterized tensor programs for fused cases based on dependence analysis. At runtime, TSCompiler utilizes an occupancy-targeted cost model to select from pre-compiled tensor programs for varied tensor shapes. Extensive evaluations show that TSCompiler can achieve state-of-the-art speedups for dynamic shape models. For example, we can improve kernel efficiency by up to 3.97× on NVIDIA RTX3090, and 10.30× on NVIDIA A100 and achieve up to five orders of magnitude speedups on end-to-end latency.
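Symbolic shape propagation replaces exact dimension values with named symbols at compile time and checks consistency wherever values are known. A toy sketch for a single 2-D matmul, assuming dimensions are either ints or strings naming unknown symbols (e.g. 'batch'); TSCompiler's actual propagation covers whole computation graphs.

```python
def matmul_shape(a, b):
    """Propagate (possibly symbolic) shapes through a 2-D matmul A @ B.
    Concrete inner dimensions must match exactly; if either inner dim is a
    symbol, the check is deferred to runtime and propagation proceeds."""
    (m, k1), (k2, n) = a, b
    if isinstance(k1, int) and isinstance(k2, int) and k1 != k2:
        raise ValueError(f"shape mismatch: inner dims {k1} vs {k2}")
    return (m, n)
```

This is the property the paper's pipeline relies on: downstream passes can still reason about the output shape `(m, n)` even when `m` is only known symbolically.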