Search Results - Inner Mongolia University Library

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Kaijun Deng; Dezhi Zheng; Jindong Xie; Jinbao Wang; Weicheng Xie; Linlin Shen; Siyang Song. Affiliations: Computer Vision Institute, School of Computer Science and Software Engineering, Shenzhen University; National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University; Guangdong Provincial Key Laboratory of Intelligent Information Processing; Department of Computer Science, University of Exeter

ISBN: (digital) 9798350368741

ISBN: (print) 9798350368758

Accurately synthesizing talking face videos and capturing fine facial features for individuals with long hair presents a significant challenge. To address this challenge, we propose Decomposed Per-Embedding Gaussian Fields (DEGSTalk), a 3D Gaussian Splatting (3DGS)-based talking face synthesis method for generating realistic talking faces with long hair. Our DEGSTalk employs Deformable Pre-Embedding Gaussian Fields, which dynamically adjust pre-embedding Gaussian primitives using implicit expression coefficients. This enables precise capture of dynamic facial regions and subtle expressions. Additionally, we propose a Dynamic Hair-Preserving Portrait Rendering technique to enhance the realism of long hair motions in the synthesized videos. Results show that DEGSTalk achieves improved realism and synthesis quality compared to existing approaches, particularly in handling complex facial dynamics and hair preservation. Our code is available at https://***/CVI-SZU/DEGSTalk.

Keywords: Hair; Training; Three-dimensional displays; Dynamics; Signal processing; Rendering (computer graphics); Noise measurement; Speech processing; Faces; Videos

Remote State Estimation for Complex Networks Under Probabilistic Bit Flips: A Transmission Power Allocation Scheme

IEEE Transactions on Automatic Control, 2025

Authors: Song, Jiahao; Wang, Zidong; Liu, Qinyuan; He, Xiao. Affiliations: Tsinghua University, Department of Automation, Beijing 100084, China; Brunel University London, Department of Computer Science, Uxbridge UB8 3PH, United Kingdom; Tongji University, Department of Computer Science and Technology, Shanghai 201804, China; Ministry of Education, Tongji University, Key Laboratory of Embedded System and Service Computing, Shanghai 200092, China; Shanghai Artificial Intelligence Laboratory, Shanghai, China

In this paper, the problem of remote state estimation is investigated for a class of complex networks with noisy wireless communication channels. The employment of the binary encoding scheme allows for the description of the influence of channel noises as probabilistic bit flips, where the probability of these bit flips is influenced by several factors with the signal-to-noise ratio (SNR) being a crucial one. In engineering practice, the method of adjusting the transmission power for each node in a complex network is commonly applied to modify the SNR so as to reduce the data distortion caused by bit flips. Furthermore, due to restricted communication resources, the total available transmission power is often limited particularly in large-scale systems such as complex networks. Consequently, the allocation of transmission power for all nodes under a total transmission power constraint becomes a concern. Utilizing the ultimately bounded filtering method, we devise a transmission-power-dependent state estimator, by which the combined effect of probabilistic bit flips and transmission power allocation on estimation performance is analyzed. Moreover, the task of co-designing the transmission power allocation scheme and estimator gains is modeled as an optimization problem, which is then addressed through a two-step optimization strategy. The existence of a unique optimal transmission power allocation scheme is also proven. Finally, numerical simulation examples are provided to demonstrate the effectiveness of the proposed co-design approach. © 1963-2012 IEEE.
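
The bit-flip channel model described in this abstract can be illustrated with a small sketch. It is not the authors' estimator: the uniform quantizer, the BPSK-over-AWGN relation between SNR and flip probability, and the names `flip_probability`, `encode`, `decode`, and `transmit` are all assumptions made for illustration, with `snr` taken as the per-bit SNR in linear units.

```python
import math
import random

def flip_probability(snr):
    """Bit-error probability of BPSK over AWGN (an assumed channel model)."""
    return 0.5 * math.erfc(math.sqrt(snr))

def encode(value, lo, hi, n_bits):
    """Uniformly quantize value in [lo, hi] to n_bits bits, most significant first."""
    levels = (1 << n_bits) - 1
    clipped = min(max(value, lo), hi)
    index = round((clipped - lo) / (hi - lo) * levels)
    return [(index >> (n_bits - 1 - i)) & 1 for i in range(n_bits)]

def decode(bits, lo, hi):
    """Map a bit list back to a real value in [lo, hi]."""
    levels = (1 << len(bits)) - 1
    index = 0
    for b in bits:
        index = (index << 1) | b
    return lo + index / levels * (hi - lo)

def transmit(bits, p, rng):
    """Flip each bit independently with probability p."""
    return [b ^ 1 if rng.random() < p else b for b in bits]

# Hypothetical usage: a higher SNR (more transmission power) lowers the flip probability.
rng = random.Random(0)
sent = encode(0.3, -1.0, 1.0, 8)
received = transmit(sent, flip_probability(4.0), rng)
estimate = decode(received, -1.0, 1.0)
```

Under this assumed model, raising a node's transmission power raises its SNR and lowers its flip probability, which is the lever a power allocation scheme would trade off across nodes.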

Keywords: Large scale systems

Safe and Reliable Diffusion Models via Subspace Projection

arXiv, 2025

Authors: Chen, Huiqiang; Zhu, Tianqing; Wang, Linlin; Yu, Xin; Gao, Longxiang; Zhou, Wanlei. Affiliations: Faculty of Data Science, City University of Macau, China; School of Computer Science, University of Queensland, QLD, Australia; Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology, Shandong Academy of Sciences, Jinan, China; Shandong Provincial Key Laboratory of Computing Power Internet and Service Computing, Shandong Fundamental Research Center for Computer Science, Jinan, China

Large-scale text-to-image (T2I) diffusion models have revolutionized image generation, enabling the synthesis of highly detailed visuals from textual descriptions. However, these models may inadvertently generate inappropriate content, such as copyrighted works or offensive images. While existing methods attempt to eliminate specific unwanted concepts, they often fail to ensure complete removal, allowing the concept to reappear in subtle forms. For instance, a model may successfully avoid generating images in Van Gogh’s style when explicitly prompted with "Van Gogh", yet still reproduce his signature artwork when given the prompt "Starry Night". In this paper, we propose SAFER, a novel and efficient approach for thoroughly removing target concepts from diffusion models. At a high level, SAFER is inspired by the observed low-dimensional structure of the text embedding space. The method first identifies a concept-specific subspace S_c associated with the target concept c. It then projects the prompt embeddings onto the complementary subspace of S_c, effectively erasing the concept from the generated images. Since concepts can be abstract and difficult to fully capture using natural language alone, we employ textual inversion to learn an optimized embedding of the target concept from a reference image. This enables more precise subspace estimation and enhances removal performance. Furthermore, we introduce a subspace expansion strategy to ensure comprehensive and robust concept erasure. Extensive experiments demonstrate that SAFER consistently and effectively erases unwanted concepts from diffusion models while preserving generation quality. © 2025, CC BY.
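
The projection step described in this abstract can be sketched with plain linear algebra. This is a minimal illustration, not the SAFER implementation: the toy vectors, the Gram-Schmidt construction of the subspace, and the names `orthonormal_basis` and `project_out` are assumptions, and textual inversion and subspace expansion are not modeled.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def orthonormal_basis(vectors, tol=1e-10):
    """Gram-Schmidt: return an orthonormal basis for span(vectors)."""
    basis = []
    for v in vectors:
        w = list(v)
        for q in basis:
            c = dot(w, q)
            w = [wi - c * qi for wi, qi in zip(w, q)]
        norm = math.sqrt(dot(w, w))
        if norm > tol:  # skip vectors already inside the span
            basis.append([wi / norm for wi in w])
    return basis

def project_out(embedding, basis):
    """Project embedding onto the orthogonal complement of span(basis)."""
    out = list(embedding)
    for q in basis:
        c = dot(embedding, q)
        out = [oi - c * qi for oi, qi in zip(out, q)]
    return out

# Hypothetical toy data: two vectors spanning a concept subspace in a 4-D space.
concept_vectors = [[1.0, 0.0, 0.0, 0.0], [1.0, 1.0, 0.0, 0.0]]
prompt_embedding = [0.5, -2.0, 3.0, 1.0]

basis = orthonormal_basis(concept_vectors)
cleaned = project_out(prompt_embedding, basis)
```

After the projection, `cleaned` has no component along any direction in the concept subspace, while components outside it are left unchanged.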

Keywords: Embeddings

E2DAS: An Efficient Equivariant Dynamic Aggregation Saliency Model for Omnidirectional Images

27th International Conference on Pattern Recognition, ICPR 2024

Authors: Zhang, Nana; Liu, Qian; Zhu, Dandan; Zhu, Kun; Zhai, Guangtao; Yang, Xiaokang. Affiliations: School of Computer Science and Technology, Donghua University, No.2999 Renmin North Road, Songjiang District, Shanghai 201620, China; Institute of AI Education Shanghai, East China Normal University, No.3663 Zhongshan North Road, Putuo District, Shanghai 200333, China; Key Laboratory of Embedded System and Service Computing, Ministry of Education, Tongji University, No.4800 Cao’an Highway, Jiading District, Shanghai 201804, China; Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, No. 800 Dongchuan Road, Minhang District, Shanghai 200240, China

ISBN: (print) 9783031781216

Recent years have witnessed rapid progress in convolutional neural networks (CNNs) and their successful application to saliency prediction for omnidirectional images (ODIs). Despite achieving large performance improvements, these CNN-based saliency models suffer from two major shortcomings: they are agnostic to spatial content and computationally intensive. Inspired by the effectiveness of equivariant networks in the majority of computer vision tasks, we propose a novel efficient equivariant dynamic aggregation saliency (E2DAS) model to efficiently tackle human fixation prediction in ODIs. Specifically, the proposed model consists of an efficient equivariant module, a dynamic convolutional aggregation module, and an optimization computation module. Unlike existing saliency models for ODIs, this is the first attempt to introduce an efficient equivariant dynamic convolutional aggregation operation into the saliency prediction task, which can fundamentally alleviate the projection distortion problem and effectively learn spatial content-adaptive features. Moreover, we observe a considerable decrease in the number of parameters resulting from the replacement of standard convolution with dynamic convolution aggregation. Extensive experiments on several benchmark datasets show the proposed model’s superiority over other state-of-the-art methods in terms of performance. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Keywords: Equivariant dynamic aggregation; light-weight model; omnidirectional images; saliency prediction; spatial content-adaptive

Dual Encoders for Diffusion-based Image Inpainting

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Dezhi Zheng; Kaijun Deng; Jinbao Wang; Linlin Shen. Affiliations: Computer Vision Institute, School of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China; National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, China; Guangdong Provincial Key Laboratory of Intelligent Information Processing, Shenzhen, China

ISBN: (digital) 9798350368741

ISBN: (print) 9798350368758

Current diffusion-based inpainting models struggle to preserve unmasked regions or generate highly coherent content. Additionally, it is hard for them to generate meaningful content for 3D inpainting. To tackle these challenges, we design a plug-and-play branch that runs through the entire generation process to enhance existing models. Specifically, we utilize dual encoders, a Convolutional Neural Network (CNN) encoder and the pre-trained Variational AutoEncoder (VAE) encoder, to encode masked images. The latent code and the feature map from the dual encoders are fed to diffusion models simultaneously. In addition, we apply Zero-padded initialization to solve the problem of mode collapse caused by this branch. Experiments on BrushBench and EditBench demonstrate that models with our plug-and-play branch can improve the coherence of inpainting, and our model achieves new state-of-the-art results.

Keywords: Visualization; Three-dimensional displays; Codes; Autoencoders; Coherence; Signal processing; Diffusion models; Decoding; Convolutional neural networks; Speech processing

Constructive Interference Precoding Empowered NOMA-ISAC Design

IEEE Transactions on Wireless Communications, 2025

Authors: Wang, Wei; Dong, Chao; Zhao, Nan; Wu, Qihui; Niyato, Dusit. Affiliations: Ministry of Industry and Information Technology, Nanjing University of Aeronautics and Astronautics, Key Laboratory of Dynamic Cognitive System of Electromagnetic Spectrum Space, Nanjing 210023, China; Dalian University of Technology, School of Information and Communication Engineering, Dalian 116024, China; Nanyang Technological University, College of Computing and Data Science, 639798, Singapore

Non-orthogonal multiple access (NOMA) can help integrated sensing and communication (ISAC) accommodate more users and manage interference well. In this paper, we first propose a NOMA-ISAC scheme, in which a multiantenna base station (BS) transmits an ISAC signal to detect a radar user (RU) and simultaneously provides wireless service to the RU and the communication user (CU). The inter-user interference can be mitigated by successive interference cancellation (SIC). We further investigate the trade-off between minimizing the beampattern matching error and maximizing the CU’s achievable signal-to-noise ratio (SNR), and propose a penalty-based semi-definite relaxation (SDR) method to solve this non-convex problem. Then, to mitigate the instantaneous NOMA-ISAC beampattern shaking and enhance its stability, we utilize constructive interference precoding (CIP) to assist the NOMA-ISAC beampattern design. Introducing CIP converts the interference from the RU into a beneficial signal for the CU, so the complex SIC can be avoided. The corresponding trade-off can then be transformed into a convex problem by the Taylor-series approximation, and an iterative algorithm is proposed to solve it. Moreover, Manopt-toolbox-assisted initialization is utilized to accelerate its convergence. Simulation results verify that the proposed CIP-NOMA-ISAC scheme can effectively enhance the stability of the instantaneous NOMA-ISAC beampattern over limited time slots and provide a higher SNR for the CU. © 2002-2012 IEEE.

Keywords: Taylor series

Joint UAV Deployment and Edge Association for Energy-Efficient Federated Learning

IEEE Transactions on Cognitive Communications and Networking, 2025

Authors: Wu, Tao; Li, Maomao; Qu, Yuben; Wang, Hongjun; Wei, Zhenhua; Cao, Jiannong. Affiliations: National University of Defense Technology, Hefei 230009, China; Nanjing University of Aeronautics and Astronautics, Key Laboratory of Dynamic Cognitive System of Electromagnetic Spectrum Space, Ministry of Industry and Information Technology, Nanjing 211106, China; Xi'an Research Institute of High Technology, Xi'an 710025, China; The Hong Kong Polytechnic University, Department of Computing, Hong Kong

Recently, federated learning (FL) has become a promising distributed learning paradigm that caters to the recent trend of pushing intelligence from the cloud to the edge. Nevertheless, communication bottlenecks and device dropout can lead to inefficient FL at large network scale, where massive numbers of devices cannot be accessed with severely limited network resources. Inspired by Unmanned Aerial Vehicle (UAV)-assisted mobile edge computing (MEC), we propose a multi-UAV assisted FL design to provide intermediate model aggregation in the sky. Specifically, we study the problem of joint UAV dePloyment and edge aSsociation (UPS) to minimize the overall energy consumption, which concerns UAV deployment, edge association, and resource allocation. Unfortunately, solving this problem is non-trivial, due to its infinite search space and the complex coupling among mixed optimization variables. To tackle this difficulty, we exploit the FL bundle generation method to reduce candidate locations of UAVs from infinite to finite. Then, we decompose the initial problem and devise an alternating optimization-based algorithm to achieve the optimal resource allocation in closed form. On this basis, we design a greedy-based approximation algorithm with a log N performance guarantee for UAV deployment and edge association. Extensive simulations are conducted to validate the effectiveness of our proposed solution. Compared with five benchmarks, our proposed algorithm can significantly reduce the overall training energy consumption under the training time constraint, while always maintaining better training performance under different parameter settings. © 2015 IEEE.

Keywords: Federated learning

Adaptive Hybrid FFT: A Novel Pipeline and Memory-Based Architecture for Radix-2^k FFT in Large Size Processing

arXiv, 2025

Authors: Zhao, Fangyu; Xiao, Chunhua; Wang, Zhiguo; Du, Xiaohua; Dong, Bo. Affiliations: College of Computer Science, Chongqing University, Chongqing, China; Key Laboratory of Dependable Service Computing in Cyber Physical Society, Ministry of Education, China; Sichuan Huacun Zhigu Technology Co. Ltd., Chengdu, China

In the field of digital signal processing, the fast Fourier transform (FFT) is a fundamental algorithm. Its processors are implemented using either the pipelined architecture, well known for high-throughput applications but weak in hardware utilization, or the memory-based architecture, designed for area-constrained scenarios but failing to meet stringent throughput requirements. In this paper, we propose an adaptive hybrid FFT processor that combines the advantages of both architectures, with the following features. First, a set of radix-2^k multi-path delay commutator (MDC) units is developed to support high-performance large-size processing. Second, a conflict-free memory access scheme is formulated to ensure a continuous data flow without data contention. Third, we demonstrate the existence of a series of bit-dimension permutations for reordering input data, satisfying the generalized constraints of variable length, high radix, and any level of parallelism for wide adaptivity. Furthermore, the proposed FFT processor has been implemented on a field-programmable gate array (FPGA). As a result, the proposed work outperforms conventional memory-based FFT processors by requiring fewer computation cycles. It achieves higher hardware utilization than pipelined FFT architectures, making it suitable for highly demanding applications. Copyright © 2025, The Authors. All rights reserved.
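
For readers unfamiliar with the underlying computation, the sketch below shows a textbook software radix-2 decimation-in-time FFT, in which a bit-reversal permutation reorders the input before the butterfly stages. It is not the authors' radix-2^k hardware architecture or their permutation scheme; the function names `bit_reverse_permute` and `fft_radix2` are hypothetical, and the code only illustrates the kind of input reordering and butterfly arithmetic that such processors implement.

```python
import cmath

def bit_reverse_permute(x):
    """Reorder x so that the element at index i moves to the bit-reversed index."""
    n = len(x)
    bits = n.bit_length() - 1
    out = [0j] * n
    for i in range(n):
        r = int(format(i, f"0{bits}b")[::-1], 2) if bits else 0
        out[r] = x[i]
    return out

def fft_radix2(x):
    """Iterative radix-2 decimation-in-time FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    a = bit_reverse_permute([complex(v) for v in x])
    size = 2
    while size <= n:
        half = size // 2
        w_step = cmath.exp(-2j * cmath.pi / size)  # twiddle factor increment
        for start in range(0, n, size):
            w = 1 + 0j
            for k in range(half):
                even = a[start + k]
                odd = a[start + k + half] * w
                a[start + k] = even + odd          # butterfly: upper output
                a[start + k + half] = even - odd   # butterfly: lower output
                w *= w_step
        size *= 2
    return a

# Hypothetical usage: a constant signal concentrates all energy in bin 0.
spectrum = fft_radix2([1, 1, 1, 1])
```

Each pass of the outer loop corresponds to one butterfly stage; a pipelined design would dedicate hardware to each stage, while a memory-based design would reuse one butterfly unit across stages.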

Keywords: Pipeline processing systems

Enhanced Practical Byzantine Fault Tolerance for Service Function Chain Deployment: Advancing Big Data Intelligence in Control Systems

Computers, Materials and Continua, 2025, Vol. 83, No. 3, pp. 4393-4409

Authors: Peiying Zhang; Yihong Yu; Jing Liu; Chong Lv; Lizhuang Tan; Yulin Zhang. Affiliations: Qingdao Institute of Software, College of Computer Science and Technology, China University of Petroleum (East China), Qingdao 266580, China; Shandong Key Laboratory of Intelligent Oil & Gas Industrial Software, Qingdao 266580, China; Library of Shanghai Lixin University of Accounting and Finance, Shanghai 201209, China; Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, China; Shandong Provincial Key Laboratory of Computing Power Internet and Service Computing, Shandong Fundamental Research Center for Computer Science, Jinan 250014, China; Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of China, Beijing 100081, China; Key Laboratory of Intelligent Game, Yangtze River Delta Research Institute of NPU, Taicang 215400, China; Key Laboratory of Education Informatization for Nationalities (Yunnan Normal University), Ministry of Education, Kunming 650092, China

As Internet of Things (IoT) technologies continue to evolve at an unprecedented pace, intelligent big data control and information systems have become critical enablers for organizational digital transformation, facilitating data-driven decision making, fostering innovation ecosystems, and maintaining operational stability. In this study, we propose an advanced deployment algorithm for Service Function Chaining (SFC) that leverages an enhanced Practical Byzantine Fault Tolerance (PBFT) mechanism. The main goal is to tackle the issues of security and resource efficiency in SFC implementation across diverse network settings. By integrating blockchain technology and Deep Reinforcement Learning (DRL), our algorithm not only optimizes resource utilization and quality of service but also ensures robust security during SFC deployment. Specifically, the enhanced PBFT consensus mechanism (VRPBFT) significantly reduces consensus latency and improves Byzantine node detection through the introduction of a Verifiable Random Function (VRF) and a node reputation grading model. Experimental results demonstrate that compared to traditional PBFT, the proposed VRPBFT algorithm reduces consensus latency by approximately 30% and decreases the proportion of Byzantine nodes by 40% after 100 rounds of consensus. Furthermore, the DRL-based SFC deployment algorithm (SDRL) exhibits rapid convergence during training, with improvements in long-term average revenue, request acceptance rate, and revenue/cost ratio of 17%, 14.49%, and 20.35%, respectively, over existing algorithms. Additionally, the CPU resource utilization of the SDRL algorithm reaches up to 42%, which is 27.96% higher than other algorithms. These findings indicate that the proposed algorithm substantially enhances resource utilization efficiency, service quality, and security in SFC deployment.

Keywords: Big data; intelligent transformation; heterogeneous networks; service function chain; blockchain; deep reinforcement learning; trusted deployment

DRL-Based Time-Varying Workload Scheduling With Priority and Resource Awareness

IEEE Transactions on Network and Service Management, 2025

Authors: Liu, Qifeng; Fan, Qilin; Zhang, Xu; Li, Xiuhua; Wang, Kai; Xiong, Qingyu. Affiliations: Chongqing University, School of Big Data and Software Engineering, Chongqing 400044, China; Chongqing University, Key Laboratory of Dependable Service Computing in Cyber Physical Society of Ministry of Education, Chongqing 400044, China; Nanjing University, School of Electronic Science and Engineering, Nanjing 210023, China; Haihe Laboratory of Information Technology Application Innovation, Tianjin 300072, China; Harbin Institute of Technology, School of Computer Science and Technology, Weihai 264209, China; Shandong Key Laboratory of Industrial Network Security, Weihai 264209, China

With the proliferation of cloud services and the continuous growth in enterprises' demand for dynamic multi-dimensional resources, effective strategies for time-varying workload scheduling have become increasingly important. In this paper, we propose a deep reinforcement learning (DRL)-based method for time-varying workload scheduling, aiming to allocate resources efficiently across servers in the cluster. Specifically, we integrate a classifier and a queue scorer to construct a priority queue that exploits temporal resource utilization patterns across different workload classes. Then, we design parallel graph attention layers to capture the dimensional features and temporal dynamics of the cloud server cluster. Moreover, we propose a DRL algorithm to generate scheduling strategies that can adapt to dynamic environments. Validation on real-world traces from a Google cluster demonstrates that our method outperforms existing approaches in key metrics of cloud server cluster management. © 2004-2012 IEEE.

Keywords: Cloud platforms
