Search Results - Inner Mongolia University Library

GLOBECOM 2022 - 2022 IEEE Global Communications Conference

Authors: Wenjing Zhang, Yining Wang, Mingzhe Chen, Tao Luo, Dusit Niyato

Affiliations: Beijing Laboratory of Advanced Information Network, Beijing University of Posts and Telecommunications, Beijing, China; Department of Electrical and Computer Engineering, Institute for Data Science and Computing, University of Miami, Coral Gables, FL, USA; School of Computer Science and Engineering, Nanyang Technological University, Singapore

ISBN: (print) 9781665435413

In this paper, a semantic communication framework for image transmission is investigated. In the framework, a server transmits image data to a set of users utilizing semantic communication techniques, which enable the server to transmit only the semantic information that accurately captures the meaning of an image. To evaluate the performance of the studied semantic communication system, we propose a multimodal metric called image-to-graph semantic similarity (ISS). The significance of this new metric is that it can measure the correlation of the meaning between semantic information and the original image. To meet the ISS requirement of each user, the server must jointly determine the semantic information to be transmitted and the resource blocks (RBs) used for semantic information transmission. We formulate this problem as an optimization problem whose goal is to minimize the average transmission latency while meeting the ISS requirement. To solve this problem, we propose a model-based actor-critic deep reinforcement learning (DRL) algorithm. Compared to traditional actor-critic DRL, the proposed algorithm uses a novel value function to improve action exploration, thereby increasing the probability of finding an optimal solution. Simulation results show that the proposed method can reduce the transmission delay by 16.4% and improve the convergence speed by up to 50% compared to traditional actor-critic DRL.

Keywords: Measurement; Wireless communication; Image communication; Simulation; Semantics; Reinforcement learning; Servers

Learning From Single-Expert Annotated Labels for Automatic Sleep Staging

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Zhiheng Luan, Yanzhen Ren, Li Peng, Xiong Chen, Xiuping Yang, Weiping Tu, Yuhong Yang

Affiliations: Ministry of Education Key Laboratory of Aerospace Information Security and Trusted Computing, School of Cyber Science and Engineering, Wuhan University; Department of Otorhinolaryngology, Head and Neck Surgery, Zhongnan Hospital of Wuhan University; Sleep Medicine Centre, Zhongnan Hospital of Wuhan University; School of Computer Science, Wuhan University

Existing automatic sleep staging algorithms rely on accurately labeled data. However, due to the subjectivity of sleep experts, accurate labels must be obtained through joint labeling by multiple experts, which results in high time and labor costs. In this work, we treat labels mislabeled by a single expert as noisy labels and first propose SE-ASS, an automatic sleep staging learning framework based on single-expert annotated data. Since multiple models tend to produce inconsistent predictions for instances with incorrect labels during training, we use two networks with the same structure but different initializations and regularize them with a prediction consistency loss to prevent overfitting to noisy labels. Furthermore, we use a contrastive loss between models to enhance the exploration of feature representations without relying on potentially noisy labels. Our results on two publicly available datasets show that SE-ASS can effectively improve the performance of automatic sleep staging models trained on single-expert annotated datasets.

Keywords: Training; Costs; Medical services; Signal processing; Predictive models; Noise measurement; Labeling
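
A minimal sketch of the prediction consistency idea described in the abstract above, assuming the consistency term is a symmetric KL divergence between the two networks' softmax outputs (the abstract does not state its exact form). The function names and the five-class example logits are illustrative and do not come from the paper.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def consistency_loss(logits_a, logits_b):
    """Symmetric KL between the class distributions of two networks.

    Assumption: this stands in for the paper's prediction consistency loss.
    """
    p, q = softmax(logits_a), softmax(logits_b)
    return 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))

# Hypothetical logits from two identically structured networks for one
# sample with five classes (e.g. five sleep stages).
agree = consistency_loss([2.0, 0.1, 0.3, -1.0, 0.0], [2.0, 0.1, 0.3, -1.0, 0.0])
disagree = consistency_loss([2.0, 0.1, 0.3, -1.0, 0.0], [0.0, 0.1, 2.5, -1.0, 0.0])
print(agree, disagree)
```

The loss is zero when both networks predict the same distribution and grows as they diverge, which is the property the abstract relies on to discourage fitting mislabeled samples.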

Graph Neural Network Enabled Fluid Antenna Systems: A Two-Stage Approach

IEEE Transactions on Vehicular Technology, 2025

Authors: He, Changpeng; Lu, Yang; Chen, Wei; Ai, Bo; Wong, Kai-Kit; Niyato, Dusit

Affiliations: State Key Laboratory of Advanced Rail Autonomous Operation, China; Beijing Jiaotong University, School of Computer Science and Technology, Beijing 100044, China; Beijing Jiaotong University, School of Electronics and Information Engineering, Beijing 100044, China; University College London, Department of Electronic and Electrical Engineering, London WC1E 6BT, United Kingdom; Yonsei University, Yonsei Frontier Laboratory, School of Integrated Technology, Seoul 03722, Republic of Korea; Nanyang Technological University, College of Computing and Data Science, 639798, Singapore

An emerging fluid antenna system (FAS) brings a new dimension, i.e., the antenna positions, to deal with deep fading, but simultaneously introduces challenges related to the transmit design. This paper proposes an "unsupervised learning to optimize" paradigm to optimize the FAS. In particular, we formulate the sum-rate and energy efficiency (EE) maximization problems for a multiple-user multiple-input single-output (MU-MISO) FAS and solve them with a two-stage graph neural network (GNN), where the first and second stages infer the antenna positions and the beamforming vectors, respectively. The outputs of the two stages are jointly input into an unsupervised loss function to train the two-stage GNN. The numerical results demonstrate the advantages of the FAS for performance improvement and of the two-stage GNN for real-time and scalable optimization. In addition, the two stages can function separately. © 1967-2012 IEEE.

Keywords: Graph neural networks
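
For orientation, the sum-rate objective named in the abstract above can be sketched as follows, assuming the standard MU-MISO signal model with linear beamforming and single-antenna users. In an FAS the channel vectors would depend on the chosen antenna positions; here they are fixed inputs, and the function name, the noise power default, and the toy channels are all hypothetical.

```python
import math

def sum_rate(channels, beams, noise_power=1.0):
    """Sum rate in bit/s/Hz for K single-antenna users.

    channels[k] and beams[k] are equal-length lists of complex numbers:
    user k's channel vector h_k and beamforming vector w_k.
    """
    def gain(h, w):
        # |h^H w|^2: received power of beam w at the user with channel h
        return abs(sum(hi.conjugate() * wi for hi, wi in zip(h, w))) ** 2

    total = 0.0
    for k, h_k in enumerate(channels):
        signal = gain(h_k, beams[k])
        interference = sum(gain(h_k, w_j) for j, w_j in enumerate(beams) if j != k)
        sinr = signal / (interference + noise_power)
        total += math.log2(1.0 + sinr)
    return total

# Toy case: two users, two antennas, orthogonal channels with matched beams,
# so there is no inter-user interference and each user has SINR = 1.
channels = [[1 + 0j, 0j], [0j, 1 + 0j]]
beams = [[1 + 0j, 0j], [0j, 1 + 0j]]
rate = sum_rate(channels, beams)
print(rate)
```

An unsupervised training loss in this spirit would be the negative of such a rate evaluated on the network's outputs, so no labeled optimal solutions are needed; the exact loss used in the paper is not given in the abstract.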

Multi-view clustering guided by unconstrained non-negative matrix factorization

Knowledge-Based Systems, 2023, Vol. 266

Authors: Deng, Ping; Li, Tianrui; Wang, Dexian; Wang, Hongjun; Peng, Hong; Horng, Shi-Jinn

Affiliations: School of Computer and Software Engineering, Xihua University, Chengdu 610039, China; School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu 611756, China; National Engineering Laboratory of Integrated Transportation Big Data Application Technology, Southwest Jiaotong University, Chengdu 611756, China; Manufacturing Industry Chains Collaboration and Information Support Technology Key Laboratory of Sichuan Province, Chengdu 611756, China; Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei 106, Taiwan

Multi-view clustering based on non-negative matrix factorization (NMFMvC) is a well-known method for handling high-dimensional multi-view data. To satisfy the non-negativity constraint of the matrix, NMFMvC is usually solved using the Karush–Kuhn–Tucker (KKT) conditions. However, this optimization method is poorly scalable. To this end, we propose an unconstrained non-negative matrix factorization multi-view clustering (uNMFMvC) model. First, the objective function was constructed by decoupling the elements of the matrix and combining the elements with a non-linear mapping function in a non-negative value domain. The objective function was then optimized using the stochastic gradient descent (SGD) algorithm. Subsequently, three uNMFMvC methods were constructed based on different mapping functions and detailed reasoning was provided. Finally, experiments were conducted on eight public datasets and compared with cutting-edge multi-view clustering methods. The experimental results demonstrate that the proposed model has significant advantages. © 2023 Elsevier B.V.

Keywords: Non-negative matrix factorization
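
One way to read the unconstrained formulation in the abstract above is sketched below, assuming element-wise squaring as the non-negative mapping (the abstract mentions three mapping functions without naming them) and a single data matrix. The multi-view clustering terms are omitted, and every name and hyperparameter here is hypothetical.

```python
import random

def unconstrained_nmf(X, rank, steps=6000, lr=0.02, seed=0):
    """Factor X ~ W H with W = A*A and H = B*B (element-wise squares).

    A and B are unconstrained real matrices, so plain SGD on single
    entries of X can be used while W and H stay non-negative.
    """
    rng = random.Random(seed)
    n, m = len(X), len(X[0])
    A = [[rng.uniform(0.5, 1.0) for _ in range(rank)] for _ in range(n)]
    B = [[rng.uniform(0.5, 1.0) for _ in range(m)] for _ in range(rank)]
    for _ in range(steps):
        i, j = rng.randrange(n), rng.randrange(m)  # sample one entry
        err = sum(A[i][k] ** 2 * B[k][j] ** 2 for k in range(rank)) - X[i][j]
        for k in range(rank):
            a, b = A[i][k], B[k][j]
            # Gradients of err**2 with respect to a and b (chain rule
            # through the squaring map).
            A[i][k] -= lr * 2 * err * 2 * a * b * b
            B[k][j] -= lr * 2 * err * 2 * b * a * a
    W = [[a * a for a in row] for row in A]
    H = [[b * b for b in row] for row in B]
    return W, H

def squared_error(X, W, H):
    """Sum of squared reconstruction errors over all entries of X."""
    return sum(
        (sum(W[i][k] * H[k][j] for k in range(len(H))) - X[i][j]) ** 2
        for i in range(len(X)) for j in range(len(X[0]))
    )

# Small non-negative rank-1 matrix used as toy data.
X = [[1.0, 0.4], [0.5, 0.2], [0.2, 0.08]]
W0, H0 = unconstrained_nmf(X, rank=2, steps=0)   # factors at initialization
W, H = unconstrained_nmf(X, rank=2)
print(squared_error(X, W0, H0), squared_error(X, W, H))
```

Because non-negativity is built into the parameterization, no projection step or multiplier-based update rule is required, which is the scalability argument the abstract makes.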

Energy Efficient Collaborative Federated Learning Design: A Graph Neural Network based Approach

IEEE Conference on Global Communications (GLOBECOM)

Authors: Nuocheng Yang, Sihua Wang, Mingzhe Chen, Christopher G. Brinton, Changchuan Yin

Affiliations: Beijing Laboratory of Advanced Information Network, Beijing University of Posts and Telecommunications, Beijing, China; State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China; Department of Electrical and Computer Engineering, Institute for Data Science and Computing, University of Miami, Coral Gables, FL, USA; School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USA

In this paper, we consider the design of an energy efficient collaborative federated learning (CFL) methodology where devices exchange their local FL parameters with a subset of their neighbors without reliance on a parameter server. In the considered model, mobile devices implement the designed CFL to train their local FL models using their own datasets over a realistic wireless network. Due to limited wireless resources and user movements, each device may not be able to transmit its FL parameters to all neighboring devices. Therefore, each device must select a subset of devices with which to share its FL parameters and optimize its transmit power. This problem is formulated as an optimization problem whose goal is to minimize CFL training energy consumption while satisfying the delay and CFL training loss requirements. To solve this problem, a two-stage solution is proposed. At the first stage, a graph neural network (GNN) based algorithm is proposed, which enables each device to individually determine the subset of devices to which it transmits FL parameters, using its neighboring devices' location and connection information. Compared to standard iterative algorithms that need to iteratively optimize device connections and transmit power, the proposed GNN based method can directly obtain the optimal device connections without iterative optimization. Given the optimal device connections, at the second stage, each device can directly obtain the optimal transmit power. Simulation results show that the proposed algorithm can decrease energy consumption by up to 46% compared to an algorithm in which each device directly connects to its first and second nearest neighbors.

Deep Learning-Based Object Pose Estimation: A Comprehensive Survey

arXiv, 2024

Authors: Liu, Jian; Sun, Wei; Yang, Hui; Zeng, Zhiwen; Liu, Chongpei; Zheng, Jin; Liu, Xingyu; Rahmani, Hossein; Sebe, Nicu; Mian, Ajmal

Affiliations: The National Engineering Research Center for Robot Visual Perception and Control Technology, College of Electrical and Information Engineering, the State Key Laboratory of Advanced Design and Manufacturing for Vehicle Body, Hunan University, Changsha 410082, China; The School of Architecture and Art, Central South University, Changsha 410082, China; The Department of Automation, Tsinghua University, Beijing 100084, China; The School of Computing and Communications, Lancaster University, LA1 4YW, United Kingdom; The Department of Information Engineering and Computer Science, University of Trento, Trento 38123, Italy; The Department of Computer Science, The University of Western Australia, WA 6009, Australia

Object pose estimation is a fundamental computer vision problem with broad applications in augmented reality and robotics. Over the past decade, deep learning models, due to their superior accuracy and robustness, have increasingly supplanted conventional algorithms reliant on engineered point pair features. Nevertheless, several challenges persist in contemporary methods, including their dependency on labeled training data, model compactness, robustness under challenging conditions, and their ability to generalize to novel unseen objects. A recent survey discussing the progress made on different aspects of this area, outstanding challenges, and promising future directions, is missing. To fill this gap, we discuss the recent advances in deep learning-based object pose estimation, covering all three formulations of the problem, i.e., instance-level, category-level, and unseen object pose estimation. Our survey also covers multiple input data modalities, degrees-of-freedom of output poses, object properties, and downstream tasks, providing the readers with a holistic understanding of this field. Additionally, it discusses training paradigms of different domains, inference modes, application areas, evaluation metrics, and benchmark datasets, as well as reports the performance of current state-of-the-art methods on these benchmarks, thereby facilitating the readers in selecting the most suitable method for their application. Finally, the survey identifies key challenges, reviews the prevailing trends along with their pros and cons, and identifies promising directions for future research. We also keep tracing the latest works at Awesome-Object-Pose-Estimation. Copyright © 2024, The Authors. All rights reserved.

Keywords: Benchmarking

Hiding Scrambling Text Message in Speech Signal Based on Lightweight Hyperchaotic Map and Conditional LSB Mechanisms

SSRN, 2023

Authors: Al Sibahee, Mustafa A.; Abduljaleel, Iman Qays; Luo, Chengwen; Zhang, Jin; Huang, Yijing; Abduljabbar, Zaid Ameen; Ma, Junchao

Affiliations: National Engineering Laboratory for Big Data System Computing Technology, Shenzhen Technology University, 518060, China; Department of Computer Science, College of Computer Science and Information Technology, University of Basrah, 61004, Iraq; College of Computer Science and Software Engineering, Shenzhen University, 518060, China; Department of Computer Science, College of Education for Pure Sciences, University of Basrah, Basrah 61004, Iraq; Technical Computer Engineering Department, Al-kunooze University College, Basrah 61001, Iraq; Huazhong University of Science and Technology, Shenzhen Institute, Shenzhen 518118, China; College of Big Data and Internet, Shenzhen Technology University, Shenzhen 518118, China

This study proposes a lightweight and secure audio steganography system for hiding text messages during transmission over the Internet, addressing the excessive computational cost and insufficient security of earlier studies. The paper proposes a two-phase mechanism. In the first phase, text characters are converted to ASCII codes and stored in a vector, which is then divided into three sub-vectors. The sub-vectors are scrambled using two low-complexity operations, namely a forward-backward reading technique and an odd-even index. Two scrambling circuits can be performed: the first on the small sub-vectors and the second on the vector as a whole. In the hiding phase, the speech signal samples are divided into 256 blocks using only 200 values per block, and the low-complexity quadratic and Hénon maps are used to hide the message in the speech signal in a random manner. The method uses conditional LSB as a low-complexity algorithm to identify hidden bits and a special hyperchaotic map algorithm to choose locations randomly. The proposed approach provides good security for a scrambled text message with high SNR and PSNR, small MSE and PESQ, SSIM near one, BER close to zero, and MOS near five, as well as the lowest computational hiding cost. © 2023, The Authors. All rights reserved.

Keywords: Steganography
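
A rough sketch of chaotic-position LSB hiding in the spirit of the abstract above, and not the authors' scheme: plain LSB substitution stands in for the conditional LSB rule, the classic two-dimensional Hénon map (a = 1.4, b = 0.3) stands in for the hyperchaotic map, and the scrambling phase and block structure are skipped. All function names, the key format, and the toy cover signal are hypothetical.

```python
def henon_positions(n, x0=0.1, y0=0.1, a=1.4, b=0.3):
    """Permutation of range(n) obtained by ranking a Henon-map orbit."""
    x, y = x0, y0
    orbit = []
    for _ in range(n):
        x, y = 1.0 - a * x * x + y, b * x
        orbit.append(x)
    return sorted(range(n), key=orbit.__getitem__)

def text_to_bits(text):
    """ASCII text to a flat list of bits, most significant bit first."""
    return [(byte >> shift) & 1
            for byte in text.encode("ascii")
            for shift in range(7, -1, -1)]

def bits_to_text(bits):
    """Inverse of text_to_bits."""
    data = bytes(
        sum(bit << (7 - i) for i, bit in enumerate(bits[k:k + 8]))
        for k in range(0, len(bits), 8)
    )
    return data.decode("ascii")

def embed(samples, text, key=(0.1, 0.1)):
    """Write message bits into the LSBs of key-selected integer samples."""
    bits = text_to_bits(text)
    if len(bits) > len(samples):
        raise ValueError("message too long for cover signal")
    stego = list(samples)
    positions = henon_positions(len(samples), *key)
    for bit, pos in zip(bits, positions):
        stego[pos] = (stego[pos] & ~1) | bit  # clear the LSB, then set it
    return stego

def extract(stego, n_chars, key=(0.1, 0.1)):
    """Read n_chars characters back using the same key."""
    positions = henon_positions(len(stego), *key)
    bits = [stego[pos] & 1 for pos in positions[:8 * n_chars]]
    return bits_to_text(bits)

# Toy cover signal: 256 integer samples in a 16-bit PCM-like range.
cover = [(i * 37) % 2001 - 1000 for i in range(256)]
stego = embed(cover, "hi there")
recovered = extract(stego, len("hi there"))
print(recovered)
```

The initial state of the map acts as the shared key: without it the receiver cannot regenerate the sample order, while each modified sample changes by at most one quantization step.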

Packet Encoding Based on Encrypted Raptor Code for Secure Internet of Vehicles Communication

IEEE Conference on Vehicular Technology (VTC)

Authors: Junzhe Cheng, Dongyang Xu, Gautam Srivastava, Keping Yu

Affiliations: School of Information and Communications Engineering, Xi’an Jiaotong University, Xi’an, China; National Mobile Communications Research Laboratory, Southeast University, China; Department of Computer Science, Brandon University, Brandon, Canada; Research Centre for Interneural Computing, China Medical University, Taichung, Taiwan; Graduate School of Science and Engineering, Hosei University, Tokyo, Japan

The Internet of Vehicles (IoV) industry has developed rapidly in recent years. However, the information security of IoV needs more attention. The use of cross-layer secure transmission technology can improve the security of IoV communication, but the existing cross-layer schemes have some shortcomings. To this end, we propose a packet encoding scheme based on encrypted Raptor codes to improve the secure capacity of IoV communication by utilizing fountain codes and physical-layer low-density parity-check (LDPC) codes. Specifically, we choose Raptor codes, which combine LDPC codes and fountain codes, for secure encoding. With a sparser degree distribution, Raptor codes make decoding faster and more accurate at the legitimate receiver. During transmission, the transmitter encrypts and sends the coding control information corresponding to the packets received by the legitimate receiver, rather than sending the generating matrix directly. We found that confidentiality can be improved by this encryption. The simulation results show that the proposed scheme has higher security than the comparison schemes.

A Multimodal Medical Image Fusion Method Incorporating an Adaptive Attention Mechanism

International Conference on Computer Supported Cooperative Work in Design

Authors: Yingxian Zhang, Yushui Geng, Hu Liang, Hai Zhong

Affiliations: Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Provincial Key Laboratory of Industrial Network and Information System Security, Shandong Fundamental Research Center for Computer Science, Jinan, China; Graduate School, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Department of Radiology, The Second Hospital of Shandong University, Jinan, China

ISBN: (digital) 9798331513054

ISBN: (print) 9798331513061

Multimodal medical image fusion technology provides more comprehensive and accurate image support for clinical diagnosis and treatment by integrating complementary information from different imaging modalities. Aiming at the problem that existing methods are still insufficient in detail feature extraction and inter-modal information fusion, this paper proposes a multimodal medical image fusion method combined with an adaptive attention mechanism. First, we design the Grouped Receptive Field Attentional Convolution (GRFAConv) to solve the problem of insufficient detail feature extraction capability. With the multi-head receptive field adaptive weighting strategy of grouped convolution, the range and weight of the receptive field of the convolution kernel can be adaptively adjusted according to the different demands of local and global features of the image to improve the effect of detail retention. Second, for the problem of information fusion between different modalities, we introduce an improved CBAM attention module in the feature fusion process, which adaptively selects and enhances the features in the key regions through the channel attention and spatial attention mechanisms, which greatly improves the clarity of the fused image details and the accuracy of the information expression in the key regions. Furthermore, experimental results on several medical image datasets show that the algorithm proposed in this paper can generate relatively high-quality fused images. It not only enriches the detailed features of the image, but also achieves significant advantages in several evaluation metrics.

Keywords: Measurement; Attention mechanisms; Accuracy; Convolution; Federated learning; Feature extraction; Clinical diagnosis; Kernel; Medical diagnostic imaging; Image fusion

Towards High-resolution 3D Anomaly Detection via Group-Level Feature Contrastive Learning

arXiv, 2024

Authors: Zhu, Hongze; Xie, Guoyang; Hou, Chengbin; Dai, Tao; Gao, Can; Wang, Jinbao; Shen, Linlin

Affiliations: National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, China; Department of Computer Science, City University of Hong Kong, Hong Kong; Department of Intelligent Manufacturing, CATL, Ningde, China; Fuzhou Fuyao Institute for Advanced Study, Fuyao University of Science and Technology, Fuzhou, China; College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China; Guangdong Provincial Key Laboratory of Intelligent Information Processing, Shenzhen, China; Shenzhen University, Shenzhen, China; Shenzhen Institute of Artificial Intelligence and Robotics for Society, Shenzhen, China

High-resolution point cloud (HRPCD) anomaly detection (AD) plays a critical role in precision machining and high-end equipment manufacturing. Although many 3D-AD methods have been proposed recently, they still cannot meet the requirements of the HRPCD-AD task. There are several challenges: i) It is difficult to directly capture HRPCD information due to the large number of points at the sample level; ii) The advanced transformer-based methods usually obtain anisotropic features, leading to degradation of the representation; iii) The proportion of abnormal areas is very small, which makes them difficult to characterize. To address these challenges, we propose a novel group-level feature-based network, called Group3AD, which has a significantly efficient representation ability. First, we design an Intercluster Uniformity Network (IUN) to present the mapping of different groups in the feature space as several clusters, and obtain a more uniform distribution between clusters representing different parts of the point clouds in the feature space. Then, an Intracluster Alignment Network (IAN) is designed to encourage groups within the cluster to be distributed tightly in the feature space. In addition, we propose an Adaptive Group-Center Selection (AGCS) based on geometric information to improve the pixel density of potential anomalous regions during inference. The experimental results verify the effectiveness of our proposed Group3AD, which surpasses Reg3D-AD by a margin of 5% in terms of object-level AUROC on Real3DAD. We provide the code and supplementary information on our website: https://***/M-3LAB/Group3AD. Copyright © 2024, The Authors. All rights reserved.

Keywords: Anomaly detection
