检索结果-内蒙古大学图书馆

IAENG International Journal of computer science 2024年第6期51卷 572-581页

作者： Ren, Jiaxin Cui, Wenhua Tao, Ye Shi, Tianwei School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan China

Safety equipment detection is an important application of object detection, receiving widespread attention in fields such as smart construction sites and video surveillance. Significant progress has been made in object detection due to the rapid development of deep learning. Multi-scale targets and complex scenes increase the likelihood of false positives and missed detections, which can affect the accuracy of the detection. To address this issue, this study proposes YOLOv7-DSE. It is a small complex target scene detection network based on the improved YOLOv7. Also, we have created a private dataset of safety equipment. First, we enhanced the ELAN and MP backbone networks. Backbone is replaced by ordinary convolution by the depthwise separable convolution. We enabled the backbone network to extract deeper image features without increasing the amount of parameters and computation. Simultaneously, the model incorporates the EIOU loss function to improve its convergence speed and positioning effect. Secondly, we propose a new ELAN-SPD structure in the head network. Based on the ELAN structure, a space-to-depth convolutional layer is added to fully downsample the feature map, preserving all learnable features. Our network model can better detect objects with significant size differences faced with complex scenes. YOLOv7-DSE achieved the mAP of 82.38%, surpassing the original YOLOv7 with 2.64%. The YOLOv7-DSE model has a minor size compared to the baseline model. Our improvement has reduced the model parameters by 22.4%. © (2024), (International Association of Engineers). All rights reserved.

关键词： Object detection

来源：评论

学校读者我要写书评

暂无评论

Robust Transmission Design for Federated Learning Through Over-the-Air Computation

引用

China Communications 2025年第3期22卷 65-75页

作者： Hamideh Zamanpour Abyaneh Saba Asaad Amir Masoud Rabiei School of Electrical and Computer Engineering College of EngineeringUniversity of TehranTehranIran Department of Electrical Engineering and Computer Science York UniversityCanada

Over-the-air computation(AirComp)enables federated learning(FL)to rapidly aggregate local models at the central server using waveform superposition property of wireless *** this paper,a robust transmission scheme for an AirCompbased FL system with imperfect channel state information(CSI)is *** model CSI uncertainty,an expectation-based error model is *** main objective is to maximize the number of selected devices that meet mean-squared error(MSE)requirements for model broadcast and model *** problem is formulated as a combinatorial optimization problem and is solved in two ***,the priority order of devices is determined by a sparsity-inducing ***,a feasibility detection scheme is used to select the maximum number of devices to guarantee that the MSE requirements are *** alternating optimization(AO)scheme is used to transform the resulting nonconvex problem into two convex *** results illustrate the effectiveness and robustness of the proposed scheme.

关键词： federated learning imperfect CSI optimization over-the-air computing robust design

来源：评论

学校读者我要写书评

暂无评论

Feature-Grounded Single-Stage Text-to-Image Generation

引用

Tsinghua science and Technology 2024年第2期29卷 469-480页

作者： Yuan Zhou Peng Wang Lei Xiang Haofeng Zhang School of Artificial Intelligence Nanjing University of Information Science and TechnologyNanjing 210044China School of Computer Science and Engineering Nanjing University of Science and TechnologyNanjing 210094China

Recently,Generative Adversarial Networks(GANs)have become the mainstream text-to-image(T2I)***,a standard normal distribution noise of inputs cannot provide sufficient information to synthesize an image that approaches the ground-truth image ***,the multistage generation strategy results in complex T2I ***,this study proposes a novel feature-grounded single-stage T2I model,which considers the“real”distribution learned from training images as one input and introduces a worst-case-optimized similarity measure into the loss function to enhance the model's generation *** results on two benchmark datasets demonstrate the competitive performance of the proposed model in terms of the Frechet inception distance and inception score compared to those of some classical and state-of-the-art models,showing the improved similarities among the generated image,text,and ground truth.

关键词： text-to-image(T2I) feature-grounded single-stage generation Generative Adversarial Network(GAN)

来源：评论

学校读者我要写书评

暂无评论

AInvR:Adaptive Learning Rewards for Knowledge Graph Reasoning Using Agent Trajectories

引用

Tsinghua science and Technology 2023年第6期28卷 1101-1114页

作者： Hao Zhang Guoming Lu Ke Qin Kai Du School of Computer Science and Engineering University of Electronic Science and Technology of ChinaChengdu 611731China

Multi-hop reasoning for incomplete Knowledge Graphs(KGs)demonstrates excellent interpretability with decent *** Learning(RL)based approaches formulate multi-hop reasoning as a typical sequential decision *** intractable shortcoming of multi-hop reasoning with RL is that sparse reward signals make performance *** mainstream methods apply heuristic reward functions to counter this ***,the inaccurate rewards caused by heuristic functions guide the agent to improper inference paths and unrelated object *** this end,we propose a novel adaptive Inverse Reinforcement Learning(IRL)framework for multi-hop reasoning,called AInvR.(1)To counter the missing and spurious paths,we replace the heuristic rule rewards with an adaptive rule reward learning mechanism based on agent’s inference trajectories;(2)to alleviate the impact of over-rewarded object entities misled by inaccurate reward shaping and rules,we propose an adaptive negative hit reward learning mechanism based on agent’s sampling strategy;(3)to further explore diverse paths and mitigate the influence of missing facts,we design a reward dropout mechanism to randomly mask and perturb reward parameters for the reward learning *** results on several benchmark knowledge graphs demonstrate that our method is more effective than existing multi-hop approaches.

关键词： Knowledge Graph Reasoning(KGR) Inverse Reinforcement Learning(IRL) multi-hop reasoning

来源：评论

学校读者我要写书评

暂无评论

From Extraction to Generation: Multimodal Emotion-Cause Pair Generation in Conversations

引用

IEEE Transactions on Affective Computing 2024年第2期16卷 1-12页

作者： Ma, Heqing Yu, Jianfei Wang, Fanfan Cao, Hanyu Xia, Rui School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing Jiangsu China

As an important task in emotion analysis, Multimodal Emotion-Cause Pair Extraction in conversations (MECPE) aims to extract all the emotion-cause utterance pairs from a conversation. However, there are two shortcomings in the MECPE task: 1) it ignores emotion utterances whose causes cannot be located in the conversation but require contextualized inference;2) it fails to locate the exact causes that occur in vision or audio modalities beyond text. To address these issues, in this paper, we introduce a new task named Multimodal Emotion-Cause Pair Generation in Conversations (MECPG), which aims to identify the emotion utterances with their emotion categories and generate their corresponding causes in a conversation. To tackle the MECPG task, we construct a dataset based on a benchmark corpus for MECPE. We further propose a generative framework named MONICA, which jointly performs emotion recognition and emotion cause generation with a sequence-to-sequence model. Experiments on our annotated dataset show the superiority of MONICA over several competitive systems. Our dataset and source codes will be publicly released. IEEE

关键词： Emotion Recognition

来源：评论

学校读者我要写书评

暂无评论

A Deepfake Detection Algorithm Based on Fourier Transform of Biological Signal

引用

computers, Materials & Continua 2024年第6期79卷 5295-5312页

作者： Yin Ni Wu Zeng Peng Xia Guang Stanley Yang Ruochen Tan School of Electrical and Electronic Engineering Wuhan Polytechnic UniversityWuhan430023China School of Mathematics and Computer Science Wuhan Polytechnic UniversityWuhan430048China Paul G.Allen School of Computer Science and Engineering University ofWashingtonSeattleWA98195USA School of Computer Science and Engineering University of CaliforniaSanDiegoCA92093USA

Deepfake-generated fake faces,commonly utilized in identity-related activities such as political propaganda,celebrity impersonations,evidence forgery,and familiar fraud,pose new societal *** current deepfake generators strive for high realism in visual effects,they do not replicate biometric signals indicative of cardiac *** this gap,many researchers have developed detection methods focusing on biometric *** methods utilize classification networks to analyze both temporal and spectral domain features of the remote photoplethysmography(rPPG)signal,resulting in high detection ***,in the spectral analysis,existing approaches often only consider the power spectral density and neglect the amplitude spectrum—both crucial for assessing cardiac *** introduce a novel method that extracts rPPG signals from multiple regions of interest through remote photoplethysmography and processes them using Fast Fourier Transform(FFT).The resultant time-frequency domain signal samples are organized into matrices to create Matrix Visualization Heatmaps(MVHM),which are then utilized to train an image classification ***,we explored various combinations of time-frequency domain representations of rPPG signals and the impact of attention *** experimental results show that our algorithm achieves a remarkable detection accuracy of 99.22%in identifying fake videos,significantly outperforming mainstream algorithms and demonstrating the effectiveness of Fourier Transform and attention mechanisms in detecting fake faces.

关键词： Deepfake detector remote photoplethysmography fast fourier transform spatial attention mechanism

来源：评论

学校读者我要写书评

暂无评论

Dynamic Hand Gesture-Based Person Identification Using Leap Motion and Machine Learning Approaches

引用

computers, Materials & Continua 2024年第4期79卷 1205-1222页

作者： Jungpil Shin Md.AlMehedi Hasan Md.Maniruzzaman Taiki Watanabe Issei Jozume School of Computer Science and Engineering The University of AizuAizuwakamatsuFukushima965-8580Japan Department of Computer Science&Engineering Rajshahi University of Engineering&TechnologyRajshahi6204Bangladesh

Person identification is one of the most vital tasks for network security. People are more concerned about theirsecurity due to traditional passwords becoming weaker or leaking in various attacks. In recent decades, fingerprintsand faces have been widely used for person identification, which has the risk of information leakage as a resultof reproducing fingers or faces by taking a snapshot. Recently, people have focused on creating an identifiablepattern, which will not be reproducible falsely by capturing psychological and behavioral information of a personusing vision and sensor-based techniques. In existing studies, most of the researchers used very complex patternsin this direction, which need special training and attention to remember the patterns and failed to capturethe psychological and behavioral information of a person properly. To overcome these problems, this researchdevised a novel dynamic hand gesture-based person identification system using a Leap Motion sensor. Thisstudy developed two hand gesture-based pattern datasets for performing the experiments, which contained morethan 500 samples, collected from 25 subjects. Various static and dynamic features were extracted from the handgeometry. Randomforest was used to measure feature importance using the Gini Index. Finally, the support vectormachinewas implemented for person identification and evaluate its performance using identification accuracy. Theexperimental results showed that the proposed system produced an identification accuracy of 99.8% for arbitraryhand gesture-based patterns and 99.6% for the same dynamic hand gesture-based patterns. This result indicatedthat the proposed system can be used for person identification in the field of security.

关键词： Person identification leap motion hand gesture random forest support vector machine

来源：评论

学校读者我要写书评

暂无评论

A Privacy-Preserving Data Aggregation Protocol for Internet of Vehicles with Federated Learning

IEEE Transactions on Intelligent Vehicles

引用

IEEE Transactions on Intelligent Vehicles 2024年 1-11页

作者： Xu, Zisang Zhang, Ruirui Liang, Wei Li, Kuan-Ching Gu, Ke Li, Xiong Huang, Jialun Computer and Communication Engineer Institute Changsha University of Science and Technology Changsha China School of Computer Science and Engineering Hunan University of Science and Technology Xiangtan China Department of Computer Science and Information Engineering Providence University Taichung Taiwan Institute for Cyber Security School of Computer Science and Engineering University of Electronic Science and Technology of China Chengdu China

Federated learning (FL) is widely used in various fields because it can guarantee the privacy of the original data source. However, in data-sensitive fields such as Internet of Vehicles (IoV), insecure communication channels, semi-trusted RoadSide Unit (RSU), and collusion between vehicles and the RSU may lead to leakage of model parameters. Moreover, when aggregating data, since different vehicles usually have different computing resources, vehicles with relatively insufficient computing resources will affect the data aggregation efficiency. Therefore, in order to solve the privacy leakage problem and improve the data aggregation efficiency, this paper proposes a privacy-preserving data aggregation protocol for IoV with FL. Firstly, the protocol is designed based on methods such as shamir secret sharing scheme, pallier homomorphic encryption scheme and blinding factor protection, which can guarantee the privacy of model parameters. Secondly, the protocol improves the data aggregation efficiency by setting dynamic training time windows. Thirdly, the protocol reduces the frequent participations of Trusted Authority (TA) by optimizing the fault-tolerance mechanism. Finally, the security analysis proves that the proposed protocol is secure, and the performance analysis results also show that the proposed protocol has high computation and communication efficiency. IEEE

关键词： Vehicles

来源：评论

学校读者我要写书评

暂无评论

RoleNet: A multiple features fusion network for role classification in cantonese opera

引用

Multimedia Tools and Applications 2025年 1-14页

作者： Li, Yue Peng, Zhengwei Xu, Di Chen, Yuanguang Chen, Guoan School of Computer Science and Engineering South China University of Technology Guangdong Guangzhou510006 China School of Computer Science and Engineering Sun Yat-sen University Guangdong Guangzhou510006 China

Cantonese opera, a key facet of Chinese traditional opera, boasts profound cultural and artistic value and has been designated as intangible cultural heritage. The use of certain roles is a basic concept in Cantonese opera, where each role has a specific style of singing, movement, and costume that performers are trained to perform throughout their careers. Therefore, identifying the role category of characters in a play can provide theoretical and systematic foundations for further researches and artistic explorations. By dissecting musical traits and performance styles of each role, comprehensive studies on its regional and artistic nuances are enabled. To achieve role classification in Cantonese opera, we propose RoleNet, an integration network that consists of SincNets, transformers, and a feature fusion block. For a given musical fragment (e.g., an audio signal), SincNets extract 1D features at multiple scales and transformers extract features from time and frequency axes from the 2D Mel Spectrogram. Subsequently, the extracted features are concatenated by the fusioner using multiple features selection strategy to perform role classification tasks. The experimental results on a real-world dataset demonstrated the superior performance of RoleNet compared to single-objective methods. An overall classification accuracy of 98.67% was achieved. The codes are available at https://***/AaronPeng920/RoleNet. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2025.

关键词： Feature Selection

来源：评论

学校读者我要写书评

暂无评论

Towards kernelizing the classifier for hyperbolic data

引用

Frontiers of computer science 2024年第1期18卷 17-31页

作者： Meimei YANG Qiao LIU Xinkai SUN Na SHI Hui XUE School of Computer Science and Engineering Southeast UniversityNanjing 210096China MOE Key Laboratory of Computer Science and Information Integration(Southeast University) Nanjing 210096China

Data hierarchy,as a hidden property of data structure,exists in a wide range of machine learning applications.A common practice to classify such hierarchical data is first to encode the data in the Euclidean space,and then train a Euclidean ***,such a paradigm leads to a performance drop due to distortion of data embedding in the Euclidean *** relieve this issue,hyperbolic geometry is investigated as an alternative space to encode the hierarchical data for its higher ability to capture the hierarchical *** methods cannot explore the full potential of the hyperbolic geometry,in the sense that such methods define the hyperbolic operations in the tangent plane,causing the distortion of data *** this paper,we develop two novel kernel formulations in the hyperbolic space,with one being positive definite(PD)and another one being indefinite,to solve the classification tasks in hyperbolic *** PD one is defined via mapping the hyperbolic data to the Drury-Arveson(DA)space,which is a special reproducing kernel Hilbert space(RKHS).To further increase the discrimination of the classifier,an indefinite kernel is further defined in the Krein ***,we design a 2-layer nested indefinite kernel which first maps hyperbolic data into the DA spaces,followed by a mapping from the DA spaces to the Krein *** experiments on real-world datasets demonstrate the superiority ofthe proposed kernels.

关键词： data hierarchy hyperbolic cgeometry drury-arveson space krein space

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：