The rapid evolution of artificial intelligence(AI)technologies has significantly propelled the advancement of the Internet of Vehicles(IoV).With AI support,represented by machine learning technology,vehicles gain the ...
详细信息
The rapid evolution of artificial intelligence(AI)technologies has significantly propelled the advancement of the Internet of Vehicles(IoV).With AI support,represented by machine learning technology,vehicles gain the capability to make intelligent *** a distributed learning paradigm,federated learning(FL)has emerged as a preferred solution in *** to traditional centralized machine learning,FL reduces communication overhead and improves privacy *** these benefits,FL still faces some security and privacy concerns,such as poisoning attacks and inference attacks,prompting exploration into blockchain integration to enhance its security *** paper introduces a novel blockchain-enabled federated learning(BCFL)scheme with differential privacy(DP)tailored for *** order to meet the performance demanding IoV environment,the proposed methodology integrates a consortium blockchain with Practical Byzantine Fault Tolerance(PBFT)consensus,which offers superior efficiency over the conventional public *** addition,the proposed approach utilizes the Differentially Private Stochastic Gradient Descent(DP-SGD)algorithm in the local training process of FL for enhanced privacy *** results indicate that the integration of blockchain elevates the security level of FL in that the proposed approach effectively safeguards FL against poisoning *** the other hand,the additional overhead associated with blockchain integration is also limited to a moderate level to meet the efficiency criteria of ***,by incorporating DP,the proposed approach is shown to have the(ε-δ)privacy guarantee while maintaining an acceptable level of model *** enhancement effectively mitigates the threat of inference attacks on private information.
The development of the industrial Internet of Things and smart grid networks has emphasized the importance of secure smart grid communication for the future of electric power transmission. However, the current deploym...
详细信息
With the increasing scale of industrial equipments, delay and energy consumption have emerged as critical concerns within the Industrial Internet of Things (Industrial IoT). Mobile edge computing (MEC) offloads tasks ...
详细信息
Topology is usually perceived intrinsically immutable for a given *** argue that optical topologies do not immediately enjoy such ***'optical skyrmions'as an example,we show that they will exhibit varying text...
详细信息
Topology is usually perceived intrinsically immutable for a given *** argue that optical topologies do not immediately enjoy such ***'optical skyrmions'as an example,we show that they will exhibit varying textures and topological invariants(skyrmion numbers),depending on how to construct the skyrmion vector when projecting from real to parameter *** demonstrate the fragility of optical skyrmions under a ubiquitous scenario-simple reflection off an optical *** topology is not without benefit,but it must not be assumed.
Automatic crack detection of cement pavement chiefly benefits from the rapid development of deep learning,with convolutional neural networks(CNN)playing an important role in this ***,as the performance of crack detect...
详细信息
Automatic crack detection of cement pavement chiefly benefits from the rapid development of deep learning,with convolutional neural networks(CNN)playing an important role in this ***,as the performance of crack detection in cement pavement improves,the depth and width of the network structure are significantly increased,which necessitates more computing power and storage *** limitation hampers the practical implementation of crack detection models on various platforms,particularly portable devices like small mobile *** solve these problems,we propose a dual-encoder-based network architecture that focuses on extracting more comprehensive fracture feature information and combines cross-fusion modules and coordinated attention mechanisms formore efficient feature ***,we use small channel convolution to construct shallow feature extractionmodule(SFEM)to extract low-level feature information of cracks in cement pavement images,in order to obtainmore information about cracks in the shallowfeatures of *** addition,we construct large kernel atrous convolution(LKAC)to enhance crack information,which incorporates coordination attention mechanism for non-crack information filtering,and large kernel atrous convolution with different cores,using different receptive fields to extract more detailed edge and context ***,the three-stage feature map outputs from the shallow feature extraction module is cross-fused with the two-stage feature map outputs from the large kernel atrous convolution module,and the shallow feature and detailed edge feature are fully fused to obtain the final crack prediction *** evaluate our method on three public crack datasets:DeepCrack,CFD,and *** results on theDeepCrack dataset demonstrate the effectiveness of our proposed method compared to state-of-the-art crack detection methods,which achieves Precision(P)87.2%,Recall(R)87.7%,and F-score(F1)87.4%.Thanks to our lightweight cr
Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination,hindering accurate three-dimensional lesion reconstruction by surgical *** study prop...
详细信息
Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination,hindering accurate three-dimensional lesion reconstruction by surgical *** study proposes a novel end-to-end disparity estimation model to address these *** approach combines a Pseudo-Siamese neural network architecture with pyramid dilated convolutions,integrating multi-scale image information to enhance robustness against lighting *** study introduces a Pseudo-Siamese structure-based disparity regression model that simplifies left-right image comparison,improving accuracy and *** model was evaluated using a dataset of stereo endoscopic videos captured by the Da Vinci surgical robot,comprising simulated silicone heart sequences and real heart video *** results demonstrate significant improvement in the network’s resistance to lighting interference without substantially increasing ***,the model exhibited faster convergence during training,contributing to overall performance *** study advances endoscopic image processing accuracy and has potential implications for surgical robot applications in complex environments.
Considering the problems of the limited energy in wireless multi-media sensor networks (WMSNs) and the focused regions discontinuity of the fused image obtained using traditional multi-scale analysis tools (MST)-based...
详细信息
Unified information extraction (UIE) aims to extract diverse structured information from unstructured text. While large language models (LLMs) have shown promise for UIE, they require significant computational resourc...
详细信息
In vehicular edge computing, both edge and cloud can provide computing services (i.e., tasks). The edge can reduce vehicular task delay by processing data nearby, but is resource-constrained and cannot handle too many...
详细信息
Human-centric Emotional Video Captioning (H-EVC) aims to generate fine-grained, emotion-related sentences for human-based videos, enhancing the understanding of human emotions and facilitating human-computer emotional...
详细信息
Human-centric Emotional Video Captioning (H-EVC) aims to generate fine-grained, emotion-related sentences for human-based videos, enhancing the understanding of human emotions and facilitating human-computer emotional interaction. However, existing video captioning methods primarily focus on overall event content, often overlooking sufficient subtle emotional clues and interactions in videos. As a result, the generated captions frequently lack emotional information. To address this, we propose a novel Emotion-oriented Cross-modal Prompting and Alignment (ECPA) approach for large foundation models to enhance H-EVC accuracy by effectively modeling fine-grained visual-textual emotion clues and interactions. Using large foundation models, our ECPA introduces two learnable prompting strategies: visual emotion prompting (VEP) and textual emotion prompting (TEP), as well as an emotion-oriented cross-modal alignment (ECA) module. In VEP, we develop two-level learnable visual prompts, i.e., emotion recognition (ER)-level and action unit (AU)-level prompting, to assist pre-trained vision-language foundation models to attend to both coarse and fine emotion-related visual information in videos. In TEP, we correspondingly devise two-level learnable textual prompts, i.e., sentence-level emotional tokens, and word-level masked tokens, for obtaining both whole and local textual prompt representations related to emotions. To further facilitate the interaction and alignment of visual-textual emotion prompt representations, our ECA introduces another two levels of emotion-oriented prompt alignment learning mechanisms: the ER-sentence level and the AU-word level alignment losses. Both enhance the model's ability to capture and integrate both global and local cross-modal emotion semantics, thereby enabling the generation of fine-grained emotional linguistic descriptions in video captioning. Extensive experiments not only demonstrate that our ECPA outperforms existing state-of-the-art ap
暂无评论