In this paper, we address the trajectory planning problem in uncertain nonconvex static and dynamic environments that contain obstacles with probabilistic location, size, and geometry. To address this problem, we prov...
详细信息
The widespread availability of digital multimedia data has led to a new challenge in digital *** source camera identification algorithms usually rely on various traces in the capturing ***,these traces have become inc...
详细信息
The widespread availability of digital multimedia data has led to a new challenge in digital *** source camera identification algorithms usually rely on various traces in the capturing ***,these traces have become increasingly difficult to extract due to wide availability of various image processing *** Neural Networks(CNN)-based algorithms have demonstrated good discriminative capabilities for different brands and even different models of camera ***,their performances is not ideal in case of distinguishing between individual devices of the same model,because cameras of the same model typically use the same optical lens,image sensor,and image processing algorithms,that result in minimal overall *** this paper,we propose a camera forensics algorithm based on multi-scale feature fusion to address these *** proposed algorithm extracts different local features from feature maps of different scales and then fuses them to obtain a comprehensive feature *** representation is then fed into a subsequent camera fingerprint classification *** upon the Swin-T network,we utilize Transformer Blocks and Graph Convolutional Network(GCN)modules to fuse multi-scale features from different stages of the backbone ***,we conduct experiments on established datasets to demonstrate the feasibility and effectiveness of the proposed approach.
Flexible polymer-based foam sensors have significant potential for application in wearable electronics and motion monitoring. However, these prospects are hindered by the complex and unenvironmentally friendly manufac...
Flexible polymer-based foam sensors have significant potential for application in wearable electronics and motion monitoring. However, these prospects are hindered by the complex and unenvironmentally friendly manufacturing processes. In this study, we employed melt blending and supercritical carbon dioxide foaming to fabricate an ethylene-vinyl acetate copolymer(EVA)/low-density polyethylene(LDPE)/carbon nanotube(CNT) piezoresistive foam sensor. The cross-linking agent bis(tert-butyldioxyisopropyl) benzene and the conductive filler CNT were incorporated into the EVA/LDPE composite, successfully achieving a chemically cross-linked and physically entangled composite structure that significantly enhanced the storage modulus and complex viscosity. Additionally, the compressive strength of EVA/LDPE/CNT foam with 10 parts per hundred rubber(phr) CNT reached 1.37 MPa at 50% compression, marking a 340% increase compared to the 0.31 MPa of the CNT-free sample. Furthermore, the EVA/LDPE/CNT composite foams, which incorporated 10 phr CNT, were prepared under specific foaming conditions, resulting in an ultra-low density of 0.11 g/cm3and a higher sensitivity, with a gauge factor of –2.3. The piezoresistive foam sensors developed in this work could accurately detect human motion, thereby expanding their applications in the field of piezoresistive foam sensors and providing an effective strategy for the advancement of high-performance piezoresistive foam sensors.
Recently, pre-trained language models (PLMs) have been significantly improved for downstream tasks by infusing knowledge. In the field of medical research, with the continuous updating and increasing of data, PLM ofte...
详细信息
Communication strongly influences attitudes on climate change. Within sponsored communication, high spend and high reach advertising dominates. In the advertising ecosystem we can distinguish actors with adversarial s...
详细信息
The exceptionally high bandwidth requirements associated with the delivery of live 360° video content pose significant challenges in the current network context. An avenue for addressing this bandwidth challenge ...
详细信息
The exceptionally high bandwidth requirements associated with the delivery of live 360° video content pose significant challenges in the current network context. An avenue for addressing this bandwidth challenge is to use the limited network resources for sending the user's Field-of-View (FoV) tiles at a high resolution, instead of transmitting all frame components at high quality. However, precisely forecasting the FoV for 360° live video content distribution remains a complex endeavor due to the lack of pre-knowledge on user viewing behaviors. In this paper, we present GL360, a novel 360° transmission framework, which employs Graph Representation Learning for FoV prediction. First, we analyze the interaction between users and tiles in panoramic videos utilizing a dynamic heterogeneous Relational Graph Convolutional Network (RGCN), which facilitates efficient user and tile embedding representation learning. Secondly, we propose an online dynamic heterogeneous graph learning (DHGL)-based algorithm to dynamically capture the time-varying features of the user's viewing behaviors with limited prior knowledge. Further, we design a FoV-aware content delivery algorithm that allows the edge servers to determine the video tiles' resolution for each accessed user. Experimental results based on real traces demonstrate how our solution outperforms four other solutions in terms of FoV prediction and network performance IEEE
Retinal images play an essential role in the early diagnosis of ophthalmic *** segmentation of retinal vessels in color fundus images is challenging due to the morphological differences between the retinal vessels and...
详细信息
Retinal images play an essential role in the early diagnosis of ophthalmic *** segmentation of retinal vessels in color fundus images is challenging due to the morphological differences between the retinal vessels and the low-contrast *** the same time,automated models struggle to capture representative and discriminative retinal vascular *** fully utilize the structural information of the retinal blood vessels,we propose a novel deep learning network called Pre-Activated Convolution Residual and Triple Attention Mechanism Network(PCRTAM-Net).PCRTAM-Net uses the pre-activated dropout convolution residual method to improve the feature learning ability of the *** addition,the residual atrous convolution spatial pyramid is integrated into both ends of the network encoder to extract multiscale information and improve blood vessel information flow.A triple attention mechanism is proposed to extract the structural information between vessel contexts and to learn long-range feature *** evaluate the proposed PCRTAM-Net on four publicly available datasets,DRIVE,CHASE_DB1,STARE,and *** model achieves state-of-the-art performance of 97.10%,97.70%,97.68%,and 97.14%for ACC and 83.05%,82.26%,84.64%,and 81.16%for F1,respectively.
Financial transaction systems have become the critical backbone of modern society, and the sharp increase in fraudulent transactions has become an unavoidable significant topic. Their presence poses a severe threat to...
详细信息
We apply implicit neural representations—which naturally capture spectral regularity—to reconstruct color Fourier ptychographic microscopy images from spectrally-sparse measurements. We conduct experiments on real-w...
This paper studies the challenging task of makeup transfer, which aims to apply diverse makeup styles precisely and naturally to a given facial image. Due to the absence of paired data, current methods typically synth...
ISBN:
(纸本)9798331314385
This paper studies the challenging task of makeup transfer, which aims to apply diverse makeup styles precisely and naturally to a given facial image. Due to the absence of paired data, current methods typically synthesize sub-optimal pseudo ground truths to guide the model training, resulting in low makeup fidelity. Additionally, different makeup styles generally have varying effects on the person face, but existing methods struggle to deal with this diversity. To address these issues, we propose a novel Self-supervised Hierarchical Makeup Transfer (SHMT) method via latent diffusion models. Following a "decoupling-and-reconstruction" paradigm, SHMT works in a self-supervised manner, freeing itself from the misguidance of imprecise pseudo-paired data. Furthermore, to accommodate a variety of makeup styles, hierarchical texture details are decomposed via a Laplacian pyramid and selectively introduced to the content representation. Finally, we design a novel Iterative Dual Alignment (IDA) module that dynamically adjusts the injection condition of the diffusion model, allowing the alignment errors caused by the domain gap between content and makeup representations to be corrected. Extensive quantitative and qualitative analyses demonstrate the effectiveness of our method. Our code is available at https://***/Snowfallingplum/SHMT.
暂无评论