Cervical cell segmentation is a significant task in medical image analysis and can be used for screening various cervical diseases. In recent years, substantial progress has been made in cervical cell segmentation tec...
详细信息
Existing lip synchronization(lip-sync)methods generate accurately synchronized mouths and faces in a generated ***,they still confront the problem of artifacts in regions of non-interest(RONI),e.g.,background and othe...
详细信息
Existing lip synchronization(lip-sync)methods generate accurately synchronized mouths and faces in a generated ***,they still confront the problem of artifacts in regions of non-interest(RONI),e.g.,background and other parts of a face,which decreases the overall visual *** solve these problems,we innovatively introduce diverse image inpainting to lip-sync *** propose Modulated Inpainting Lip-sync GAN(MILG),an audio-constraint inpainting network to predict synchronous *** utilizes prior knowledge of RONI and audio sequences to predict lip shape instead of image generation,which can keep the RONI ***,we integrate modulated spatially probabilistic diversity normalization(MSPD Norm)in our inpainting network,which helps the network generate fine-grained diverse mouth movements guided by the continuous audio ***,to lower the training overhead,we modify the contrastive loss in lipsync to support small-batch-size and few-sample *** experiments demonstrate that our approach outperforms the existing state-of-the-art of image quality and authenticity while keeping lip-sync.
The rapid development of 5G/6G and AI enables an environment of Internet of Everything(IoE)which can support millions of connected mobile devices and applications to operate smoothly at high speed and low ***,these ma...
详细信息
The rapid development of 5G/6G and AI enables an environment of Internet of Everything(IoE)which can support millions of connected mobile devices and applications to operate smoothly at high speed and low ***,these massive devices will lead to explosive traffic growth,which in turn cause great burden for the data transmission and content *** challenge can be eased by sinking some critical content from cloud to *** this case,how to determine the critical content,where to sink and how to access the content correctly and efficiently become new *** work focuses on establishing a highly efficient content delivery framework in the IoE *** particular,the IoE environment is re-constructed as an end-edge-cloud collaborative system,in which the concept of digital twin is applied to promote the *** on the digital asset obtained by digital twin from end users,a content popularity prediction scheme is firstly proposed to decide the critical content by using the Temporal Pattern Attention(TPA)enabled Long Short-Term Memory(LSTM)***,the prediction results are input for the proposed caching scheme to decide where to sink the critical content by using the Reinforce Learning(RL)***,a collaborative routing scheme is proposed to determine the way to access the content with the objective of minimizing *** experimental results indicate that the proposed schemes outperform the state-of-the-art benchmarks in terms of the caching hit rate,the average throughput,the successful content delivery rate and the average routing overhead.
Drug-target interactions(DTIs) prediction plays an important role in the process of drug *** computational methods treat it as a binary prediction problem, determining whether there are connections between drugs and t...
详细信息
Drug-target interactions(DTIs) prediction plays an important role in the process of drug *** computational methods treat it as a binary prediction problem, determining whether there are connections between drugs and targets while ignoring relational types information. Considering the positive or negative effects of DTIs will facilitate the study on comprehensive mechanisms of multiple drugs on a common target, in this work, we model DTIs on signed heterogeneous networks, through categorizing interaction patterns of DTIs and additionally extracting interactions within drug pairs and target protein pairs. We propose signed heterogeneous graph neural networks(SHGNNs), further put forward an end-to-end framework for signed DTIs prediction, called SHGNN-DTI,which not only adapts to signed bipartite networks, but also could naturally incorporate auxiliary information from drug-drug interactions(DDIs) and protein-protein interactions(PPIs). For the framework, we solve the message passing and aggregation problem on signed DTI networks, and consider different training modes on the whole networks consisting of DTIs, DDIs and PPIs. Experiments are conducted on two datasets extracted from Drug Bank and related databases, under different settings of initial inputs, embedding dimensions and training modes. The prediction results show excellent performance in terms of metric indicators, and the feasibility is further verified by the case study with two drugs on breast cancer.
Bat Algorithm (BA) is a nature-inspired metaheuristic search algorithm designed to efficiently explore complex problem spaces and find near-optimal solutions. The algorithm is inspired by the echolocation behavior of ...
详细信息
Long-term urban traffic flow prediction is an important task in the field of intelligent transportation,as it can help optimize traffic management and improve travel *** improve prediction accuracy,a crucial issue is ...
详细信息
Long-term urban traffic flow prediction is an important task in the field of intelligent transportation,as it can help optimize traffic management and improve travel *** improve prediction accuracy,a crucial issue is how to model spatiotemporal dependency in urban traffic *** recent years,many studies have adopted spatiotemporal neural networks to extract key information from traffic ***,most models ignore the semantic spatial similarity between long-distance areas when mining spatial *** also ignore the impact of predicted time steps on the next unpredicted time step for making long-term ***,these models lack a comprehensive data embedding process to represent complex spatiotemporal *** paper proposes a multi-scale persistent spatiotemporal transformer(MSPSTT)model to perform accurate long-term traffic flow prediction in *** adopts an encoder-decoder structure and incorporates temporal,periodic,and spatial features to fully embed urban traffic data to address these *** model consists of a spatiotemporal encoder and a spatiotemporal decoder,which rely on temporal,geospatial,and semantic space multi-head attention modules to dynamically extract temporal,geospatial,and semantic *** spatiotemporal decoder combines the context information provided by the encoder,integrates the predicted time step information,and is iteratively updated to learn the correlation between different time steps in the broader time range to improve the model’s accuracy for long-term *** on four public transportation datasets demonstrate that MSPSTT outperforms the existing models by up to 9.5%on three common metrics.
State-of-the-art recommender systems are increasingly focused on optimizing implementation efficiency, such as enabling on-device recommendations under memory constraints. Current methods commonly use lightweight embe...
详细信息
State-of-the-art recommender systems are increasingly focused on optimizing implementation efficiency, such as enabling on-device recommendations under memory constraints. Current methods commonly use lightweight embeddings for users and items or employ compact embeddings to enhance reusability and reduce memory usage. However, these approaches consider only the coarse-grained aspects of embeddings, overlooking subtle semantic nuances. This limitation results in an adversarial degradation of meta-embedding performance, impeding the system's ability to capture intricate relationships between users and items, leading to suboptimal recommendations. To address this, we propose a novel approach to efficiently learn meta-embeddings with varying grained and apply fine-grained meta-embeddings to strengthen the representation of their coarse-grained counterparts. Specifically, we introduce a recommender system based on a graph neural network, where each user and item is represented as a node. These nodes are directly connected to coarse-grained virtual nodes and indirectly linked to fine-grained virtual nodes, facilitating learning of multi-grained semantics. Fine-grained semantics are captured through sparse meta-embeddings, which dynamically balance embedding uniqueness and memory constraints. To ensure their sparseness, we rely on initialization methods such as sparse principal component analysis combined with a soft thresholding activation function. Moreover, we propose a weight-bridging update strategy that aligns coarse-grained meta-embedding with several fine-grained meta-embeddings based on the underlying semantic properties of users and items. Comprehensive experiments demonstrate that our method outperforms existing baselines. The code of our proposal is available at https://***/htyjers/C2F-MetaEmbed.
Chinese Named Entity Recognition (NER) for Electronic Medical Records (EMR) is a fundamental task in building a digital hospital and is widely considered to be a sequence annotation problem in the Natural Language Pro...
详细信息
The course Introduction to computer Networks (ICN) has become one of the most vital courses in computerscience and softwareengineering degrees and clearly is an imperative course for a degree in computer networking....
详细信息
Traditional autonomous navigation methods for mobile robots mainly rely on geometric feature-based LiDAR scan-matching algorithms, but in complex environments, this method is often affected due to the presence of movi...
详细信息
暂无评论