Precise polyp segmentation is vital for the early diagnosis and prevention of colorectal cancer(CRC)in clinical ***,due to scale variation and blurry polyp boundaries,it is still a challenging task to achieve satisfac...
详细信息
Precise polyp segmentation is vital for the early diagnosis and prevention of colorectal cancer(CRC)in clinical ***,due to scale variation and blurry polyp boundaries,it is still a challenging task to achieve satisfactory segmentation performance with different scales and *** this study,we present a novel edge-aware feature aggregation network(EFA-Net)for polyp segmentation,which can fully make use of cross-level and multi-scale features to enhance the performance of polyp ***,we first present an edge-aware guidance module(EGM)to combine the low-level features with the high-level features to learn an edge-enhanced feature,which is incorporated into each decoder unit using a layer-by-layer ***,a scale-aware convolution module(SCM)is proposed to learn scale-aware features by using dilated convolutions with different ratios,in order to effectively deal with scale ***,a cross-level fusion module(CFM)is proposed to effectively integrate the cross-level features,which can exploit the local and global contextual ***,the outputs of CFMs are adaptively weighted by using the learned edge-aware feature,which are then used to produce multiple side-out segmentation *** results on five widely adopted colonoscopy datasets show that our EFA-Net outperforms state-of-the-art polyp segmentation methods in terms of generalization and *** implementation code and segmentation maps will be publicly at https://***/taozh2017/EFANet.
Recommender systems are effective in mitigating information overload, yet the centralized storage of user data raises significant privacy concerns. Cross-user federated recommendation(CUFR) provides a promising distri...
详细信息
Recommender systems are effective in mitigating information overload, yet the centralized storage of user data raises significant privacy concerns. Cross-user federated recommendation(CUFR) provides a promising distributed paradigm to address these concerns by enabling privacy-preserving recommendations directly on user devices. In this survey, we review and categorize current progress in CUFR, focusing on four key aspects: privacy, security, accuracy, and efficiency. Firstly,we conduct an in-depth privacy analysis, discuss various cases of privacy leakage, and then review recent methods for privacy protection. Secondly, we analyze security concerns and review recent methods for untargeted and targeted *** untargeted attack methods, we categorize them into data poisoning attack methods and parameter poisoning attack methods. For targeted attack methods, we categorize them into user-based methods and item-based methods. Thirdly,we provide an overview of the federated variants of some representative methods, and then review the recent methods for improving accuracy from two categories: data heterogeneity and high-order information. Fourthly, we review recent methods for improving training efficiency from two categories: client sampling and model compression. Finally, we conclude this survey and explore some potential future research topics in CUFR.
Estimating hand pose is a challenge that has significantly benefited from using deep learning-based algorithms. This study area holds critical significance across various computervision and robotics domains, includin...
详细信息
Several genetic disorders and other metabolic abnormalities work together to generate the lethal disease known as cancer. Today’s most contributing factors to mortality and disability in patients are lung and colon c...
详细信息
In smart driving for rail transit, a reliable obstacle detection system is an important guarantee for the safety of trains. Therein, the detection of the rail area directly affects the accuracy of the system to identi...
详细信息
In smart driving for rail transit, a reliable obstacle detection system is an important guarantee for the safety of trains. Therein, the detection of the rail area directly affects the accuracy of the system to identify dangerous targets. Both the rail line and the lane are presented as thin line shapes in the image, but the rail scene is more complex, and the color of the rail line is more difficult to distinguish from the background. By comparison, there are already many deep learning-based lane detection algorithms, but there is a lack of public datasets and targeted deep learning detection algorithms for rail line detection. To address this, this paper constructs a rail image dataset RailwayLine and labels the rail line for the training and testing of models. This dataset contains rich rail images including single-rail, multi-rail, straight rail, curved rail, crossing rails, occlusion, blur, and different lighting conditions. To address the problem of the lack of deep learning-based rail line detection algorithms, we improve the CLRNet algorithm which has an excellent performance in lane detection, and propose the CLRNet-R algorithm for rail line detection. To address the problem of the rail line being thin and occupying fewer pixels in the image, making it difficult to distinguish from complex backgrounds, we introduce an attention mechanism to enhance global feature extraction ability and add a semantic segmentation head to enhance the features of the rail region by the binary probability of rail lines. To address the poor curve recognition performance and unsmooth output lines in the original CLRNet algorithm, we improve the weight allocation for line intersection-over-union calculation in the original framework and propose two loss functions based on local slopes to optimize the model’s local sampling point training constraints, improving the model’s fitting performance on curved rails and obtaining smooth and stable rail line detection results. Through expe
Exploration strategy design is a challenging problem in reinforcement learning(RL),especially when the environment contains a large state space or sparse *** exploration,the agent tries to discover unexplored(novel)ar...
详细信息
Exploration strategy design is a challenging problem in reinforcement learning(RL),especially when the environment contains a large state space or sparse *** exploration,the agent tries to discover unexplored(novel)areas or high reward(quality)*** existing methods perform exploration by only utilizing the novelty of *** novelty and quality in the neighboring area of the current state have not been well utilized to simultaneously guide the agent’s *** address this problem,this paper proposes a novel RL framework,called clustered reinforcement learning(CRL),for efficient exploration in *** adopts clustering to divide the collected states into several clusters,based on which a bonus reward reflecting both novelty and quality in the neighboring area(cluster)of the current state is given to the *** leverages these bonus rewards to guide the agent to perform efficient ***,CRL can be combined with existing exploration strategies to improve their performance,as the bonus rewards employed by these existing exploration strategies solely capture the novelty of *** on four continuous control tasks and six hard-exploration Atari-2600 games show that our method can outperform other state-of-the-art methods to achieve the best performance.
The cross-view matching of local image features is a fundamental task in visual localization and 3D *** study proposes FilterGNN,a transformer-based graph neural network(GNN),aiming to improve the matching efficiency ...
详细信息
The cross-view matching of local image features is a fundamental task in visual localization and 3D *** study proposes FilterGNN,a transformer-based graph neural network(GNN),aiming to improve the matching efficiency and accuracy of visual *** on high matching sparseness and coarse-to-fine covisible area detection,FilterGNN utilizes cascaded optimal graph-matching filter modules to dynamically reject outlier ***,we successfully adapted linear attention in FilterGNN with post-instance normalization support,which significantly reduces the complexity of complete graph learning from O(N2)to O(N).Experiments show that FilterGNN requires only 6%of the time cost and 33.3%of the memory cost compared with SuperGlue under a large-scale input size and achieves a competitive performance in various tasks,such as pose estimation,visual localization,and sparse 3D reconstruction.
We present the design and implementation of a novel low-cost smart buoy IoUT device for anchor monitoring of recreational vessels to detect anchor drag. All current solutions solve this problem by monitoring the posit...
详细信息
Wheat is the most widely grown crop in the world,and its yield is closely related to global food *** number of ears is important for wheat breeding and yield ***,automated wheat ear counting techniques are essential f...
详细信息
Wheat is the most widely grown crop in the world,and its yield is closely related to global food *** number of ears is important for wheat breeding and yield ***,automated wheat ear counting techniques are essential for breeding high-yield varieties and increasing grain ***,all existing methods require position-level annotation for training,implying that a large amount of labor is required for annotation,limiting the application and development of deep learning technology in the agricultural *** address this problem,we propose a count-supervised multiscale perceptive wheat counting network(CSNet,count-supervised network),which aims to achieve accurate counting of wheat ears using quantity *** particular,in the absence of location information,CSNet adopts MLP-Mixer to construct a multiscale perception module with a global receptive field that implements the learning of small target attention maps between wheat ear *** conduct comparative experiments on a publicly available global wheat head detection dataset,showing that the proposed count-supervised strategy outperforms existing position-supervised methods in terms of mean absolute error(MAE)and root mean square error(RMSE).This superior performance indicates that the proposed approach has a positive impact on improving ear counts and reducing labeling costs,demonstrating its great potential for agricultural counting *** code is available at .
A dynamic video summarization system detects key parts of the input video to generate its compact representation. The summaries can be used for efficient management of video data. This paper proposes an approach, Vide...
详细信息
暂无评论