camouflaged object detection is a hard assignment due to their textures are similar to the background. The main intention of this paper is probe into a problem about the camouflaged object detection, that is, detectin...
详细信息
ISBN:
(纸本)9781728165905
camouflaged object detection is a hard assignment due to their textures are similar to the background. The main intention of this paper is probe into a problem about the camouflaged object detection, that is, detecting its camouflagedobject for a given image. This problem has not been well studied in spite of a large area of potential applications such as camouflage military targets detection and wildlife protection. To address this problem, a camouflage objectdetection method based on deep learning is proposed. The suggested method can detect camouflagedobject which can extract deep features automatically. It can also provide detection probability which reflect camouflage efficiency. Experimental results show that the deep learning measure can effectively detect different scene, representing the camouflage level of low, medium and high respectively.
camouflagedobject segmentation (COS) is a recently emerging task due to its broad application prospect. The coloration and texture similarities between the objects and their surroundings makes it a challenging task. ...
详细信息
camouflagedobject segmentation (COS) is a recently emerging task due to its broad application prospect. The coloration and texture similarities between the objects and their surroundings makes it a challenging task. Motivated by this, we propose a consistency-oriented network (CoNet) to address these challenges by looking into the visual consistencies between object and background. Specifically, we design a primary detection module (PDM) to firstly locate the object by fusing the backbone features. A filter is introduced to better focus on the object's foreground feature based on its primary location. To obtain the visual consistency between the object and background, the foreground feature is then fed into the consistency evaluation module (CEM) to interact with the global feature. Both features are simultaneously processed by a shared discriminator and then fused together to attain the consistency attention map. The final feature refinement is conducted in the detail refinement module (DRM) by merging the consistency attention map with the global features via hierarchical feature fusion. Extensive experiments on benchmark COS datasets show that the proposed CoNet outperforms the state-of-the-art (SOTA) models in most cases. Ablation experiments verify the effectiveness of different backbones, designed modules and upsampling methods. Furthermore, extra studies on the labelling techniques and interdisciplinary applications demonstrate the great potential of the proposed CoNet.
camouflaged object detection (COD) aims to detect objects that 'blend in' with their surroundings and the lack of a clear boundary between the target object and the background in COD tasks makes accurate detec...
详细信息
camouflaged object detection (COD) aims to detect objects that 'blend in' with their surroundings and the lack of a clear boundary between the target object and the background in COD tasks makes accurate detection of targets difficult. Although many innovative algorithms and methods have been developed to improve the results of camouflaged object detection, the problem of poor detection accuracy in complex scenes still exists. To improve the accuracy of camouflage target segmentation, a camouflaged object detection algorithm using contextual feature enhancement and an attention mechanism called amplify and predict network (APNet) is proposed. In this paper, context feature enhancement module (CFEM) and reverse attention prediction module (RAPM) are *** can accept multi-level features extracted from the backbone network, and convey the features with enhancement processing to achieve the fusion of multi-level *** focuses on the edge feature information through the reverse attention mechanism to mine deeper camouflaged target information to achieve and further refine the predicted results. The proposed algorithm achieves weighted F-measure and mean absolute error (MAE) of 0.708 and 0.033 on the COD10K dataset, respectively, and the experimental results on other publicly available datasets are also significantly better than the other 14 state-of-the-art models, and achieves the optimal performance on the four objective evaluation metrics, and the proposed algorithm obtains sharper edge details on COD tasks and improves the prediction performance.
In order to obtain the higher efficiency and the more accuracy in camouflaged object detection (COD), a lightweight COD network based on edge detection and coordinate attention assistance (EDCAANet) is presented in th...
详细信息
In order to obtain the higher efficiency and the more accuracy in camouflaged object detection (COD), a lightweight COD network based on edge detection and coordinate attention assistance (EDCAANet) is presented in this paper. Firstly, an Integrated Edge and Global Context Information Module (IEGC) is proposed, which uses edge detection as an auxiliary means to collaborate with the atrous spatial convolution pooling pyramid (ASPP) for obtaining global context information to achieve the preliminary positioning of the camouflagedobject. Then, the Receptive Field Module based on Coordinate Attention (RFMC) is put forward, in which the Coordinate Attention (CA) mechanism is employed as another aid means to expand receptive ffeld features and then achieve global comprehensive of the image. In the final stage of feature fusion, the proposed lightweight Adjacent and Global Context Focusing module (AGCF) is employed to aggregate the multi-scale semantic features output by RFMC at adjacent levels and the global context features output by IEGC. These aggregated features are mainly refined by the proposed Multi Scale Convolutional Aggregation (MSDA) blocks in the module, allowing features to interact and combine at various scales to ultimately produce prediction results. The experiments include performance comparison experiment, testing in complex background, generalization experiment, as well as ablation experiment and complexity analysis. Four public datasets are adopted for experiments, four recognized COD metrics are employed for performance evaluation, 3 backbone networks and 18 methods are used for comparison. The experimental results show that the proposed method can obtain both the more excellent detection performance and the higher efficiency.
Depth information provides valuable insights into the 3D structure, especially the outline of objects, which can be utilized to enhance semantic segmentation. However, a naive fusion of depth information can disrupt f...
详细信息
Depth information provides valuable insights into the 3D structure, especially the outline of objects, which can be utilized to enhance semantic segmentation. However, a naive fusion of depth information can disrupt features and compromise accuracy due to the gap between depth and RGB modalities. In this work, we introduce a depth-guided texture diffusion approach that effectively tackles the outlined challenge. Our method extracts low-level features from edges and textures to create a texture image. This image is then selectively diffused across the depth map, enhancing structural information vital for precisely extracting object outlines. By integrating this enhanced depth map with the original RGB image into a joint feature embedding, our method effectively bridges the disparity between depth and RGB modalities, enabling more accurate semantic segmentation. We conduct comprehensive experiments on diverse, widely-used datasets covering various semantic segmentation tasks, including camouflaged object detection (COD), Salient objectdetection (SOD), and indoor semantic segmentation. With source-free estimated depth or depth captured by depth cameras, our method consistently outperforms existing baselines and achieves new state-of-the-art results, demonstrating the effectiveness of our depth-guided texture diffusion for image semantic segmentation. The source code and datasets are publicly available at https://***/Wistzz/***.
Due to the high similarity between hidden objects and the surrounding background, camouflaged object detection (COD) remains a challenge. While many recently proposed methods have shown remarkable performance, most of...
详细信息
Due to the high similarity between hidden objects and the surrounding background, camouflaged object detection (COD) remains a challenge. While many recently proposed methods have shown remarkable performance, most of them begin object perception by indiscriminately considering every pixel of the image. However, these early-stage region-insensitive perception methods still struggle to resist background interference, potentially missing subtle pixel changes by not prioritizing potential camouflaged areas initially. Fortunately, we reveal that the availability of an accurate mutation map can significantly enhance camouflaged discrimination ability. To this end, we propose MRNet (Mutation Region Network). MRNet initially generates a mutation map that identifies potential mutation regions exhibiting subtle pixel changes. The generation method involves amplifying and differing pixel changes based on the position and corresponding values of pixels. Subsequently, the selective expansion search operation utilizes the mutation map to extract the mapped graph, effectively reducing interference from background pixels that are distant from the mutation regions. Finally, decoding the mapped graph generates precise masks. Furthermore, we have created the largest test dataset with known categories to advance community research. Extensive experiments conducted on three widely used datasets and our proposed dataset show that MRNet surpasses other methods with superior performance.
camouflaged object detection (COD) aims to find objects hidden in the surrounding environment. camouflagedobjects are usually fused with the background and have high intrinsic similarity, and the sizes and appearance...
详细信息
camouflaged object detection (COD) aims to find objects hidden in the surrounding environment. camouflagedobjects are usually fused with the background and have high intrinsic similarity, and the sizes and appearances of camouflagedobjects vary, causing most previous single-stage methods to fail. To this end, we propose a multivariate feature interaction network (MFINet) based on a multistage design. Specifically, the MFINet consists of three stages: a feature exploration stage, a feature fusion stage, and a feature refinement stage. In the feature exploration phase, we design three key components: a global information exploration unit (GIEU), a spatial information exploration unit (SIEU), and a multi-scale exploration unit (MEU). The GIEU locates the position of the camouflagedobject by integrating the global contextual information. The SIEU aggregates effective spatial information. The MEU explores the rich multi-scale information. In the feature fusion stage, we design a multielement feature fusion module to integrate the consistency and complementarity of multi-scale features to fully explore the valuable clues in the context by exploiting the global semantics and spatial details. In the feature refinement phase, we design a cross-layer consistency refinement module to facilitate information transfer between neighboring layers and refine camouflage features by mutual guidance of low-level and high-level features. The proposed MFINet is evaluated on four challenging COD benchmark datasets (i.e., CAMO, CHAMELEON, COD10K, and NC4K) using four widely used metrics. These evaluations demonstrate that our proposed method is superior to other existing cutting-edge methods for camouflaged object detection. The code will be released at https://***/ZYJ9224/MFINet.
camouflagedobjects often blend in with their surroundings, making the perception of a camouflagedobject a more complex procedure. However, most neural-network-based methods that simulate the visual information proce...
详细信息
camouflagedobjects often blend in with their surroundings, making the perception of a camouflagedobject a more complex procedure. However, most neural-network-based methods that simulate the visual information processing pathway of creatures only roughly define the general process, which deficiently reproduces the process of identifying camouflagedobjects. How to make modeled neural networks perceive camouflagedobjects as effectively as creatures is a significant topic that deserves further consideration. After meticulous analysis of biological visual information processing, we propose an end-to-end prudent and comprehensive neural network, termed IdeNet, to model the critical information processing. Specifically, IdeNet divides the entire perception process into five stages: information collection, information augmentation, information filtering, information localization, and information correction and object identification. In addition, we design tailored visual information processing mechanisms for each stage, including the information augmentation module (IAM), the information filtering module (IFM), the information localization module (ILM), and the information correction module (ICM), to model the critical visual information processing and establish the inextricable association of biological behavior and visual information processing. The extensive experiments show that IdeNet outperforms state-of-the-art methods in all benchmarks, demonstrating the effectiveness of the five-stage partitioning of visual information processing pathway and the tailored visual information processing mechanisms for camouflaged object detection. Our code is publicly available at: https://***/whyandbecause/IdeNet.
作者:
Wei, TaoWang, YiqiZhang, YuqiangWang, YunfuZhao, LiangGuangxi Univ
Sch Comp & Elect Informat Nanning Peoples R China Hubei Univ Med
Taihe Hospita l Shiyan 442000 Peoples R China Hubei Univ Med
Taihe Hosp Hubei Key Lab Embryon Stem Cell Res Shiyan 442000 Peoples R China Hubei Univ Med
Taihe Hosp Hubei Prov Clin Res Ctr Precise Diag & Treatment L Hubei Key Lab Embryon Stem Cell Res Shiyan 442000 Peoples R China Guangxi Univ
Sch Comp & Elect Informat Nanning 530004 Peoples R China
Early diagnosis plays a pivotal role in handling the global health challenge posed by liver diseases. However, early-stage lesions are typically quite small, presenting significant difficulties due to insufficient reg...
详细信息
Early diagnosis plays a pivotal role in handling the global health challenge posed by liver diseases. However, early-stage lesions are typically quite small, presenting significant difficulties due to insufficient regions for developing effective features, indistinguishable boundaries of small lesions, and a lack of tiny liver lesion masks. To address these issues, we approach the solution in two-fold: an efficient model and a high-quality dataset. The model is built upon the advantages of path signature and camouflaged object detection. The path signature narrows down the ambiguous boundaries between lesions and other tissues while the camouflaged object detection achieves high accuracy in detecting inconspicuous lesions. The two are seamlessly integrated to ensure high accuracy and fidelity. For the dataset, we collect more than ten thousand liver images with over four thousand lesions, approximately half of which are small. Experiments on both an established dataset and our newly constructed one show that the proposed model outperforms state-of-the-art semantic segmentation and camouflaged object detection models, particularly in detecting small lesions. Moreover, the decisive and faithful salience maps generated by the model at the boundary regions demonstrate its strong robustness.
camouflagedobjects are typically assimilated into their surroundings. Consequently, in contrast to generic objectdetection/segmentation, camouflaged object detection proves to be considerably more intricate due to t...
详细信息
camouflagedobjects are typically assimilated into their surroundings. Consequently, in contrast to generic objectdetection/segmentation, camouflaged object detection proves to be considerably more intricate due to the indistinct boundaries and heightened intrinsic similarities between foreground targets and the surrounding environment. Despite the proposition of numerous algorithms that have demonstrated commendable performance across various scenarios, these approaches may still grapple with blurred boundaries, leading to the inadvertent omission of camouflaged targets in challenging scenes. In this paper, we introduce a multi-stage framework tailored for segmenting camouflagedobjects through a process of coarse-to-fine refinement. Specifically, our network encompasses three distinct decoders, each fulfilling a unique role in the model. In the initial decoder, we introduce the Bi-directional Locating Module to excavate foreground and background cues, enhancing target localization. The second decoder focuses on leveraging boundary information to augment overall performance, incorporating the Multi-level Feature Fusion Module to generate prediction maps with finer boundaries. Subsequently, the third decoder introduces the Mask-guided Fusion Module, designed to process high-resolution features under the guidance of the second decoder's results. This approach enables the preservation of structural details and the generation of fine-grained prediction maps. Through the integration of the three decoders, our model effectively identifies and segments camouflaged targets. Extensive experiments are conducted on three commonly used benchmark datasets. The results of these experiments demonstrate that, even without the application of pre-processing or post-processing techniques, our model outperforms 14 state-of-the-art algorithms.
暂无评论