Depth images captured by depth cameras often suffer from issues such as holes and noise, which can have adverse effects on subsequent practical applications, including the accuracy of 3D reconstruction. To address the...
详细信息
ISBN:
(纸本)9798350386783;9798350386776
Depth images captured by depth cameras often suffer from issues such as holes and noise, which can have adverse effects on subsequent practical applications, including the accuracy of 3D reconstruction. To address these issues and enhance the quality of depth images, we propose depth image restoration algorithm based on improved non-local means (NLM) filtering. Initially, the algorithm utilizes fuzzy C-means (FCM) clustering to cluster the color image and generate a guidance image. Subsequently, neighborhoods are defined centered around the hole points in the depth image. A weight function for NLM filtering is then constructed using bilateral filtering. By calculating the weighted average of all non-hole points within the neighborhood, the holes are repaired. Finally, median filtering is utilized to smoothen and reduce noise in the depth image. Experimental findings show that the proposed algorithm efficiently repairs holes while preserving the edge details of image.
The purpose of low-light image enhancement is to improve the clarity of objects in low-light environments to facilitate the recognition and detection of targets later. Local Contrast Denoising and Fusion Network (LCDF...
详细信息
We show that crowd counting can be viewed as a decomposable point querying process. This formulation enables arbitrary points as input and jointly reasons whether the points are crowd and where they locate. The queryi...
ISBN:
(纸本)9798350307184
We show that crowd counting can be viewed as a decomposable point querying process. This formulation enables arbitrary points as input and jointly reasons whether the points are crowd and where they locate. The querying processing, however, raises an underlying problem on the number of necessary querying points. Too few imply underestimation;too many increase computational overhead. To address this dilemma, we introduce a decomposable structure, i.e., the point-query quadtree, and propose a new counting model, termed Point quEry Transformer (PET). PET implements decomposable point querying via data-dependent quadtree splitting, where each querying point could split into four new points when necessary, thus enabling dynamic processing of sparse and dense regions. Such a querying process yields an intuitive, universal modeling of crowd as both the input and output are interpretable and steerable. We demonstrate the applications of PET on a number of crowd-related tasks, including fully-supervised crowd counting and localization, partial annotation learning, and point annotation refinement, and also report state-of-the-art performance. For the first time, we show that a single counting model can address multiple crowd-related tasks across different learning paradigms. Code is available at https://***/cxliu0/PET.
In this paper, we adopt image style migration technique based on deep learning, use Vgg19 network for content and style feature extraction, combine an image with art design style, and realise the generation of Van Gog...
详细信息
Magnetic Compressors play major role in imageprocessing and microwave applications. AND-OR complex compound gates and XOR-XNOR modules are used to design compressor to achieve low power consumption and less hardware ...
详细信息
Deep neural network models are more and more widely used in image reconstruction and generation tasks. By setting various loss functions, the model adaptively generates images that meet the corresponding constraints, ...
详细信息
This paper deals with blind image separation by exploiting the statistical characteristics of the mixtures (information related to the sources independence) with the sparsity of the signals. More precisely, we investi...
详细信息
Recently there has been a growing interest in learning generative models from a single image. This task is important as in many real world applications, collecting large dataset is not feasible. Existing work like Sin...
详细信息
ISBN:
(纸本)9781728198354
Recently there has been a growing interest in learning generative models from a single image. This task is important as in many real world applications, collecting large dataset is not feasible. Existing work like SinGAN is able to synthesize novel images that resemble the patch distribution of the training image. However, SinGAN cannot learn high level semantics of the image, and thus their synthesized samples tend to have unrealistic spatial layouts. To address this issue, this paper proposes a spatially adaptive style-modulation (SASM) module that learns to preserve realistic spatial configuration of images. Specifically, it extracts style vector (in the form of channel-wise attention) and latent spatial mask (in the form of spatial attention) from a coarse level feature separately. The style vector and spatial mask are then aggregated to modulate features of deeper layers. The disentangled modulation of spatial and style attributes enables the model to preserve the spatial structure of the image without overfitting. Experimental results show that the proposed module learns to generate samples with better fidelity than prior works.
The update of modern intelligent technology has driven the progress of computer related technology. The tide of technological revolution is impacting the economic system all over the world. The emergence of informatio...
详细信息
The proliferation of technologies and unstructured data on the internet poses a persistent challenge in extracting valuable information from diverse formats. To address this, research leverages Machine Learning (ML) a...
详细信息
ISBN:
(纸本)9798350326970
The proliferation of technologies and unstructured data on the internet poses a persistent challenge in extracting valuable information from diverse formats. To address this, research leverages Machine Learning (ML) and Natural Language processing (NLP) techniques. This study contributes to information extraction from unstructured text using a stateof-the-art pipeline, incorporating modules for coreference resolution (Neuralcoref), named entity linking (Wikifier API), and Relationship Extraction (RE) (OpenNRE and REBEL models). The resulting Knowledge Graph (KG) in Neo4j captures entity relationships. Experiments on a BBC news dataset analyzed the pipeline's performance, focusing on RE. Accuracies of 61.4% (OpenNRE) and 87% (REBEL) were achieved. The research demonstrates the efficacy of the proposed pipeline in extracting structured knowledge from unstructured data, facilitating the preservation and utilization of valuable information.
暂无评论