This paper expounds the automatic recognition method of parts based on computer vision. The feature database of the processed parts is constructed by using machine learning method. image preprocessing, threshold segme...
详细信息
Detecting and managing various types of defects that occur in the manufacturing process is important for product quality control. Detecting flaws in product presentation is an ongoing research topic in computer vision...
详细信息
Progress in computer vision, particularly based on machine learning, depends heavily on the availability of appropriate datasets. However, for applications like face recognition or emotion detection, this requires the...
详细信息
ISBN:
(纸本)9798350337266
Progress in computer vision, particularly based on machine learning, depends heavily on the availability of appropriate datasets. However, for applications like face recognition or emotion detection, this requires the collection offace images, which comprise especially privacy-sensitive biometric data. The corresponding valid ethical concerns and legal regulations regarding privacy rights limit the creation of new datasets. To reconcile the need for detailed facial image datasets with the right to privacy protection, appropriate anonymization techniques are needed. To this end, we suggest a pipeline to repurpose a face swapping tool for de-identification by combining it with synthetic image generation and a novel procedure to select source images to improve the trade-off between data utility retention and privacy enhancement. A quantitative comparison of our results to other de-identification approaches shows that our method leads to better retention of facial expressions while providing adequate privacy protection. Thus, applying this procedure to face image datasets before publication could help mitigate privacy concerns.
image-based rendering techniques stand at the core of an immersive experience for the user, as they generate novel views given a set of multiple input images. Since they have shown good performance in terms of objecti...
详细信息
We propose a feature parallel transformation module to strength the matching ability in stereo. We obtain additional information from input images, enhance features from the feature extraction module and design a para...
详细信息
Due to the rapid rise in the identification of digital materials, automatic image classification has emerged as the most difficult topic of computer vision. In comparison to human vision, automatic visual understandin...
详细信息
imageprocessing filters offer several significant applications, making them a crucial component of various consumer electronics and multimedia systems. These image filters are designed as dedicated reusable intellect...
详细信息
Controllable human image generation (HIG) has numerous real-life applications. State-of-the-art solutions, such as ControlNet and T2I-Adapter, introduce an additional learnable branch on top of the frozen pre-trained ...
ISBN:
(纸本)9798350307184
Controllable human image generation (HIG) has numerous real-life applications. State-of-the-art solutions, such as ControlNet and T2I-Adapter, introduce an additional learnable branch on top of the frozen pre-trained stable diffusion (SD) model, which can enforce various conditions, including skeleton guidance of HIG. While such a plugand-play approach is appealing, the inevitable and uncertain conflicts between the original images produced from the frozen SD branch and the given condition incur significant challenges for the learnable branch, which essentially conducts image feature editing for condition enforcement. In this work, we propose a native skeleton-guided diffusion model for controllable HIG called HumanSD. Instead of performing image editing with dual-branch diffusion, we fine-tune the original SD model using a novel heatmap-guided denoising loss. This strategy effectively and efficiently strengthens the given skeleton condition during model training while mitigating the catastrophic forgetting effects. HumanSD is fine-tuned on the assembly of three large-scale human-centric datasets with textimage-pose information, two of which are established in this work. Experimental results show that HumanSD outperforms ControlNet in terms of pose control and image quality, particularly when the given skeleton guidance is sophisticated.
In this paper, the nonlinear filtering algorithm for mixed noise is proposed based on the probabilistic statistical model of local texture direction of computer digital media images. The Radon transform is used to ana...
详细信息
The task of image captioning has seen considerable success using deep neural networks. This assessment offers a thorough overview of the most cutting-edge approaches for deep learning-based unsupervised image captioni...
详细信息
暂无评论