The proceedings contain 23 papers. The topics discussed include: high throughput imaging and analysis for biological interpretation of agricultural plants and environmental interaction;investigation of segmentation ba...
ISBN:
(纸本)9780819499417
The proceedings contain 23 papers. The topics discussed include: high throughput imaging and analysis for biological interpretation of agricultural plants and environmental interaction;investigation of segmentation based pooling for image quantification;illumination invariant 3D change detection;illumination invariant pattern recognition using fringe-adjusted joint transform correlator and monogenic signal;on the use of MKL for cooking action recognition;efficient adaptive thresholding with image masks;hyperspectral image reconstruction using RGB color for foodborne pathogen detection on agar plates;improved wheal detection from skin prick test images;eye gaze tracking using correlation filters;image thresholding using standard deviation;object detection in MOUT: evaluation of a hybrid approach for confirmation and rejection of object detection hypotheses;and scoring recognizability of faces for security applications.
The proceedings contain 76 papers. The special focus in this conference is on machinevision and Augmented Intelligence. The topics include: Survey on Robustness of Deep Learning Techniques on Adversarial Attacks in W...
ISBN:
(纸本)9789819743582
The proceedings contain 76 papers. The special focus in this conference is on machinevision and Augmented Intelligence. The topics include: Survey on Robustness of Deep Learning Techniques on Adversarial Attacks in WBAN;synergizing Collaborative and Content-Based Filtering for Enhanced Movie Recommendations;exploring Transformer-Based Approaches for Hyperspectral image Classification: A Comparative Analysis;deep Learning for Cognitive Task and Seizure Classification with Hilbert–Huang Transform and Variational Mode Decomposition;tracking of Ship and Plane in Satellite Videos Using a Convolutional Regression Network with Deep Features;Tumor Detection and Analysis from Brain MRI images Using Deep Learning;software Maintenance Prediction Using Stack Ensemble Deep Learning Algorithms;resource Allocation in 6G Network for High-Speed Train Using D2D Outband Communication;controlling the Band-to-Band Tunneling Effect in Charge Plasma Based Dopingless Transistor;Comparison of Different CIC Filter Architectures on the Basis of a Novel Parameter Called Noise Factor for Sigma-Delta Based ADCs;the Scientific Analysis on Effective Yoga Posture Recognition Techniques;impact of Gamma Rays on Emerging Devices for Photonic applications;shaft Rotation Monitoring Using Radar Signal processing and Wavelet Transform;gysel Power Divider Miniaturization Using an Inter-Digital Capacitor-Based Slow-Wave Structure;noise Estimation and Removal in Fundus images Using Pyramid Real image Denoising Network;evaluation of Hybrid Encryption Method to Secure Healthcare Data;multimodal Face Recognition System Using Hybrid Deep Learning Feature;Classification of Copy and Move image by Using HELM-FSK Method: An Efficient Approach;analysis of Energy Efficient Smart Home Based on IoT System;role of Explainable Artificial Intelligence Approaches in Cybersecurity.
Currently, screen content video applications are increasingly widespread in our daily lives. The latest Screen Content Coding (SCC) standard, known as Versatile Video Coding (VVC) SCC, employs screen content Coding Mo...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Currently, screen content video applications are increasingly widespread in our daily lives. The latest Screen Content Coding (SCC) standard, known as Versatile Video Coding (VVC) SCC, employs screen content Coding Modes (CMs) selection. While VVC SCC achieves high coding efficiency, its coding complexity poses a significant obstacle to the further widespread adoption of screen content video. Hence, it is crucial to enhance the coding speed of VVC SCC. In this paper, we propose a fast mode and splitting decision for Intra prediction in VVC SCC. Specifically, we initially exploit deep learning techniques to predict content types for all CUs. Subsequently, we examine CM distributions of different content types to predict candidate CMs for CUs. We then introduce early skip and early terminate CM decisions for different content types of CUs to further eliminate unlikely CMs. Finally, we develop Block-based Differential Pulse-Code Modulation (BDPCM) early termination to improve coding speed. Experimental results demonstrate that the proposed algorithm can improve coding speed by 34.95% on average while maintaining almost the same coding efficiency.
Recent years witness the tremendous success of generative adversarial networks (GANs) in synthesizing photo-realistic images. GAN generator learns to compose realistic images and reproduce the real data distribution. ...
详细信息
Recent years witness the tremendous success of generative adversarial networks (GANs) in synthesizing photo-realistic images. GAN generator learns to compose realistic images and reproduce the real data distribution. Through that, a hierarchical visual feature with multi-level semantics spontaneously emerges. In this work we investigate that such a generative feature learned from image synthesis exhibits great potentials in solving a wide range of computer vision tasks, including both generative ones and more importantly discriminative ones. We first train an encoder by considering the pre-trained StyleGAN generator as a learned loss function. The visual features produced by our encoder, termed as Generative Hierarchical Features (GH-Feat), highly align with the layer-wise GAN representations, and hence describe the input image adequately from the reconstruction perspective. Extensive experiments support the versatile transferability of GH-Feat across a range of applications, such as image editing, imageprocessing, image harmonization, face verification, landmark detection, layout prediction, image retrieval, etc. We further show that, through a proper spatial expansion, our developed GH-Feat can also facilitate fine-grained semantic segmentation using only a few annotations. Both qualitative and quantitative results demonstrate the appealing performance of GH-Feat. Code and models are available at https://***/ghfeat/.
Computer vision in precision agriculture analysis has gained increasing attention as recent advancements in deep learning-based methods for various tasks were proven successful. As one of the primary problems in agric...
详细信息
ISBN:
(纸本)9781665464680
Computer vision in precision agriculture analysis has gained increasing attention as recent advancements in deep learning-based methods for various tasks were proven successful. As one of the primary problems in agriculture-visionapplications, semantic segmentation from aerial agricultural images, differs from common object or aerial image segmentation tasks in various ways. Recently, there have been some efforts that aim to apply deep learning techniques to model multi-spectral aerial images and segment field anomaly pattern objects with extremely irregular shapes and scales. However, most existing methods fail to propose effective methods for model initialization and perform poorly in segmenting small objects. To address these challenges, we propose a deep learning framework that leverages momentum contrast learning with a PointRend-based model for aerial image analysis. Extensive experiments have demonstrated the effectiveness of our model for better aerial image semantic segmentation.
This paper investigates advanced techniques in image recognition and classification by integrating deep learning and machine learning approaches to achieve higher accuracy. Through the implementation of sophisticated ...
详细信息
Object detection and classification denotes one of the most extensively-utilized machinevisionapplications given the high requirements put forward for object classification and defect detection with the rise of obje...
详细信息
Object detection and classification denotes one of the most extensively-utilized machinevisionapplications given the high requirements put forward for object classification and defect detection with the rise of object recognition scenes. Notwithstanding, conventional image recognition processing technology encounters specific drawbacks. Its benefits and limitations were duly compared upon selecting several typical conventional image recognition techniques. Resultantly, such recognition approaches required multiple manual participation elements and extensive manpower with restricted object identification. As a branch of machine learning, deep learning has attained more optimal results in the image recognition discipline. In the classification and defect detection of industrial workpieces, over 70 literature reviews of deep learning algorithms across multiple application scenarios for classical algorithm model and network structure assessment based on the deep learning theory. Relevant network model performance was compared and analyzed based on network intricacies parallel to natural image classification. Six research gaps were found based on the reviewed algorithm pros and cons. The corresponding six research proposal in workpiece image classification was highlighted with prospects on the workpiece image classification and defect detection direction development. It provides an empirical solution for the selection of workpiece classification and defect detection deep learning model in the future.
Satellite still image plays a crucial role in various domains, such as law enforcement, disaster response, and environmental monitoring. The ability to manually identify objects and facilities within these images is o...
详细信息
Encoder models have shown remarkable success in various computer vision operations like object detection, image classification, and semantic segmentation. However, the results of one model showed that it was underfitt...
详细信息
One of the most interesting and challenging research focuses on pattern recognition and imageprocessing has emerged in recent days is writing in the air. In many different applications, it can improve the interface b...
详细信息
暂无评论