We introduce novel methods that speed up the pose-graph generation for global Structure-from-Motion algorithms. We replace the widely used "accept-or-reject" strategy for image pairs, where often thousands o...
详细信息
This work presents real time gender prediction using Convolutional Neural Network. Automatic classification of gender has become an area that has garnered importance due to the emergence of breakthroughs in the world ...
详细信息
This work presents real time gender prediction using Convolutional Neural Network. Automatic classification of gender has become an area that has garnered importance due to the emergence of breakthroughs in the world of computing particularly with the advent of machine learning and Artificial intelligence. The goal of gender prediction in computer vision involves accurate prediction of gender from visual data. Gender prediction is an indispensable biological metric as it plays a significant role in many human applications. The application varies from immigration, border access, law enforcement, defence and intelligence, citizen identification and banking. imageprocessing combined with machine learning algorithms have been employed to build solutions from image representation of biological attributes such as facial images, human skeletal radiographs (most notably skull and pelvic bones), gait, smiles and non-biological features such as social media activities, names and other means that could be employed to determine gender by extracting regions of interest, applying necessary filters and normalizing the matrix values obtained. For the purpose of this work, a total of 6760 hand-bone radiographs were acquired from the Radiological Society of North America Repository. The dataset was divided into 70% training datasets and 30% test dataset using Random Sampling Cross-Validation Method. The acquired data images were pre-processed using image cropping, histogram equalization and segmentation techniques. Performance evaluation was carried out on the developed system using the metrics: Accuracy, False Positive Rate (FPR), Recognition Time, Specificity and Sensitivity, these metrics were evaluated at threshold values of 0.25, 0.35, 0.5 and 0.75. Optimal values were gotten at optimum threshold value of 0.75, the values are; 4.79, 93.67, 95.21, 94.53 and 137.87 respectively for metrics (FPR, Sensitivity, Specificity, Accuracy and Time).
Quantum machine Learning (QML) promises the transformative potential in computer vision by utilizing quantum computing to facilitate faster high-dimensional data processing. In this paper, we will go through some of t...
详细信息
ISBN:
(数字)9798350367942
ISBN:
(纸本)9798350367959
Quantum machine Learning (QML) promises the transformative potential in computer vision by utilizing quantum computing to facilitate faster high-dimensional data processing. In this paper, we will go through some of the recent works that employ QML for computer vision problems such as image Segmentation, Classification, and Generation. Demonstrations aimed at showing where QML methods beat the state of art techniques in particular applications like facial recognition, medical imaging, and satellite imagery. QML aspires to make pathbreaking changes in a field limited by current hardware capabilities. This poster abstract summarizes the important studies, methodologies and findings to inform further research in this developing field.
Depth image spatial clustering is an important task in the fields of computer vision and machine learning, aiming to group pixels or point cloud data of depth images into clusters with similar features. This is crucia...
Depth image spatial clustering is an important task in the fields of computer vision and machine learning, aiming to group pixels or point cloud data of depth images into clusters with similar features. This is crucial for tasks such as object recognition, scene segmentation, 3D reconstruction, and other applications. The K-means algorithm is a commonly used clustering method that divides data points into K clusters, ensuring that each data point belongs to the cluster whose center is closest to it. However, traditional K-means algorithms face several challenges in depth image spatial clustering, such as sensitivity to the choice of K and the influence of cluster center initialization on the results. In this study, we propose an improved depth image clustering method based on spatial information. The method includes depth image edge extraction, intra-edge region growing, unclassified point re-clustering, and connected region merging. Through experiments conducted on datasets from TUM, Bonn, and our collected data, our algorithm demonstrates superior performance in depth image clustering tasks, higher clustering accuracy, fewer unclassified points, and faster convergence speed.
The blooming proliferation of aeronautics and astronautics platforms, together with the ever-increasing remote sensing imaging sensors on these platforms, has led to the formation of rapidly-growing earth observation ...
详细信息
The blooming proliferation of aeronautics and astronautics platforms, together with the ever-increasing remote sensing imaging sensors on these platforms, has led to the formation of rapidly-growing earth observation data with the characteristics of large volume, large variety, large velocity, large veracity and large value, which raises awareness about the importance of large-scale imageprocessing, fusion and mining. Unconsciously, we have entered an era of big earth data, also called remote sensing (RS) big data. Although RS big data provides great opportunities for a broad range of applications such as disaster rescue, global security, and so forth, it inevitably poses many additional processing challenges. As one of the most fundamental and important tasks in RS big data mining, image retrieval (i.e., image information mining) from RS big data has attracted continuous research interests in the last several decades. This paper mainly works for systematically reviewing the emerging achievements for image retrieval from RS big data. And then this paper further discusses the RS image retrieval based applications including fusion-oriented RS imageprocessing, geo-localization and disaster rescue. To facilitate the quantitative evaluation of the RS image retrieval technique, this paper gives a list of publicly open datasets and evaluation metrics, and briefly recalls the mainstream methods on two representative benchmarks of RS image retrieval. Considering the latest advances from multiple domains including computer vision, machine learning and knowledge engineering, this paper points out some promising research directions towards RS big data mining. From this survey, engineers from industry may find skills to improve their RS image retrieval systems and researchers from academia may find ideas to conduct some innovative work.
The proceedings contain 108 papers. The special focus in this conference is on Wireless Artificial Intelligent Computing Systems and applications. The topics include: Profinder: Towards Professionals Recognition on...
ISBN:
(纸本)9783031714696
The proceedings contain 108 papers. The special focus in this conference is on Wireless Artificial Intelligent Computing Systems and applications. The topics include: Profinder: Towards Professionals Recognition on Mobile Devices for Users with Cognitive Decline;truthful Double Auction-Based Resource Allocation Mechanisms for Latency-Sensitive applications in Edge Clouds;V2ICooper: Toward Vehicle-to-Infrastructure Cooperative Perception with Spatiotemporal Asynchronous Fusion;recommendation-Aware Collaborative Edge Caching Strategy in the Internet of Vehicles;towards Robust Internet of Vehicles Security: An Edge Node-Based machine Learning Framework for Attack Classification;device-Edge-Cloud Collaborative Video Stream processing and Scheduling Strategy Based on Deep Reinforcement Learning;consistent Low-Latency Scheduling for Microsecond-Scale Tasks in Data Centers;multi-scale Data Reconstruction Based Policy Optimization Algorithm for Skill Learning;Joint Optimization Design of Intelligence Reflecting Surface Assisted MU-MISO System Based on Deep Reinforcement Learning;Secure Motion Verification for High Altitude Platforms with a Hybrid AOA-TDOA-FDOA Scheme;variational Autoencoder Based Automatic Clustering for Multivariate Time Series Anomaly Detection;deep Reinforcement Learning Based Economic Dispatch with Cost Constraint in Cyber Physical Energy System;inceptionNeXt Network with Relative Position Information for Microexpression Recognition;adMarks: image Steganography Based on Adversarial Perturbation;a Review on Binary Code Analysis Datasets;low-Cost Robot Path Planning Mechanism for Escaping from Dead Ends;towards Communication-Efficient Collaborative Perception: Harnessing Channel-Spatial Attention and Knowledge Distillation;improving Anomaly Scene Recognition with Large vision-Language Models;ReSU-Net: State Space Model for 3D Abdominal Multi-organ Segmentation;autocue : Targeted Textual Adversarial Attacks with Adversarial Prompts;universal Sign Language
The proceedings contain 108 papers. The special focus in this conference is on Wireless Artificial Intelligent Computing Systems and applications. The topics include: Profinder: Towards Professionals Recognition on...
ISBN:
(纸本)9783031714634
The proceedings contain 108 papers. The special focus in this conference is on Wireless Artificial Intelligent Computing Systems and applications. The topics include: Profinder: Towards Professionals Recognition on Mobile Devices for Users with Cognitive Decline;truthful Double Auction-Based Resource Allocation Mechanisms for Latency-Sensitive applications in Edge Clouds;V2ICooper: Toward Vehicle-to-Infrastructure Cooperative Perception with Spatiotemporal Asynchronous Fusion;recommendation-Aware Collaborative Edge Caching Strategy in the Internet of Vehicles;towards Robust Internet of Vehicles Security: An Edge Node-Based machine Learning Framework for Attack Classification;device-Edge-Cloud Collaborative Video Stream processing and Scheduling Strategy Based on Deep Reinforcement Learning;consistent Low-Latency Scheduling for Microsecond-Scale Tasks in Data Centers;multi-scale Data Reconstruction Based Policy Optimization Algorithm for Skill Learning;Joint Optimization Design of Intelligence Reflecting Surface Assisted MU-MISO System Based on Deep Reinforcement Learning;Secure Motion Verification for High Altitude Platforms with a Hybrid AOA-TDOA-FDOA Scheme;variational Autoencoder Based Automatic Clustering for Multivariate Time Series Anomaly Detection;deep Reinforcement Learning Based Economic Dispatch with Cost Constraint in Cyber Physical Energy System;inceptionNeXt Network with Relative Position Information for Microexpression Recognition;adMarks: image Steganography Based on Adversarial Perturbation;a Review on Binary Code Analysis Datasets;low-Cost Robot Path Planning Mechanism for Escaping from Dead Ends;towards Communication-Efficient Collaborative Perception: Harnessing Channel-Spatial Attention and Knowledge Distillation;improving Anomaly Scene Recognition with Large vision-Language Models;ReSU-Net: State Space Model for 3D Abdominal Multi-organ Segmentation;autocue : Targeted Textual Adversarial Attacks with Adversarial Prompts;universal Sign Language
Computer vision has become an interdisciplinary subject integrating digital imageprocessing, machine learning, computer science, and many other categories. It can provide contactless data collection and technical ana...
详细信息
images captured in hazy environments need to be processed to increase their contrast and colour integrity. Dehazing, sometimes referred to as haze removal is an important pre-processing step for imageprocessing and c...
详细信息
images captured in hazy environments need to be processed to increase their contrast and colour integrity. Dehazing, sometimes referred to as haze removal is an important pre-processing step for imageprocessing and computer visionapplications. Numerous methods for dehazing images have been proposed in the literature. This study provides a complete evaluation of numerous image dehazing techniques and their notable standards. This study extracted and presented an important direction of the many algorithms to handle the challenges of dehazing such as model-based methods, transform domain methods, variational-based algorithms, learningbased algorithm and transformer-based algorithms. This study attempts to compile and evaluate the most important studies in the domain of image dehazing. A variety of factors have been considered necessary to provide a detailed information in this study. These factors include datasets that utilised in the literature, challenges faced by the prior researchers, motivations, and recommendations for reducing the drawbacks in the available literature. The systematic rules are utilized for searching all relevant papers on image dehazing using several keyword diversities along with a glance for assessment and the benchmark studies. image dehazing, which generally eliminates undesirable pictographic effects is often considered as an image enhancement method. A completely automated process, a valid assessment strategy, and databases based on diverse settings are needed for it to operate under real-time applications. Many relevant studies are conducted in order to achieve these substantial goals. We examined numerous image dehazing methods and assessed the objectivity of the results. The results of our study precisely reflect numerous observations on image dehazing regions in contrast to other review articles. We believe that the findings of the study can be a helpful set of recommendations for professionals looking for a full understanding of ima
In recent years, convolutional neural networks (CNNs) have increasingly become the predominant approach for image classification, demonstrating remarkable performance in this domain. Notably, VGG16 has gained widespre...
详细信息
ISBN:
(数字)9798350355413
ISBN:
(纸本)9798350355420
In recent years, convolutional neural networks (CNNs) have increasingly become the predominant approach for image classification, demonstrating remarkable performance in this domain. Notably, VGG16 has gained widespread adoption due to its deep architecture and robust feature extraction capabilities. Consequently, this paper investigates the efficacy of employing the VGG16 convolutional neural network model for image classification on the CIFAR-10 dataset. The architecture of the VGG16 model and its distinctive application characteristics are first introduced, followed by an exploration of key steps including data preprocessing, model training, and hyperparameter tuning. 99.4% accuracy on the training set and 90.6% accuracy on the test set were attained after the training procedure was optimized. To improve the model's classification performance even more, this paper identifies limitations inherent in the current framework and proposes optimization strategies such as incorporating attention mechanisms and transfer learning. These strategies provide valuable insights for future applications of deep learning techniques on small-scale image datasets.
暂无评论