ISBN: 9781728168326 (print)
The proceedings contain 56 papers. The topics discussed include: exploring the gradient for video quality assessment; an efficient approach for using expectation maximization algorithm in capsule networks; image watermarking by Q learning and matrix factorization; attention-based face antispoofing of RGB camera using a minimal end-2-end neural network; DeepFaceAR: deep face recognition and displaying personal information via augmented reality; a classified and comparative study of 2-D convolvers; class attention map distillation for efficient semantic segmentation; and monitoring wrist and fingers range of motion using leap motion camera for physical rehabilitation.
ISBN: 9798350348842 (print)
The proceedings contain 43 papers. The topics discussed include: NaN attacks: bit-flipping deep neural network parameters to NaN or infinity; cutting tool condition monitoring based on a machine learning approach by using vibration signals; vision transformer and residual network-based autoencoder for RGBD data processing in robotic grasping of noodle-like objects; classification of short-interfering RNA through transformer encoder model; fault diagnosis of spur gearbox by image classification using deep CNN; optimal energy management for residential with renewable energy integration; hardware design and implementation of smart energy monitoring device for smart home; human detection and human pose classification for mobile robots interaction; and vortex ring and dynamic lift generated by downstroke of elastic butterfly wing.
Classification is the most important task in distinguishing Epizootic Ulcerative Syndrome (EUS) fish disease images from normal fish images. A YCbCr image processing technique and a machine learning SVM are applied for image classificat...
Fault diagnosis of two-stage gearboxes using deep learning techniques has gained significant attention in the last decade. Spur gearboxes are widely used for power transmission in industry, which makes the fault dia...
ISBN: 9783031472237 (print)
The proceedings contain 46 papers. The special focus in this conference is on Advanced Computing, Machine Learning, Robotics and Internet Technologies. The topics include: Galactic Simulation: Visual Perception of Anisotropic Dark Matter; Protocol Anomaly Detection in IIoT; Agricultural Informatics & ICT: The Foundation, Issues, Challenges and Possible Solutions - A Policy Work; A Guava Leaf Disease Identification Application; Text to Image Generation Using Attentional Generative Adversarial Network; Attention-CoviNet: A Deep-Learning Approach to Classify Covid-19 Using Chest X-Rays; A Deep Learning Framework for Violence Detection in Videos Using Transfer Learning; Multi-focus Image Fusion Methods: A Review; Cache Memory and On-Chip Cache Architecture: A Survey; Authenticating Smartphone Users Continuously Even if the Smartphone is in the User’s Pocket; Comparative Analysis of Machine Learning Algorithms for COVID-19 Detection and Prediction; Machine Learning Classifiers Explanations with Prototype Counterfactual; A Systematic Study of Super-Resolution Generative Adversarial Networks: Review; Stance Detection in Manipuri Editorial Article Using CRF; Deep Learning Based Software Vulnerability Detection in Code Snippets and Tag Questions Using Convolutional Neural Networks; A Comprehensive Study of the Performances of Imbalanced Data Learning Methods with Different Optimization Techniques; Smart Parking System Using Arduino and IR Sensor; QuMaDe: Quick Foreground Mask and Monocular Depth Data Generation; Fine-Grained Air Quality with Deep Air Learning; Enhancing Melanoma Skin Cancer Detection with Machine Learning and Image Processing Techniques; Image Processing Technique and SVM for Epizootic Ulcerative Syndrome Fish Image Classification.
The proceedings contain 16 papers. The topics discussed include: artificial intelligence for the future of construction; cobots and industrial robots; predictive maintenance for wind turbine bearings: an MLOps approach with the DIAFS machine learning model; development of an artificial intelligence tool and sensing in informatization systems of mobile robots; PCA-NuSVR framework for predicting local and global indicators of tunneling-induced building damage; design and deployment of data development toolkit in cloud manufacturing environments; research and development of image processing algorithms for effective recognition of various gestures in real time; machine learning models for the recognition of commands in smart home technologies; responsive dehydration: sensor-driven optimisation of production cycles in a solar dehydrator; and formation of the method of description and control of the relative position of the links of the upper limbs of the grip of an anthropomorphic robot.
Aerial search and response plays an important role in finding and rescuing persons in need. Unmanned Aerial Vehicle (UAV)-acquired aerial images provide an intensive profile search area and facilitate identification ...
An uncommon sort of cancer called malignant sinonasal cancer, commonly referred to as sinusoidal carcinoma, arises in the paranasal sinuses or nasal cavity. Most sinonasal tumors occur in the maxillary sinuses located...
Early and accurate diagnosis using retinal image processing is critical for enabling optimized patient care. Existing diagnostic techniques in medical image processing often face limitations. This research s...
ISBN: 9798350372748 (digital)
ISBN: 9798350372748 (print)
Image Caption Generation (ICG), situated at the confluence of computer vision and natural language processing, empowers machines to comprehend visual content and express it in human-like language. This research offers a comprehensive overview of key concepts, methodologies, and challenges in ICG. The process involves developing algorithms for the automatic generation of contextually relevant captions, utilizing deep neural networks for feature extraction, and employing natural language processing techniques for coherent composition. Recent advancements, particularly in convolutional neural networks for image processing and recurrent neural networks for language modelling, have significantly elevated the performance of image captioning systems. The study delves into the core components of an ICG system, including pre-processing techniques for image data, feature extraction mechanisms, and the integration of language models. Attention mechanisms, a key innovation in this field, enable the model to focus on relevant image regions while generating captions, closely mirroring human attention patterns. Despite notable progress, ICG faces several challenges, such as handling diverse and complex visual scenes, ensuring cross-modal coherence between images and captions, and addressing biases present in training data. Ethical considerations, particularly in applications like automated content generation, are also discussed. The study concludes by highlighting potential future directions in ICG research, including the incorporation of multimodal learning approaches, enhancing the interpretability of generated captions, and addressing societal concerns related to bias and fairness. As ICG continues to evolve, it holds promise for various applications, ranging from accessibility for the visually impaired to improving content indexing and retrieval in multimedia databases. The research also underscores the significance of the accuracy attainments, showcasing the success of the pr