Multi-objective evolutionary algorithms (MOEAs)-based fuzzy clustering have been successfully applied to image segmentation problems. However, these MOEAs require a lot of expensive fitness function evaluations. In or...
详细信息
We present a novel deep-learning architecture and fusion model as a promising approach to characterize objects in recycling streams, particularly electrolyzer scraps. Electrolyzers are of particular interest due to th...
We present a novel deep-learning architecture and fusion model as a promising approach to characterize objects in recycling streams, particularly electrolyzer scraps. Electrolyzers are of particular interest due to their complexity and significance for future green tech. By combining RGB and Hyperspectral data, the model enables a comprehensive understanding of the spectral and spatial properties of the materials. To facilitate near real-time hyperspectral feature extraction, we propose an attention-based band selection method. Further-more, a auxiliary loss function is introduced to enhance the learning capability. Leveraging mid-level feature fusion techniques, the model effectively combines modalities, achieving an impressive up to 98.65% mAP on the dataset. Our proposed architecture offers a promising solution for the efficient and accurate detection of electrolyzer materials, and its adaptability to other material streams could significantly improve recycling processes.
Multi-frame video super-resolution(VSR) aims to restore a high-resolution video from both its corresponding low-resolution frame and multiple neighboring frames, in order to make full use of the inter-frame informatio...
详细信息
ISBN:
(数字)9781665482233
ISBN:
(纸本)9781665482233
Multi-frame video super-resolution(VSR) aims to restore a high-resolution video from both its corresponding low-resolution frame and multiple neighboring frames, in order to make full use of the inter-frame information. However, vast computation complexity hinders the inference speed of video super-resolution. In order to increase the inference speed while ensuring the accuracy of the model, we proposed an efficient and parallel multi-frame VSR network, termed EPVSR. The proposed EPVSR is based on spatio-temporal adversarial learning to achieve temporal consistency and uses TecoGAN as the baseline model. By adding an improved non-deep network, which is composed of parallel subnetworks with multi-resolution streams, these streams are fused together at regular intervals to exchange information. we reduced the number of parameters and make the model lighter. Besides, we implement structural re-parameterization network acceleration technique to optimize the inference process of EPVSR network. Finally, our EPVSR achieves the real-timeprocessing capacity of 4K@36.45FPS. compared with TecoGAN, we achieve 9.75 x performance speedups, but the effect is not reduced. the PSNR of EGVSR are increased by 3.36%. The experimental results show that the nondeep network can effectively speed up the model inference, and the proposed EPVSR has a good super-resolution effect.
This research study shows an effective deformable complex 3D image reconstruction and image synthesis technique by consolidating needed high-level features from a deep convolutional neural network (CNN) system. By rec...
详细信息
This research study shows an effective deformable complex 3D image reconstruction and image synthesis technique by consolidating needed high-level features from a deep convolutional neural network (CNN) system. By recognising the inherent deep features in image patches lead to information discovery in medicinal imaging. Utilising the ADNI and LONI imaging datasets, the performance of the proposed deeplearning algorithm image reconstruction and synthesis performance was verified. For validation, various performance indices obtained with the proposed deeplearning algorithm were compared with two conventional algorithms namely support vector machine and CNN. Likewise, to reveal the adaptability of the proposed image reconstruction and synthesis system, synthesis and reconstruction experiments were directed on the 7 T cerebrum magnetic resonance image. As presented in the study outcomes, the proposed method can accomplish predominant performance compared with other cutting-edge techniques with either low- or high-level features in terms of the synthesis and reconstruction rate. The proposed algorithm has a training time of 5 s with a structural similarity index of 0.97. In all investigations, the outcome shows that the proposed image reconstruction framework reliably exhibited progressively precise outcomes when contrasted with best in class. Hence, it can be used for possible precise image reconstruction and synthesis related applications.
One of the most difficult problems that develop when brain cells start to grow out of control is a brain tumor, which is regarded as the most lethal disease of the century. Finding and identifying malignant brain magn...
详细信息
One of the most difficult problems that develop when brain cells start to grow out of control is a brain tumor, which is regarded as the most lethal disease of the century. Finding and identifying malignant brain magnetic resonance imaging (MRI) images is the major challenge before therapy. Researchers have been putting a lot of effort into creating the best method for more accurate real-world medical image recognition. For manual categorization, it is quite time-consuming to segment large quantities of MRI data. To mitigate these issues, this paper suggests the information exchange gateway-based residual UNet (IEGResUNet) model, which uses the ResUNet model as a base model. Additionally, including principal component analysis (PCA) data augmentation will increase the model's efficiency while also enhancing its speed. The IEGResUNet model shows an ablation investigation on three Brats datasets, with and without PCA augmentation. The results demonstrate that IEGResUNet will improve segmentation effectiveness and can also manage the imbalance in input data when PCA data augmentation models are included. The dice score on BraTS 2019 for whole tumor, region of core tumor, and region of enhancing tumor were 0.9083, 0.883, and 0.8106 respectively. Also, on BraTS 2020, the dice score for WT, CT, and ET 0.9083, 0.883, and 0.8106 was respectively. Similarly, on BraTS 2021, the dice score for WT, CT, and ET was 0.8737, 0.8866, and 0.7963 respectively. Comparing against baseline models, the IEGResUNet scored well in terms of dice score and intersection over union.
The proceedings contain 55 papers. The special focus in this conference is on Machine learning and Information processing. The topics include: Performance Analysis of Different Models for Twitter Sentiment;electricity...
ISBN:
(纸本)9789813348585
The proceedings contain 55 papers. The special focus in this conference is on Machine learning and Information processing. The topics include: Performance Analysis of Different Models for Twitter Sentiment;electricity Forecasting Using Machine learning: A Review;end to End learning Human Pose Detection Using Convolutional Neural Networks;conference Paper Acceptance Prediction: Using Machine learning;Object Identification and Tracking Using YOLO Model: A CNN-Based Approach;real-time Hands-Free Mouse Control for Disabled;accounting Fraud Detection Using K-Means Clustering Technique;transforming the Lives of Socially Dependent to Self-dependent Using IoT;enforcement an Evidence and Quality of Query Services in the Cost-Effective Cloud;Prediction and Classification of Biased and Fake News Using NLP and Machine learning Models;a Machine learning Approach Towards Increased Crop Yield in Agriculture;resume Screening Using Natural Language processing and Machine learning: A Systematic Review;assessment of Osteogenic Sarcoma with Histology images Using deeplearning;SMDSB: Efficient Off-Chain Storage Model for Data Sharing in Blockchain Environment;ray Tracing Algorithm for Scene Generation in Simulation of Photonic Mixer Device Sensors;a Study of Quality Metrics in Agile Software Development;Optimization of Ray-Tracing Algorithm for Simulation of PMD Sensors;real-time Object Detection for Visually Challenged;reinforcement learning: A Survey;smart Leaf Disease Detection Using imageprocessing;data Encryption on Cloud Database Using Quantum Computing for Key Distribution;prediction and Prevention of Addiction to Social Media Using Machine learning;analysis of Block Matching Algorithms for Motion Estimation in Video Data;information Retrieval Based on Telugu Cross-Language Transliteration;Success of H1-B VISA Using ANN.
Detecting objects in natural scenes can be a very challenging task. In several real-life scenarios it is often found that visible spectrum is not ideal for typical computer vision tasks. Going beyond the range of visi...
详细信息
Detecting objects in natural scenes can be a very challenging task. In several real-life scenarios it is often found that visible spectrum is not ideal for typical computer vision tasks. Going beyond the range of visible light spectrum, such as the near infrared spectrum or the thermal spectrum allows us to capture many unique properties of objects that normally not captured with a normal camera. In this work we propose two multi-spectral dataset with three different spectrum, namely, the visible, near infrared and thermal spectrum. The first dataset is a single object dataset where we have common desk objects of 25 different categories comprising of various materials. The second dataset comprises of all possible combination using these 25 objects taking a pair at a time. The objects are captured from 8 different angles using the three different cameras. The images are registered and cropped and provided along with classification and localization ground truths. Additionally classification benchmarks have been provided using the ResNet, InceptionNet and DenseNet architectures on both the datasets. The dataset would be publicly available from .
Acoustic signal processing holds significant promise for real-time fish feeding intensity estimation in aquaculture. Unlike traditional methods reliant on visual cues or sensor data, acoustic analysis provides valuabl...
Acoustic signal processing holds significant promise for real-time fish feeding intensity estimation in aquaculture. Unlike traditional methods reliant on visual cues or sensor data, acoustic analysis provides valuable insights into feeding behavior and demand. By capturing indicators such as water splashing, acoustic techniques can estimate the current feeding demand of fish. Acoustic techniques in aquaculture remain under explored, especially those delving into temporal information within acoustic spectrograms. This paper presents an intelligent monitoring approach using deeplearning and acoustic signals. It investigates the perceptual domain of fish feeding acoustic spectrum recognition, extracting insights from Mel Spectrogram feature maps. Employing a supervised machine learning method with a multi-instance multi-label technique, the study classifies audio events during operational scenarios. Furthermore, the research assesses the effectiveness of the proposed neural network (NN) models for multi-label classification by comparing it with established NN architectures like AlexNet, ResNet18, and VGG11, showcasing its superior performance.
With the development of new technologies such as big data, cloud computing, and the Internet of Things, network attack technology is constantly evolving and upgrading, and network attack detection technology is forced...
详细信息
With the development of new technologies such as big data, cloud computing, and the Internet of Things, network attack technology is constantly evolving and upgrading, and network attack detection technology is forced to undergo corresponding iterative evolution. Three main problems are associated with these technologies: the automatic representation of heterogeneous and complex network traffic data, the uneven network attack samples, and the contradiction between the accuracy of the anomaly detection model and the continuous evolution of attacks. Researchers have proposed several network attack detection techniques based on deeplearning to address these problems. This study reviews and analyzes the studies aimed at dealing with such problems, considering multiple factors, such as models, traffic representation and feature extraction, threat detection model training, and model robustness improvement. Finally, the existing problems and challenges associated with the current research are analyzed with respect to data category imbalance, high-dimensional massive data processing, concept distribution drift, real-time interpretability of the detection model, and the security of the model.
Photometric stereo is a well-established technique to es-timate the surface normal of an object. However, the re-quirement of capturing multiple high dynamic range images under different illumination conditions limits...
详细信息
ISBN:
(数字)9798350353006
ISBN:
(纸本)9798350353013
Photometric stereo is a well-established technique to es-timate the surface normal of an object. However, the re-quirement of capturing multiple high dynamic range images under different illumination conditions limits the speed and real-time applications. This paper introduces EventPS, a novel approach to real-time photometric stereo using an event camera. Capitalizing on the exceptional temporal resolution, dynamic range, and low bandwidth character-istics of event cameras, EventPS estimates surface nor-mal only from the radiance changes, significantly enhancing data efficiency. EventPS seamlessly integrates with both optimization-based and deep-learning-based photo-metric stereo techniques to offer a robust solution for non-Lambertian surfaces. Extensive experiments validate the effectiveness and efficiency of EventPS compared to frame-based counterparts. Our algorithm runs at over 30 fps in real-world scenarios, unleashing the potential of EventPS in time-sensitive and high-speed downstream applications.
1 1
Code available: https://***/ybh1998/EventPS
暂无评论