Today, the high competition among domestic automobile manufacturers is intense situation than previous years. This result gives advantages in a good variety of brands, models, engine sizes and appearances. This can ca...
详细信息
ISBN:
(纸本)9789819916474;9789819916481
Today, the high competition among domestic automobile manufacturers is intense situation than previous years. This result gives advantages in a good variety of brands, models, engine sizes and appearances. This can cause some critical issues in recognizing and recalling a car by manufacturer. In addition, an owner may modify some parts of original vehicle such as the head bumper, the rear bumper, and the head light. This modification also affects the people who are looking for pre-owned cars. Despite the fact, the details are mismatch with the vehicle registration book that issued by the Department of Land Transport. From this incident, the researchers implemented a convolutional neural network (CNN) in the identification of vehicle characteristics to reduce the ambiguity for each car's models. The researchers conducted experiments using five algorithms. SVM, ResNet34, ResNet50 and Inception-ResNetV2. The researchers set up a library of two car models, Toyota Hilux and Honda Civic sedan and Civic Hatch-back, including models from past ten years ago until the present. The images are of 224 x 224 pixels. The data are categorized into two sets, a training set has 1,449 images which is counted as 80% of total images and a testing set is having 362 images which is about 20% of total. The total images are 1,811 and 26 Classes. Our experiments compared the accuracies of SVM, ResNet34, ResNet50, and Inception-ResNetV2, which came out to be 21.4%, 55.5%, 66.6%, and 92.8% respectively. As a result, Inception-ResNetV2 outperforms among all other methods.
作者:
Menconero, SofiaDepartment of History
Representation and Restoration of Architecture Sapienza University of Rome Piazza Borghese 9 Rome00186 Italy
Piranesi printed the 16 etchings of the Carceri d’invenzione in 1761. This version, which is more widespread and better known, derives from the reworking of matrices that the Venetian engraver produced in 1749–50. T...
详细信息
Communicating online without fearing third-party interventions is becoming a challenge in the modern world. Especially the sectors like the military, and government organizations or private companies sharing sensitive...
详细信息
Surface water resource identification is one of the main techniques used in remote sensing image analysis. This is necessary to stop calamities like floods and droughts. Feature selection based on prior information an...
详细信息
Total p-norm Variation (TpV) is a well-established technique in imageprocessing, used to denoise and preserve edges. However, the related non-convex minimization is still a challenging task in optimization, both for ...
详细信息
To article proposes an approach to improve the quality of data used in various processes based on machine vision systems. The paper proposes a combined approach to applying the method of multi-criteria processing base...
详细信息
ISBN:
(数字)9781510662117
ISBN:
(纸本)9781510662100;9781510662117
To article proposes an approach to improve the quality of data used in various processes based on machine vision systems. The paper proposes a combined approach to applying the method of multi-criteria processing based on the use of a combined criterion in order to implement an edge detector, smoothing and separation areas of the background / object in the image. The application of the method allows eliminating the noise caused by external factors (such as dust and water suspension on the lens or space). The generated data make it possible to form an adaptive criterion for changing the correction parameters for a non-linear change in color balance in areas of increased detail or selected masks of changes blocks. The proposed algorithms make it possible to increase the visibility of small elements, reduce the noise component, while maintaining the boundaries of objects, increase the accuracy of selecting the boundaries of objects and the visual quality of data. As test data used to evaluate the effectiveness, nature data and expert evaluation results for test images obtained by a machine vision system with a sensor with a resolution of 1024x768 (8-bit, color image, visible range) are used. images of simple shapes are used as analyzed objects.
A vital and rapidly growing application, remote sensing offers vast yet sparsely labeled, spatially aligned multimodal data;this makes self-supervised learning algorithms invaluable. We present CROMA: a framework that...
详细信息
ISBN:
(纸本)9781713899921
A vital and rapidly growing application, remote sensing offers vast yet sparsely labeled, spatially aligned multimodal data;this makes self-supervised learning algorithms invaluable. We present CROMA: a framework that combines contrastive and reconstruction self-supervised objectives to learn rich unimodal and multimodal representations. Our method separately encodes masked-out multispectral optical and synthetic aperture radar samples-aligned in space and time-and performs cross-modal contrastive learning. Another encoder fuses these sensors, producing joint multimodal encodings that are used to predict the masked patches via a lightweight decoder. We show that these objectives are complementary when leveraged on spatially aligned multimodal data. We also introduce X- and 2D-ALiBi, which spatially biases our cross- and self-attention matrices. These strategies improve representations and allow our models to effectively extrapolate to images up to 17.6x larger at test-time. CROMA outperforms the current SoTA multispectral model, evaluated on: four classification benchmarks-finetuning (*** arrow 1.8%), linear (*** arrow 2.4%) and nonlinear (*** arrow 1.4%) probing, kNN classification (*** arrow 3.5%), and K-means clustering (*** arrow 8.4%);and three segmentation benchmarks (*** arrow 6.4%). CROMA's rich, optionally multimodal representations can be widely leveraged across remote sensing applications.
This paper presents a study of end-to-end methods for predicting autonomous vehicle navigation parameters. image-based and image & Lidar points-based end-to-end models have been trained under Nvidia learning archi...
详细信息
ISBN:
(数字)9781665460262
ISBN:
(纸本)9781665460262
This paper presents a study of end-to-end methods for predicting autonomous vehicle navigation parameters. image-based and image & Lidar points-based end-to-end models have been trained under Nvidia learning architectures as well as Densenet-169, Resnet-152 and Inception-v4. Various learning parameters for autonomous vehicle navigation, input models and pre-processing data algorithms i.e. image cropping, noise removing, semantic segmentation for image data have been investigated and tested. The best ones, from the rigorous investigation, are selected for the main framework of the study. Results reveal that the Nvidia architecture trained image & Lidar points-based method offers the better results accuracy rate-wise for steering angle and speed.
The rapid development of the Internet of Things (IoT) is enabling a wide range of applications in intelligent medical systems. Among others, medical imaging equipment produces sensitive user privacy information, howev...
详细信息
ISBN:
(纸本)9798350366457;9798350366440
The rapid development of the Internet of Things (IoT) is enabling a wide range of applications in intelligent medical systems. Among others, medical imaging equipment produces sensitive user privacy information, however, current solutions from academia and industry often neglect the importance of secure communication mechanisms. There are open research challenges such as low real-time processing and poor security, even when using cryptography. This paper proposes EDBNet, an efficient encryption network based on deep and broad learning to improve patient privacy for medical images. To be specific, a four-layer convolutional neural network is employed to extract the horizontal and vertical factors and utilize broad learning to guide the training model to obtain two feature matrices. The training process includes pre-training and fine-tuning, with the open-source COVID-CT-Dataset enabling dual-stream encryption. To further enhance ciphertext image security in a privacy-protected environment, chaotic cryptography is utilized to consummate the encryption network, which includes scrambling and diffusion combing with the SHA3-256 algorithm. The proposed EDBNet is evaluated by extensive experiments, which show that it outperforms several state-of-the-art algorithms, such as the average cipher entropy of 7.9971, encryption quality of 248, and encrypted/decrypted time of around 1 second.
作者:
Han, XinyuGong, XunSouthwest Jiaotong Univ
Sch Comp & Artificial Intelligence Chengdu 611756 Peoples R China Minist Educ
Engn Res Ctr Sustainable Urban Intelligent Transp Chengdu 611756 Peoples R China Southwest Jiaotong Univ
Natl Engn Lab Integrated Transportat Big Data App Chengdu 611756 Peoples R China Southwest Jiaotong Univ
Mfg Ind Chains Collaborat & InformationSupport Te Chengdu 611756 Peoples R China
Ultrasound images are vital for medical diagnostics but often suffer from information loss and blurred details due to limitations in imaging systems and sensor technologies. Many researchers have proposed super-resolu...
详细信息
ISBN:
(纸本)9798350377620;9798350377613
Ultrasound images are vital for medical diagnostics but often suffer from information loss and blurred details due to limitations in imaging systems and sensor technologies. Many researchers have proposed super-resolution algorithms to enhance medical images. But existing super-resolution methods struggle with the uneven distribution of noise and clarity. To address this, we propose a super-resolution algorithm for medical images based on multi-scale feature aggregation Leveraging the architecture of Unet as our primary framework, our method enhances output details through multi-scale hole convolutions. Taking into consideration the characteristics of ultrasonic images, we propose a frequency domain-based module to enhances edge information while effectively denoising the image. Furthermore, to improve the quality of the output images, we introduce an imageprocessing nodule grounded in global information. The module ensures clarity while considering the overarching context, thereby preserving global consistency and enhancing output quality. We experiment on three datasets to demonstrate the effectiveness of our model. Additionally, significant improvements in medical image segmentation are observed, proving the practicability of our proposed approach.
暂无评论