In a highly digitalised world, this paper aims at closing the gap towards automatic digitisation from 2D architectural drawings. We present the new image dataset Plan, and Elevation Representations of Doors And Window...
详细信息
ISBN:
(纸本)9783031624940;9783031624957
In a highly digitalised world, this paper aims at closing the gap towards automatic digitisation from 2D architectural drawings. We present the new image dataset Plan, and Elevation Representations of Doors And Windows (Perdaw) which provides a baseline for different classification problems with varying complexity. We investigate the performance of three machinelearning models in distinguishing different types of doors and windows in their plan and elevation views. Our findings show that Inception V3 slightly outperforms MobileNet V2, which suggests that the latter solves the same classification tasks with less computational resources with only a minimal compromise in accuracy. Among the three investigated models, ResNet50 yields the lowest quality metrics within a small margin. Overall, all models perform better at classifying building components in their elevation views compared to their plan views. We consistently observed that the models yield the best results with 100% accuracy for the binary classification problems, and dropped to close to 70% accuracy for the 40-class classification problems.
Object detection is a fundamental task in computer vision and image understanding, with the goal of identifying and localizing objects of interest within an image while assigning them corresponding class labels. Tradi...
详细信息
This paper investigates the usage of generative opposed networks (GANs) and recurrent neuralnetworks (RNNs) for medical photo segmentation. First, clinical picture segmentation is described and discussed, after which...
详细信息
The first paper investigating the use of machinelearning to learn the relationship between an image of a scene and the color of the scene illuminant was published by Funt et al. in 1996. Specifically, they investigat...
详细信息
One of the most important applications of UAVs is person detection for security or rescue tasks. The goal of the proposed paper is to develop, experiment, and compare the performance of two new neuralnetworks based o...
详细信息
ISBN:
(纸本)9798350369458;9798350369441
One of the most important applications of UAVs is person detection for security or rescue tasks. The goal of the proposed paper is to develop, experiment, and compare the performance of two new neuralnetworks based on the transformer architecture, Detection Transformer and Vision Transformer. Two datasets were used, an own one for testing and COCO for learning. The results are promising to take into account the difficulties of person detection at a distance.
Brain tumour to be effectively treated, it is essential to identify and diagnose *** imageprocessing has advanced tremendously in recent years due to machine techniques for learning. Convolutional neuralnetworks (CN...
详细信息
This study investigates the machinelearning techniques for unsupervised image classification and quality assessment in the domain of ultrasound imaging. Leveraging Convolutional neuralnetworks (CNNs) for feature ext...
详细信息
The precise detection of plant centres is important for growth monitoring, enabling the continuous tracking of plant development to discern the influence of diverse factors. It holds significance for automated systems...
详细信息
ISBN:
(纸本)9798350372977;9798350372984
The precise detection of plant centres is important for growth monitoring, enabling the continuous tracking of plant development to discern the influence of diverse factors. It holds significance for automated systems like robotic harvesting, facilitating machines in locating and engaging with plants. In this paper, we explore the YOLOv4 (You Only Look Once) real-time neural network detector for plant centre detection. Our dataset, comprising over 12,000 images from 151 Arabidopsis thaliana accessions, is used to fine-tune the model. Evaluation of the dataset reveals the model's proficiency in centre detection across various accessions, boasting an mAP of 99.79% at a 50% IoU threshold. The model demonstrates real-time processing capabilities, achieving a frame rate of approximately 50 FPS. This outcome underscores its rapid and efficient analysis of video or image data, showcasing practical utility in time-sensitive applications.
Shadow performs an important role in the image, which can enhance the image effect and convey important visual clues. We propose a method based on deep learning to automatically generate stylized shadows for line draw...
详细信息
ISBN:
(纸本)9783031442001;9783031442018
Shadow performs an important role in the image, which can enhance the image effect and convey important visual clues. We propose a method based on deep learning to automatically generate stylized shadows for line drawings. Based on StarGAN, a shadow generation adversarial network (ShadowGAN) is designed, which can automate the creation of stylized shadows with different light directions. This method defines eight light directions. Users can select one of the eight light directions around the 2D image to specify the light source according to the encoding of the light direction, and generate the shadow corresponding to the light direction. We use a new dataset containing line drawings with shadows and label information corresponding to the light direction. Experiments show that our method can generate stylized shadows for line drawing with satisfactory quality, which can simplify the user's workflow, and save the time of drawing line drawing shadows.
Nowadays, deep learning architectures like CNN have proven their superiority in image recognition tasks. To effectively deploy CNN networks in practice, especially for AIoT applications, it is essential to find a netw...
详细信息
暂无评论