In this paper we describe the problem of painting style classification into five classes: impressionism, realism, expressionism, post-impressionism and romanticism. While most previous approaches relied on image proce...
详细信息
ISBN:
(数字)9781728188119
ISBN:
(纸本)9781728188126
In this paper we describe the problem of painting style classification into five classes: impressionism, realism, expressionism, post-impressionism and romanticism. While most previous approaches relied on imageprocessing and manual feature extraction from painting images, our model based on the ResNet architecture and pre-trained on the imageNet dataset operates on the raw pixel level. The training has been performed on a large dataset (about 43k images for five class style classification problem). To increase the quality of final model a large number of various augmentations were used: random Affine transform, crop, flip, color jitter (i.e. contrast, hue, saturation), normalization, a scheduler for the optimizer. Finally model weights were pruned which allowed increasing accuracy up to 51.5% and decreasing computation time as well.
Convolutional neural network based on machine learning has become one of the most popular methods of image recognition and segmentation, but it needs huge data samples to get better results. In this study, based on U-...
详细信息
ISBN:
(纸本)9781450376273
Convolutional neural network based on machine learning has become one of the most popular methods of image recognition and segmentation, but it needs huge data samples to get better results. In this study, based on U-Net network, a two-stage convolution neural network method for automatic segmentation of Terminal bulb of Caenorhabditis elegans in small samples was proposed. The method solves the problem that traditional single-stage network cannot be implemented in small samples. The Dice coefficient reaches 89.5%, which is higher than that of the baseline U-Net method.
Saliency detection is a challenging direction in the field of computervision. It can help us quickly obtain important information of image and video, and plays an important role in image compression, image segmentati...
详细信息
RAW is a digital file contains the camera-captured image data regarding the sensor pixels values and text information. The raw is being highlighted as digital negative and varies with the formats, which depend on hard...
详细信息
ISBN:
(纸本)9789811358029;9789811358012
RAW is a digital file contains the camera-captured image data regarding the sensor pixels values and text information. The raw is being highlighted as digital negative and varies with the formats, which depend on hardware manufacturer. The raw processing is significant to ignore the duplication of data, to economize the space needed, to ease image file operations, and to have an uninterrupted capturing. The image quality is the substantial parametric quantity which determines the visual of the captured raw. The most extreme resolution with no inbuilt compression (raw) results in high image from any digital camera. The proposed workflow is to extract the contents of raw sensor information from the raw files and processing and displaying the information in image format. The raw test files were gathered from cameras by different manufactures. The MATLAB R2016a has been used for executing the workflow and analysis purpose. The display quality is ensured by the performance parametric-Quality of image Improvement (QOII), and also the file size reduction ratio was analyzed.
The process of extracting textual regions from the scene images is a significant matter in the field of imageprocessing & computervision. It is very challenging due to different fonts, variable font size, illumi...
详细信息
ISBN:
(纸本)9789811359927;9789811359910
The process of extracting textual regions from the scene images is a significant matter in the field of imageprocessing & computervision. It is very challenging due to different fonts, variable font size, illumination conditions and complex background etc. In last decade, image segmentation using Maximal Stable Extremal Regions (MSERs) played an important role in this area due to its various advantages. The generation of MSERs is controlled by variation of stability factor delta in deciding the promising stable areas. The aim of this paper is to study the effect of parameter delta and calculate the optimal delta on the different versions of MSER for detection and localization of text in scene images. Four different features Stroke Width Heterogeneity, Perpetual Color Contrast, Histogram of Oriented Gradients at Edges, Occupy Rate are used to evaluate the probability of text using naive Bayes Model for each version of MSERs. The Training is accomplished on the ICDAR 2013 training dataset and experiments for testing our method are carried out on ICDAR datasets to show the importance of delta (optimal value) parameter of MSER in providing the optimum results expressed as f-measure, recall and precision.
Scene perception of mobile robot is that the robot realizes the perception and understanding of the surrounding environment through a series of sensors configured by itself, and the perception technology based on visi...
详细信息
ISBN:
(纸本)9781450362948
Scene perception of mobile robot is that the robot realizes the perception and understanding of the surrounding environment through a series of sensors configured by itself, and the perception technology based on vision has always been a hot and difficult point in research. Therefore, in terms of visual environment perception algorithms, there have been numerous valuable research achievements in recent years. In particular, the object detection and segmentation algorithm based on convolutional neural network has shown good performance in simple scene, but there are still some limitations when these algorithms are directly applied to actual scenes. In this paper, we study the practical application of vision-based environmental perception of mobile robots in complex scenes, presents a unified algorithm architecture of object detection and road segmentation, and build a vision-based mobile robot's environment perception system. Firstly, the image acquisition of the surrounding environment is completed by the computer camera mounted on the robot, and then the obstacle detection and the segmentation of the drivable area are achieved by using the target detection and segmentation algorithm. In order to meet the real-time requirements, the detection and segmentation algorithms share the same feature extraction network, and are jointly trained as one framework. Finally, according to the detection and segmentation results, the robot can automatically avoid obstacles and move in the drivable area.
Aiming at the problem of moving objects occluded in the field of automatic driving, in this paper, we propose a framework for occlusion relationship recognition based on distance measurement of the moving objects in v...
详细信息
ISBN:
(纸本)9781450376822
Aiming at the problem of moving objects occluded in the field of automatic driving, in this paper, we propose a framework for occlusion relationship recognition based on distance measurement of the moving objects in video. The framework performs moving objects detection by YOLOv3 and occlusion recognition of moving objects by the information entropy. When the occlusion occurs, the framework will obtain the distance between the moving objects and the camera by binocular stereo vision distance measurement. Finally, the framework completes occlusion relationship recognition using a principle derived from this paper. The experimental results show that distance measurement of moving objects in video is effective and feasible for occlusion relationship recognition, which can provide some help for visual technology in the field of automatic driving.
This research contrasts and compares the state-of-the-art techniques of the two approaches within the domain of news sentiment analysis, as well as, investigates a novel document encoding representation of the 39;TF...
详细信息
ISBN:
(纸本)9781450376259
This research contrasts and compares the state-of-the-art techniques of the two approaches within the domain of news sentiment analysis, as well as, investigates a novel document encoding representation of the 'TF-IDF momentum matrix'. The presented lexicon-based methodology is centred around Loughran & McDonald financial sentiment word lists and reaches 86.4% explained stock momentum variance, whereas the classification approach follows a thematic analysis pipeline implementing Latent Dirichlet Allocation and achieves that of 94.8%. As an additional element of model evaluation, the research implements Thermal Optimal Path method which relies on a dynamic programming approach for performance optimisation.
Software simulation is one of the most common methods to study worm propagation. However, simulation process and software interface, which lack unified description, cause inefficiencies of worm propagation simulation....
详细信息
Software simulation is one of the most common methods to study worm propagation. However, simulation process and software interface, which lack unified description, cause inefficiencies of worm propagation simulation. In order to achieve a unified description for worm propagation simulation process, we propose a Worm Propagation Simulation Description Language (WPSDL), which characterizes the software interfaces and parameters of each module in simulation process in a unified way. Meanwhile, we develop a simulation system of worm propagation oriented our presented language. In this system, we use related commands of the description language as interactive instructions, and then implement worm propagation simulation for different types of network worms. In addition, we conduct a series of experiments to analyse the impacts of worm propagation mechanism and network topology structure. The experiments show that increasing infection ability and scanning rate can improve the propagation scale significantly, and the propagation delay can influence the outbreak time and explosion intensity of worm propagation. The experiments not only provide support for the study of worm propagation mechanisms, but also verify the practicability of the description language and the simulation system.
Disease identification plays a vital role in agricultural sector. Turmeric being a rhizomatous crop and well known for its therapeutic effects, monitoring such crops is crucial. The turmeric leaves are mainly exposed ...
详细信息
暂无评论