Diabetic retinopathy (DR) is the leading cause of avertable blindness globally. Retinal scanning of eyes is critical for examining the disease at an early stage. The concern of this study is to develop a robust mechan...
详细信息
Super Resolution Convolutional Neural Network (SRCNN) solves the problems of poor robustness and complex calculation of traditional image super-resolution reconstruction algorithm, but its training data set and the nu...
Super Resolution Convolutional Neural Network (SRCNN) solves the problems of poor robustness and complex calculation of traditional image super-resolution reconstruction algorithm, but its training data set and the number of layers of neural network is relatively small, and the edge and texture detail information are not handled well. For the above problems, the Maxout activation function is adopted in this paper to avoid the problems encountered by traditional activation functions such as gradient disappearance or overflow. Then the combination of Maxout and Dropout can train large data set and deepen neural network. Experimental results show that, compared with the classical algorithm, the algorithm proposed in this paper can train a large amount of data, improve the quality of reconstructed images and the generalization ability of the network model, and can enhance the robustness of the model.
Satellite images are widely used in the study of land cover, being used for many tasks such as: the evaluation of urban growth, the monitoring of cultivation areas, the assessment of natural disasters and other applic...
详细信息
ISBN:
(纸本)9781450372435
Satellite images are widely used in the study of land cover, being used for many tasks such as: the evaluation of urban growth, the monitoring of cultivation areas, the assessment of natural disasters and other applications, to perform these tasks is resorted to to the use of satellite images, they provide a variety of information that depends on the optical instrument carried on board in their payload, the information in the observation satellites, is provided through a data matrix that can be represented by a image, one of the characteristics is the spatial resolution where a pixel in the image corresponds to a coverage in meters on land, with this spatial resolution we can evaluate large tracts of land, among the spatial resolutions, we have the metrics where a pixel in the image corresponds to more than one meter on the ground and the sub meter, where a pixel in the image corresponds e less than one meter on land. With the sub-metric resolution we have a greater detail of the area of interest on land, having the disadvantage that a smaller area of land is evaluated in comparison with the metric resolution. The image provided by the satellite is composed of a set of matrices commonly called spectral bands, characterized by the bands of colors: red, green, blue, we also have the middle and near infrared bands, among others;additionally, the band of the panchromatic is presented, which is the band where the maximum spatial resolution is identified, therefore the image size is of high resolution. A methodology is presented to process the panchromatic satellite images through the use of the GPGPU programming using the MATLAB tool, a test with a high resolution image and with a weight in 1270 Megabits, with a size of 26012 X 25512 pixels, was performed. which was applied an algorithm where it evaluates the value of the pixel analyzing the whole matrix of the image pixel by pixel, the calculation was made in a Core i7 CPU with a processing time of 2.81 hours, with GPGP
The world population is expected to grow by over a third by 2050. Market demand for food will continue to grow. Automated drones and different robots in savvy cultivating applications offer the possibility to screen r...
详细信息
ISBN:
(纸本)9781728123851
The world population is expected to grow by over a third by 2050. Market demand for food will continue to grow. Automated drones and different robots in savvy cultivating applications offer the possibility to screen ranch arrive on a for each plant premise, which thus can diminish the measure of herbicides and pesticides that must be applied. There is a gap between current food productivity growth and needed growth. To boost the yield, farmers switched to extensive use of chemical fertilizers. Excessive fertilizer usage has its negative impact like decreased yield, wastage of fertilizer, damage to soil, and groundwater contamination. Currently, farmers mostly rely on guesswork, estimation, experience when deciding the crop that should grow, and the fertilizer that should be used. In this paper, we have proposed a solution that uses technologies like machinelearning, Image processing, and the Internet of things to improvise farm productivity and at the same time, decrease the fertilizer usage. This paper describes the outcomes of a prototype implemented in Rajasthan, India.
Health monitoring is an important parameter to determine the health status of a person. Measuring the heart rate is an easy way to gauge our health. Normal heart rate may vary from person to person and a usually high ...
Health monitoring is an important parameter to determine the health status of a person. Measuring the heart rate is an easy way to gauge our health. Normal heart rate may vary from person to person and a usually high or low resting heart rate can be a sign of trouble. There are several methods for the measurement of heart rate monitoring such as ecg, ppg etc. Such methods having a disadvantage that these are invasive and have a continuous contact with the human body. In order to overcome this problem a new system is proposed using camera. In this method a blind source separation algorithm is used for extracting the heart rate signal from the face image. Viola jones based face detection algorithm is used to track the face. FastICA algorithm is exploited to separate heart rate signal from noise and artefacts. machinelearning algorithm is implemented to standardize the signal. The data is successfully tested with real time video.
Inpainting a large hole of an image is always a challenging task, especially for high-resolution images. Recently, deep learning based approach has brought excellent ability to deal with this issue. However, these met...
详细信息
ISBN:
(纸本)9781450372619
Inpainting a large hole of an image is always a challenging task, especially for high-resolution images. Recently, deep learning based approach has brought excellent ability to deal with this issue. However, these methods always generate blurred edge and distorted details, which makes the inpainting images unreal. Besides, assessment for generated images is also challenging. Traditional assessment way uses various formulations, which only shows the partial image condition or distorted degree compared to original images, so this method cannot correctly reflect people's perception. To deal with these problems, we propose a multi-scale generative model which can gradually generate novel text to avoid distorted details, and the multi-scale losses can also eliminate blurred edges between inpainting results and the original region. To evaluate images, we also propose a deep convolutional neural network to do image quality assessment, which is closer to human perception. Experiments on multiple datasets show that our approach can generate more plausible inpainting results both on traditional evaluation criteria and our image quality assessment network.
Sign Language is argued as the first Language for hearing impaired people. It is the most physical and obvious way for the deaf and dumb people who have speech and hearing problems to convey themselves and general peo...
详细信息
An increasing number of people are sharing information through text messages, emails, and social media without proper privacy checks. In many situations, this could lead to serious privacy threats. This paper presents...
详细信息
Neural networks have progressively used at a surprising rate now a days. Deep neural networks are majorly used machinelearning models in different areas, from analysis of image to natural language processing (NLP) an...
Neural networks have progressively used at a surprising rate now a days. Deep neural networks are majorly used machinelearning models in different areas, from analysis of image to natural language processing (NLP) and broadly conveyed in scholarly community and industry. These progresses to have enormous possibilities for medical imaging, analysis of medical data, medical diagnostics. This paper illustrates a thorough literature review of deep learning techniques. This paper illustrates brief discussion about current deep learning architectures used for medical imaging and magnetic resonance imaging (MRI) for image classification, detection, segmentation, registration, etc. This review mainly focuses on the applications of deep learning methods in medical diagnosis that uses MRI modality along with the recent developments and various challenges in deep learning related to analysis of medical images.
Recently, object detection has presented superior performance using the deep convolutional neural network (CNN). However, most CNN-based models need the bounding box information of the input image in pairs. To overcom...
Recently, object detection has presented superior performance using the deep convolutional neural network (CNN). However, most CNN-based models need the bounding box information of the input image in pairs. To overcome this limitation, we propose the Generative Object Detection which learns with only cropped images that are not in pairs. Our model based on Generative Adversarial Networks (GAN) creates cropped images by making a mask that represents the object region. To achieve this goal, we devise a novel mask mean loss (MML) that helps the GAN be able to estimate the distribution of training data and uses dilated convolution for a wider reception field in the generator. The experimental results show that Generative Object Detection improves the mIoU and accuracy.
暂无评论