One difficulty that remains in imageprocessing is the accurate location of key points in depth images. This paper presents an intelligent location method for identifying key points in depth images based on deep convo...
详细信息
One difficulty that remains in imageprocessing is the accurate location of key points in depth images. This paper presents an intelligent location method for identifying key points in depth images based on deep convolutional neural networks. This study used Kinect to process images, calculating the differences in depth as well as the directional gradient in subject depth images. The entirety of each depth image was traversed through a sliding window to identify the feature vector. Principal component analysis was used to reduce image dimensions. The random forest technique was used to select characteristics of strong classification as well as to actualize training and testing. A depth convolutional neural network was used to detect key points in images of pedestrians. During the study, an experimental test was conducted in a general environment under various conditions, including occlusion and low light. Even under these suboptimal conditions, the detection rate of the proposed method was 87.72%. Furthermore, this method was compared with the GEBCF and FCF algorithms, and proved to increase the detection rate by 0.92% and 0.68%, respectively. Using the depth convolutional neural network in the pedestrian key point positioning experiment, the average error obtained when comparing the predicted point coordinates to the sample mark coordinates was 2.102 pixels. These experimental results show that this method has good accuracy and robustness for the key point location problem of pedestrians in depth images.
Detection of humans, e.g. for search and rescue operations has been enabled by the availability of compact, easy to use cameras and drones. On the other hand, aerial photogrammetry techniques for inspection applicatio...
详细信息
Detection of humans, e.g. for search and rescue operations has been enabled by the availability of compact, easy to use cameras and drones. On the other hand, aerial photogrammetry techniques for inspection applications allow for precise geographic localization and the generation of an overview orthomosaic and 3D terrain model. The proposed solution is based on nadir drone imagery and combines both deep learning and photogrammetric algorithms to detect people and position them with geographical coordinates on an overview orthomosaic and 3D terrain map. The drone imageprocessing chain is fully automated and near real-time and therefore allows search and rescue teams to operate more efficiently in difficult to reach areas.
In the following technical design paper, the Technion Aerial systems (TAS) team presents the Ninox system and its development process for the AUVSI Student UAS Competition 2018. The team comprises 32 undergraduate stu...
详细信息
Wireless devices have been used to investigate the environment and to understand our physical world. In this work, we undertake the challenging problem of identifying location of obstacles and objects by WiFi signals....
详细信息
ISBN:
(数字)9781728150895
ISBN:
(纸本)9781728150901
Wireless devices have been used to investigate the environment and to understand our physical world. In this work, we undertake the challenging problem of identifying location of obstacles and objects by WiFi signals. Gathering wireless sensory data to form an image is difficult since wireless signals are susceptible to multipath. Moreover, reconstructing an image of unknown objects based on the measurements of sparse signals is an ill-posed problem. To tackle these problems, we first present a linear model using received signal strength indicator (RSSI) measurements. We define the sparse beamforming problem as an ℓ 0 -norm optimization problem, then use the iterative reweighted ℓ 1 heuristic algorithm to obtain an optimal solution as a multipath. Finally, the multipath fading is removed by using Machine Learning. More specifically, we use Support Vector Regression (SVR) to identify a clear image of the unknown object. Our results show that the proposed method can reconstruct signals as a 3D image with a satisfactory visual appearance, i.e. the generated data mesh is well defined and smooth compared to previous work.
Chronic Otitis Media (COM) causes deformation of the middle ear ossicles with perforation as a result of long-lasting inflammation of the middle ear and it is one of the basic reasons for hearing loss. The middle ear ...
详细信息
ISBN:
(纸本)9781728139647
Chronic Otitis Media (COM) causes deformation of the middle ear ossicles with perforation as a result of long-lasting inflammation of the middle ear and it is one of the basic reasons for hearing loss. The middle ear images are examined by otolaryngologists in the diagnosis of the disease in clinical practice. The observers make a decision considering the status of the tympanic membrane images. Decision support systems using imageprocessing techniques and machine learning algorithms are quite useful in the diagnosis process, however, the usage of such systems in this field is limited. In this study, we propose a diagnostic model using a pretrained deep convolutional neural network (DCNN) called AlexNet. The experiments were carried out on a private dataset consisting of totally 598 tympanic membrane images collected from patients admitted to Ozel Van Akdamar Hospital. Firstly, a set of preprocessing procedures were applied to the eardrum images. Then, the tympanic membrane images were used to feed the DCNN model. The proposed model was trained using transfer learning approach. To evaluate and validate the success of the proposed model, the 10-fold cross-validation method was used. As a result, the model provided satisfactory results with an accuracy of 98.77%. Consequently, the proposed DCNN model was determined as a robust tool in separating chronic and normal tympanic membrane images.
Acetone gas is a breath marker for diabetes detection. Metal oxide gas sensors have been widely used for gas sensing applications. Zinc oxide (ZnO) is a promising material for Acetone gas detection. Metal oxide semico...
详细信息
ISBN:
(数字)9781728154756
ISBN:
(纸本)9781728154763
Acetone gas is a breath marker for diabetes detection. Metal oxide gas sensors have been widely used for gas sensing applications. Zinc oxide (ZnO) is a promising material for Acetone gas detection. Metal oxide semiconductors require a high-temperature environment for detecting the target gas. The temperature generated from the micro-heater has a great influence on the concentration of gas adsorbed on the surface and also the resistance of the sensing layer. The temperature should be uniformly distributed on the surface of the heater for the better performance of the sensor. In this work, a meander shaped micro-heater is used for the ZnO sensor for Acetone gas detection. The Scanning electron microscopic image of the synthesized ZnO was found to have a spherical shape and the same structure was given for the sensing layer in the simulation. Platinum is used as the heating material because of its high thermal conductivity and low power consumption. The variation of the temperature of the heating material with respect to the input voltage was analyzed. The variation of the resistance of the sensing layer with respect to the target gas was also analyzed. The resistance of the sensing layer was found to decrease in the presence of Acetone gas. The simulation was done using COMSOL Multiphysics.
A single training session of tennis requires 30-40 balls and these scattered balls have to be collected at the end of each session which costs time and induces unnecessary physical stress on players. This paper propos...
详细信息
In order to guide the production of cigarette products and improve the quality of cigarette products, this paper proposes a classification method for cigarette combustion cones based on deep convolutional neural netwo...
详细信息
ISBN:
(数字)9781728161365
ISBN:
(纸本)9781728161372
In order to guide the production of cigarette products and improve the quality of cigarette products, this paper proposes a classification method for cigarette combustion cones based on deep convolutional neural network model. The method is optimized based on the Inception Resnet V2 model and is innovatively used in the detection of cigarette burning cones. The classification accuracy of combustion cone fallout is characterized by the overall classification accuracy (OA) and the Kappa coefficient (Kappa). The experimental results show that the overall classification accuracy is 97.22%, and the Kappa coefficient is 0.9583. The deep convolutional neural network has better classification effect. Based on the classification method of deep convolutional neural network, the cigarette burning cone can be accurately identified.
High resolution aerial and satellite borne hyperspectral imagery provides a wealth of information about an imaged scene allowing for many earth observation applications to be investigated. Such applications include ge...
详细信息
ISBN:
(纸本)9781510626386
High resolution aerial and satellite borne hyperspectral imagery provides a wealth of information about an imaged scene allowing for many earth observation applications to be investigated. Such applications include geological exploration, soil characterisation, land usage, change monitoring as well as military applications such as anomaly and target detection. While this sheer volume of data provides an invaluable resource, with it comes the curse of dimensionality and the necessity for smart processing techniques as analysing this large quantity of data can be a lengthy and problematic task. In order to aid this analysis dimensionality reduction techniques can be employed to simplify the task by reducing the volume of data and describing it (or most of it) in an alternate way. This work aims to apply this notion of dimensionality reduction based hyperspectral analysis to target detection using a multivariate Percentage Occupancy Hit or Miss Transform that detects objects based on their size shape and spectral properties. We also investigate the effects of noise and distortion and how incorporating these factors in the design of necessary structuring elements allows for a more accurate representation of the desired targets and therefore a more accurate detection. We also compare our method with various other common Target Detection and Anomaly Detection techniques.
In this paper, we propose a bio-inspired Fuzzy Lévy Taxis algorithm to solve the robotic odor source localization problem in dynamic odor plumes. According to the proposed algorithm, the robot is programmed to mo...
详细信息
ISBN:
(数字)9781728164793
ISBN:
(纸本)9781728164809
In this paper, we propose a bio-inspired Fuzzy Lévy Taxis algorithm to solve the robotic odor source localization problem in dynamic odor plumes. According to the proposed algorithm, the robot is programmed to move for a length with a turning angle at each step until reaching the odor source. The movement length and the turning angle follow two specific probability distribution, of which the parameters are adaptive through a fuzzy logic system. The proposed algorithm was compared with the Adaptive Lévy Taxis algorithm in simulated pseudo-Gaussian plumes. Our proposed algorithm shows a higher success rate and efficiency. The algorithm has also been systematically evaluated in simulated filament-based odor plumes under various environmental conditions. The results revealed that the performance of the proposed algorithm is consistently good in various environmental conditions in terms of success rate, number of steps and distance overhead to find the odor source.
暂无评论