Deep convolutional neural networks (CNNs) have proven their effectiveness in the acoustic scene classification (ASC) task, becoming the state-of-the-art solution to build pow-erful ASC models. Usually, the architectur...
详细信息
ISBN:
(纸本)9781665408684
Deep convolutional neural networks (CNNs) have proven their effectiveness in the acoustic scene classification (ASC) task, becoming the state-of-the-art solution to build pow-erful ASC models. Usually, the architectures of these networks and their hyper-parameters have been manually designed by expert researchers in both the investigated problem and CNN architecture design, which can be a challenging task. In this work, we propose a genetic algorithm (GA) that benefits from the characteristics of state-of-the-art CNN architectures in the field of ASC to specifically optimize CNN architectures for the ASC task, within an acceptable search cost compared to other GAs. After searching for 14 GPU days using a single NVIDIA Tesla K80 GPU on the development dataset of DCASE2020 Task lA, the proposed GA has optimized a CNN architecture that achieved a test accuracy of 68.5 % on the same dataset. The same network has been evaluated on the development dataset of DCASE2018 Task 1A and achieved a classification accuracy of 73.7% without repeating the search process, which demonstrates the generalization capability of the generated CNNs.
In this paper, an efficient blind signal processing algorithm called relative-gradient bound component analysis algorithm that combines the concepts of the relative-gradient approach and the bound-component analysis i...
详细信息
ISBN:
(数字)9781728183312
ISBN:
(纸本)9781728183329
In this paper, an efficient blind signal processing algorithm called relative-gradient bound component analysis algorithm that combines the concepts of the relative-gradient approach and the bound-component analysis is applied to the image watermarking systems. Unlike to the other blind-signal-processingalgorithms, neither the inverse matrix nor the covariance matrix is necessary to be computed in this one. It can successfully extract the embedded watermark without the whittling process. The time complexity and the space complexity of this algorithm are both lower than those of the original bound component analysis algorithm.
The quality of medical images is paramount. Being of high grade, it guarantees the quality of medical diagnosis, treatment and quality of patient’s life through the means of health care or using automate intelligent ...
详细信息
ISBN:
(纸本)9781665458429
The quality of medical images is paramount. Being of high grade, it guarantees the quality of medical diagnosis, treatment and quality of patient’s life through the means of health care or using automate intelligent systems for medical diagnosing, treatment and monitoring. The paper presents the computational challenges in medical images processing. The great challenges are to propose parallel computational models and parallel program implementations based on the algorithms for medical images filtering. Parallel computational model based on two-dimensional filters is designed. The proposed parallel model is verified by multithreaded parallel program implementation. An investigation of the efficiency of medical images filters based on parallel multithreaded program implementation, applying two-dimensional filters on a given list of compressed jpeg medical images and generating output jpeg images for each type of applied filter. The applied filters are Brightness Control, horizontal and vertical filter of Sobel, Laplace and Blur. A number of experiments have been carried out for the case of dataset consisted of 162 whole mount slide images of Breast Cancer (BCa) specimens scanned at 40x and various number of threads. Parallel performance parameters execution time and speedup are estimated experimentally. The performance estimation and scalability analyses show that the suggested model has good scalability.
Bulgaria is a culturally rich country which has been kept unexplored for years. Modern information and communication technologies can shrink the gap between tourists and its rich historical heritage. A digital tourist...
详细信息
Classic vision-based navigation solutions, which are utilized in algorithms such as Simultaneous Localization and Mapping (SLAM), usually fail to work underwater when the water is murky and the quality of the recorded...
详细信息
ISBN:
(纸本)9781450377409
Classic vision-based navigation solutions, which are utilized in algorithms such as Simultaneous Localization and Mapping (SLAM), usually fail to work underwater when the water is murky and the quality of the recorded images is low. That is because most SLAM algorithms are feature-based techniques and often it is impossible to extract the matched features from blurry underwater images. To get more useful features, imageprocessing techniques can be used to dehaze the images before they are used in a navigation/localization algorithm. There are many well-developed methods for image restoration, but the degree of enhancement and the resource cost of the methods are different. In this paper, we propose a new visual SLAM, specifically-designed for the underwater environment, using Generative Adversarial Networks (GANs) to enhance the quality of underwater images with underwater image quality evaluation metrics. This procedure increases the efficiency of SLAM and gets a better navigation and localization accuracy. We evaluate the proposed GANs-SLAM combination by using different images with various levels of turbidity in the water. Experiments were conducted and the data was extracted from the Carnegie Lake in Princeton, and the Raritan river both in New Jersey, USA.
Palmprint recognition allows accurate identity verification to build a security system. Recently, researchers introduce deep learning to this area that largely improves the recognition accuracy. However, as a supervis...
详细信息
ISBN:
(纸本)9781538662496
Palmprint recognition allows accurate identity verification to build a security system. Recently, researchers introduce deep learning to this area that largely improves the recognition accuracy. However, as a supervised approach, its performance relies on availability of data and labels for every registered identity. For large-scale security systems, after image acquisition, we need to check the whole dataset and manually assign labels through comparison, which is a time-consuming task. Besides, labelling some redundant training samples contributes little to the recognition result. In this paper, we introduce an active learning framework to select the best sample set for label assignment. We regard the active learning as a binary classification task and attempt to make the labeled and unlabeled set indistinguishable. Experiments on different datasets demonstrate our model can reduce the annotation cost while achieving comparable recognition performance.
Most forms of optical image formation involve the use of an optical system to form a real image on an array of sensing elements. The output from the sensing elements is a sampled image. Mathematically, this process is...
详细信息
ISBN:
(纸本)9781510626683
Most forms of optical image formation involve the use of an optical system to form a real image on an array of sensing elements. The output from the sensing elements is a sampled image. Mathematically, this process is described by convolution of the point spread function of the system (including the sensing elements) and the projection of objects in the image plane. In general, this process cannot be done mathematically in closed form for arbitrary images. Sampling and imageprocessingalgorithms are often assessed with respect to their performance on sampled images that are accepted standards. By applying known degradations such as noise and blur to a standard image, operating on the degraded image with a prospective algorithm, and comparing the result with the original uncorrupted image, an imageprocessing algorithm or sampling scheme can be assessed. A weakness of this approach is the fact that the accepted standard is just that: an accepted standard. The image contains within itself uncertainties associated with the original image acquisition process. These uncertainties place bounds on the utility of the image. In this research we introduce the concept of canonical images. Canonical images are closed form, mathematically computable images that retain the essentials of the linear shift invariant image formation process. We derive one form of a canonical image, show its properties, and show how complex images can be generated using superposition. We also demonstrate how arbitrary images can be decomposed into canonical images that approximate them. We discuss applications for canonical images that include modeling and simulation, sensor testing, perception testing, and algorithm development.
On-board imageprocessing technologies in the satellite domain are subject to extremely strict requirements with respect to reliability and accuracy in hard real-time. Due to their large input domain, it is infeasible...
详细信息
ISBN:
(纸本)9789897583797
On-board imageprocessing technologies in the satellite domain are subject to extremely strict requirements with respect to reliability and accuracy in hard real-time. Due to their large input domain, it is infeasible to execute all possible test cases. To overcome this problem, we define a novel test approach that efficiently and systematically captures the input domain of satellite on-board imageprocessing applications. To achieve this, we first present a dedicated partitioning into equivalence classes for each input parameter. Then, we define multidimensional coverage criteria to assess a given test suite for its coverage on the complete input domain. Finally, we present a test generation algorithm that automatically inserts missing test cases into a given test suite based on our multidimensional coverage criteria. This results in a reasonably small test suite that covers the whole input domain of satellite on-board imageprocessing applications. We demonstrate the effectiveness of our approach with experimental results from the ESA medium-class mission PLATO.
In this studies, we conduct research on identification of tree species prior to logging is one of the important new initiative to support the management of tropical forests. The use of drone and the improved technique...
详细信息
ISBN:
(纸本)9783030190637;9783030190620
In this studies, we conduct research on identification of tree species prior to logging is one of the important new initiative to support the management of tropical forests. The use of drone and the improved technique of imageprocessing will increase the potential of identifying tree species, particularly the commercial species, so that the value of timber in that area could be assessed. Only when the value of timber is found to be attractive for timber operation, the area could be opened for timber operation. Otherwise that area could be used for protection of biodiversity purposes. The drone will be used to capture the image of forests as the image data sets. Several algorithms are used for visual enhancement as well as the edge detection operator and morphological operation for segmentation and identification of crown image.
Color model is also referred to as a mathematical organization or arrangement of colors as numerical values as three or four color components or channels. Every color space has unique features and application oriented...
详细信息
ISBN:
(数字)9781728151977
ISBN:
(纸本)9781728151984
Color model is also referred to as a mathematical organization or arrangement of colors as numerical values as three or four color components or channels. Every color space has unique features and application oriented. The paper presented an extravagant investigation on color spaces for imageprocessing applications. Fuzzy c-means clustering is one of the widely applied algorithms for image segmentation. This work too utilized FCM to test the influence of color space on imageprocessing applications.
暂无评论