In this modern era, the popularity of autonomous systems has increased manifold because they are replacing people in various jobs and are expected to become the backbone of modern society. Specifically, self-driving v...
详细信息
In this modern era, the popularity of autonomous systems has increased manifold because they are replacing people in various jobs and are expected to become the backbone of modern society. Specifically, self-driving vehicles and robots are gaining popularity in a wide range of applications. However, traffic sign detection, a critical component of intelligent transportation systems, remains quite challenging since it needs to be done quickly, with high precision, and with high dependability. A fast, real-time, robust automatic traffic sign detection and recognition can support and relieve the driver and significantly increase driving safety and comfort. Out of many algorithms and frameworks available for traffic sign identification, i.e., object detection, one of the most popular ones is You Only Look Once (YOLO) since it provides accurate results with minimal background errors in most real-time processing tasks and has excellent learning capabilities. Motivated by this, this research article provides a novel framework, CLEAR, for traffic sign identification in adverse climate conditions using YOLOv5. We compare different models’ speed, accuracy, and other metrics on a Traffic Sign Dataset obtained from the Open image Dataset v6. Experimental results demonstrated that the proposed CLEAR model achieved the best performance with a Mean Average Precision of 0.73392 and Recall of 0.74194 compared to the existing schemes.
Nowadays copyright protection is mandatory in the field of imageprocessing to removes the illegitimate utilization and imitation of digital images. The digital image watermarking is one of the most reliable methods f...
详细信息
Nowadays copyright protection is mandatory in the field of imageprocessing to removes the illegitimate utilization and imitation of digital images. The digital image watermarking is one of the most reliable methods for protecting the illegal validation of data. In this paper, singular value decomposition based digital image watermarking scheme is proposed in complex wavelet transform (CWT) domain using intelligence algorithms like particle swarm optimization (PSO) and recently proposed Jaya algorithm. The watermark image is embedded into high frequency CWT subband of cover image. At the time of watermark embedding and extraction, optimization algorithms Jaya and PSO are applied to improve the robustness and imperceptibility by assessing the fitness function. The perceptual quality of watermarked image and robustness of extracted watermark image are verified under the filtering, rotation, scaling, Gaussian noise and JPEG compression attacks. From the comparative analysis it is proved that Jaya algorithm is better as compared to PSO algorithm under most types of attacks with higher magnitudes whereas identical under the lower magnitude of applied attacks. Moreover, using variety of cover images, it is found that, the elapse time and value of fitness function given by Jaya algorithm are also better as compared to PSO.
Method is proposed to improve the visual quality of images based on algorithms based on histogram equalization. The work examined widely used algorithms of histograms equalization, described their advantages and disad...
详细信息
The infinite variety of image subjects, the dependence of analysis algorithms and decision rules on the shooting conditions and image quality lead to the need to configure and retrain the computer vision system for al...
详细信息
ISBN:
(纸本)9783030500979
The infinite variety of image subjects, the dependence of analysis algorithms and decision rules on the shooting conditions and image quality lead to the need to configure and retrain the computer vision system for almost every next series of images. The paper proposes the principles of organization and high-level language for description of strategies of content-based analysis of aerospace images. The decision maker specifies the strategy for processing and analyzing the image as a sequence of points of selection of actions or subtasks. In general, each action can be performed by different software modules, which require their own data structures, restrictions and rules. Accordingly, the results of the action will vary. The solver, which is controlled by the given strategy, selects variants of actions, data and constraints for each subtask, builds a decision tree, and monitors the progress of the decision. Examples of object detection strategies and results of their work on urban area aerial images characterized by a very high spatial resolution are given. Applied semantic models of actions and resources make the process of structuring and describing more visual and, at the same time, machine-readable. The process of describing the image analysis strategy is transferred from the level of specifying instructions/commands to the level of planning works and resources.
In this article, we use among and the best-known library is Open Computer vision we call it for short OpenCv. It is used for imageprocessing, to do all operations we want, to isolate and detect a specific object, whi...
详细信息
Traditional casting character recognition algorithms need to select appropriate position features for different scenes in the character location step, so it is difficult to realize the recognition task of casting embo...
详细信息
Traditional casting character recognition algorithms need to select appropriate position features for different scenes in the character location step, so it is difficult to realize the recognition task of casting embossed concave and convex characters in the different distribution in complex scenes. In this letter, a recognition method of casting embossed characters based on YOLO v5 is proposed. The fast and reliable depth learning algorithm YOLO v5 is used to automatically extract the image features and realize the recognition of casting embossed characters (including numbers and letters) Recognition. The experimental results show that the accuracy of the network model for steel seal character recognition is higher than traditional computer vision algorithms, the average processing time of the algorithm is quickly, and the weight file volume is small, which meets the accuracy and efficiency requirements of engineering application.
Single image super-resolution (SISR) aims to obtain a high-resolution output from one low-resolution image. Currently, deep learning-based SISR approaches have been widely discussed in medical imageprocessing, becaus...
详细信息
Single image super-resolution (SISR) aims to obtain a high-resolution output from one low-resolution image. Currently, deep learning-based SISR approaches have been widely discussed in medical imageprocessing, because of their potential to achieve high-quality, high spatial resolution images without the cost of additional scans. However, most existing methods are designed for scale-specific SR tasks and are unable to generalize over magnification scales. In this paper, we propose an approach for medical image arbitrary-scale super-resolution (MIASSR), in which we couple meta-learning with generative adversarial networks (GANs) to super-resolve medical images at any scale of magnification in (1, 4]. Compared to state-of-the-art SISR algorithms on single-modal magnetic resonance (MR) brain images (OASIS-brains) and multi-modal MR brain images (BraTS), MIASSR achieves comparable fidelity performance and the best perceptual quality with the smallest model size. We also employ transfer learning to enable MIASSR to tackle SR tasks of new medical modalities, such as cardiac MR images (ACDC) and chest computed tomography images (COvID-CT). The source code of our work is also public. Thus, MIASSR has the potential to become a new foundational pre-/post-processing step in clinical image analysis tasks such as reconstruction, image quality enhancement, and segmentation.
imageprocessingalgorithms are finding various applications in manufacturing and materials industries such as identification of cracks in the fabricated samples, calculating the geometrical properties of the given mi...
详细信息
AI-powered edge devices currently lack the ability to adapt their embedded inference models to the ever-changing environment. To tackle this issue, Continual Learning (CL) strategies aim at incrementally improving the...
详细信息
ISBN:
(纸本)9781728180991
AI-powered edge devices currently lack the ability to adapt their embedded inference models to the ever-changing environment. To tackle this issue, Continual Learning (CL) strategies aim at incrementally improving the decision capabilities based on newly acquired data. In this work, after quantifying memory and computational requirements of CL algorithms, we define a novel HW/SW extreme-edge platform featuring a low power RISC-v octa-core cluster tailored for on-demand incremental learning over locally sensed data. The presented multi-core HW/SW architecture achieves a peak performance of 2.21 and 1.70 MAC/cycle, respectively, when running forward and backward steps of the gradient descent. We report the trade-off between memory footprint, latency, and accuracy for learning a new class with Latent Replay CL when targeting an image classification task on the CORe50 dataset. For a CL setting that retrains all the layers, taking 5h to learn a new class and achieving up to 77.3% of precision, a more efficient solution retrains only part of the network, reaching an accuracy of 72.5% with a memory requirement of 300 MB and a computation latency of 1.5 hours. On the other side, retraining only the last layer results in the fastest (867 ms) and less memory hungry (20 MB) solution but scoring 58% on the CORe50 dataset. Thanks to the parallelism of the low-power cluster engine, our HW/SW platform results 25x faster than typical MCU device, on which CL is still impractical, and demonstrates an 11x gain in terms of energy consumption with respect to mobile-class solutions.
Understanding human mobility plays a vital role in urban and environmental planning as cities continue to grow. Ubiquitous geo-location, localization technology, and availability of big-data-ready computing infrastruc...
详细信息
Understanding human mobility plays a vital role in urban and environmental planning as cities continue to grow. Ubiquitous geo-location, localization technology, and availability of big-data-ready computing infrastructure have enabled the development of more sophisticated models to characterize human mobility in urban areas. In this work, our main goal is to extract spatio-temporal features that characterize user mobility and, based on the similarity of these features, identify user communities. To this end, we propose a novel approach that leverages imageprocessing techniques to represent user geographical preferences as images and then apply deep convolutional autoencoders to extract latent spatio-temporal mobility features from these images. These features are then fed to a clustering algorithm that identifies the underlying community structures. We use a diverse urban mobility dataset to validate the proposed framework. Our results show that the proposed framework is able to significantly increase the similarity between intra-community nodes (by up to 107%) as well as dissimilarity between inter-community nodes (up to 54%) when compared against no pre-processing of the datasets, i.e without pre-processing the datasets through any feature fusion method. Moreover, it was also able to reach up to 100% improvement when compared against community identification using Principal Component Analysis (PCA). Our results also show that the proposed approach yields significant increase in contact time amongst users belonging to the same community, by up to 80% when compared to the average contact time when not considering community structures, and by up to 150% when compared to the baseline. To the best of our knowledge, our proposal is the first to consider deep convolutional autoencoding to perform automatic extraction of non-linear spatio-temporal mobility features characterizing individual users from raw mobility datasets with the goal of identifying user communities.
暂无评论