Recommender systems make individualized suggestions for users based on their actions and preferences by utilizing machine learning (ML) and artificial intelligence (AI).. These systems have evolved significantly, inco...
详细信息
The push-relabel algorithm is recognized as one of the efficient algorithms in the field of graph cut, finding widespread applications in computer vision. While its pixel-level parallel implementations are prevalent, ...
详细信息
The push-relabel algorithm is recognized as one of the efficient algorithms in the field of graph cut, finding widespread applications in computer vision. While its pixel-level parallel implementations are prevalent, existing methods predominantly rely on checkerboard scheduling, imposing inherent constraints on neighborhood size, limited to four. This limitation compromises both algorithm precision and efficiency, hindering real-time and high-precision applications. To address these issues, this article introduces a novel approach to accelerate push-relabel algorithm implementation on FPGA in a more universal and efficient manner, supporting variable-sized image block operations. First, by introducing the deferred update strategy, we realize the pulse-enhanced parallel push-relabel (PEPPR) algorithm to address data contention and conflict in parallel processing. Second, the simultaneous weighted push method is proposed, further enhancing parallel operations. Lastly, we introduce the efficient diffusion wave search (DWS) algorithm to expedite algorithm convergence and reduce redundancy. While achieving a modest $1.7\times $ acceleration compared to state-of-the-art implementations, the proposed algorithm (PEPPR-DWS) successfully overcomes the inherent limitations of checkerboard scheduling in full pixel-level parallelism. In the test based on Middlebury benchmark v3, the proposed 8-neighborhood implementation exhibits a reduction of error rate by over 1% compared to the typical 4-neighborhood implementation. It provides a versatile and efficient solution for high-precision and real-time applications, holding substantial potential for practical applications.
Estimating vehicles' locations is one of the key components in intelligent traffic management systems (ITMSs) for increasing traffic scene awareness. Traditionally, stationary sensors have been employed in this re...
详细信息
Estimating vehicles' locations is one of the key components in intelligent traffic management systems (ITMSs) for increasing traffic scene awareness. Traditionally, stationary sensors have been employed in this regard. The development of advanced sensing and communication technologies on modern vehicles (Mvs) makes it feasible to use such vehicles as mobile sensors to estimate the traffic data of observed vehicles. This study aims to explore the capabilities of a monocular camera mounted on an Mv in order to estimate the geolocation of the observed vehicle in a global positioning system (GPS) coordinate system. We proposed a new methodology by integrating deep learning, imageprocessing, and geometric computation to address the observed-vehicle localization problem. To evaluate our proposed methodology, we developed new algorithms and tested them using real-world traffic data. The results indicated that our proposed methodology and algorithms could effectively estimate the observed vehicle's latitude and longitude dynamically. (c) 2022 The Author(s). Published by Elsevier B.v. This is an open access article under the CC BY license (http://***/licenses/by/4.0/).
Lung cancer being one of the catastrophic diseases is haunting mankind from past seven decades. Unfortunately, early detection of lung cancer is unlikely, hence leading to highest mortality rates. However, various ima...
详细信息
With the increasing demand for high-resolution video and real-time processing, the limited efficiency of video-denoising algorithms has become a critical factor. This paper proposes a spatio-temporal video denoising c...
详细信息
With the increasing demand for high-resolution video and real-time processing, the limited efficiency of video-denoising algorithms has become a critical factor. This paper proposes a spatio-temporal video denoising co-processor to suppress an image sequence's spatial and temporal noise. Temporal denoising is achieved by merging the current and previous frames at the pixel level in which the current frame is processed by a spatial filter. After exploiting noise estimation and motion detection, the Wiener filter calculates the merge ratio. Rather than buffering the entire previous frame, the JPEG-like codec can dynamically adjust the compression ratio through a predefined quantization table to satisfy the designed on-chip storage. The experimental results demonstrate that the spatio-temporal denoising co-processor can effectively eliminate the fluctuation of the grayscale value of the noise in videos. Simultaneously, the adaptive codec can reduce the storage space consumption for the frame buffer by at least 80% of the original size. To the best of our knowledge, this is the first fully integrated spatio-temporal denoising co-processor without any external memory. Additionally, the grayscale, RGB, and RAW versions of the co-processor are also implemented on the Stratix v FPGA platform and synthesized in 28nm CMOS technology.
The rapid expansion of urban areas has intensified the challenge of finding parking spaces for drivers. Intelligent parking systems emerge as a crucial solution by providing real-time detection of available spaces. Wh...
详细信息
Animal incursion is a significant problem in many locations, including roads, rural communities, and base camps for different camps. This has an influence on food security since the primer of animals will decrease agr...
详细信息
Animal incursion is a significant problem in many locations, including roads, rural communities, and base camps for different camps. This has an influence on food security since the primer of animals will decrease agricultural land output and reduce farmers' profits. There must be a means to identify animals before they enter locations they should not be, so that appropriate measures may be taken to stop their incursion and its effects. Since a few decades ago, the study of detecting wild animals has been active. In this paper, a method for detecting animals is proposed utilizing developments in Deep Learning algorithm approaches. With the help of this deep learning technology, it is possible to identify animals in advance and take the proper security measures to prevent harm from animal entry. This study includes imageprocessing and artificial intelligence for animal detection, species categorization, and automatic animal identification using CNN, an alarm unit, and an animal repellent circuit design. The proposed system experiments on the Amur Tiger Re-identification in the Wild datasets, Animals Detection images Dataset, and Using Google Open images v6+, the dataset for detecting animals (objects) was retrieved. Deep Convolutional Neural Network is used to extract these characteristics from cluttered photos. Hybrid CNN models were employed in the system's implementation, which was done on a regular camera trap database. Finally, CNN features were supplied to the top categorization deep learning algorithms. Our results show that our proposed method is better than other systems that have been mentioned in the literature.
Neural Networks have achieved a reputation for their ability to outperform traditional machine learning algorithms. The robustness of network models to unseen data is important. In this study the implementation of an ...
详细信息
Neural Networks have achieved a reputation for their ability to outperform traditional machine learning algorithms. The robustness of network models to unseen data is important. In this study the implementation of an attention module to an existing network architecture is studied for robustness improvement with extensive image pre-processing for facial emotion recognition, a component in a closed loop feedback system being developed for the emotional training of people with autism spectrum disorder. The squeeze and excitation (SE) attention module was selected and combined with the network architecture of vGG16. The performance of the model, trained with the OULU-CASIA dataset, was based on statistical data generated from validation accuracies with a 10-fold partition, while the robustness analysis was based on data generated from cross database prediction accuracies. The visualization technique of Grad-CAM was used to observe the regions of impact on classification. validation accuracies of up to 98.39% were achieved and generalization performance was 52.43% on the FACES, and 26.10% on the JAFFE datasets. The improvements of the SE over the base model were observed in the visualizations of the validation datasets. Data analyzed showed that extensive image pre-processing diminished effects of SE in relation to model robustness. Copyright (C) 2024 The Authors. This is an open access article under the CC BY-NC-ND license (https://***/licenses/by-nc-nd/4.0/)
Identifying and mapping corrosion represents a significant challenge in maintaining oil production systems. Mechanical and chemical phenomena, such as abrasion and hydrogen attack, cause corrosion on the internal surf...
详细信息
ISBN:
(纸本)9780791888162
Identifying and mapping corrosion represents a significant challenge in maintaining oil production systems. Mechanical and chemical phenomena, such as abrasion and hydrogen attack, cause corrosion on the internal surface of pipes. Knowing the dimensions of corrosion enables technicians to make decisions about the structural stability of pipelines under operating conditions. Using ultrasound imaging is a non-destructive approach to assessing the state of internal surfaces. We present a web application for processing ultrasound signals and image reconstruction. However, it can be used in any non-destructive ultrasound inspection application. The software is developed mainly in Python and Typescript, using a different approach than conventional web applications. Instead of using a standard structure, with the front-end running in the browser and the back-end running the data processing on a remote server, our application runs entirely on the browser through Pyodide, a Python distribution for the browser and *** based on WebAssembly whilst the server side only hosts the web application files. The application is divided into modules for (i) graphical interface, (ii) reading inspection files, (iii) signal preprocessing, (iv) estimation of inspection parameters, (v) external surface detection (for inspection by immersion), and (vi) image reconstruction. The web application allows Python script execution and offers a user-friendly interface for running image reconstruction algorithms. This paper describes the development of critical features and indicates the chosen implementation of algorithms. The current state of development is presented, and the next steps toward the ultimate goal of corrosion mapping are defined.
The paper examines the features of combining multi-temporal and multi-angle images of building structures in order to identify critical changes. It is proposed to carry out such a combination on the basis of high-spee...
详细信息
暂无评论