In colleges and universities in China, physical education is a crucial public fundamental subject. Traditional instruction often takes place during the designated class period in the training program. The instructor i...
详细信息
Pneumonia is an infectious disease of the lungs, caused by viruses, bacteria or fungi. Pneumonia is distinguished by acute inflammation of the lung tissue, causing the consolidation of the terminal bronchioles and alv...
详细信息
The rapid expansion of urban areas has intensified the challenge of finding parking spaces for drivers. Intelligent parking systems emerge as a crucial solution by providing real-time detection of available spaces. Wh...
详细信息
Color Filter Arrays (CFA) are essential components of digital cameras and image sensors to capture the color information needed to produce full-color images from only a single image sensor per pixel. Many methods and ...
详细信息
ISBN:
(纸本)9798350388787;9798350388770
Color Filter Arrays (CFA) are essential components of digital cameras and image sensors to capture the color information needed to produce full-color images from only a single image sensor per pixel. Many methods and algorithms have been proposed to recover the missing color information of CFAs. In this work, we use a simplified version of the Theshold-based Variable Number of Gradients algorithm proposed by Chang et al. to estimate the full-color information from Bayer images. We also show that the slight modification to algorithm does not effect images quality while making it more compatible with hardware. We propose an efficient implementation of the algorithm that reduces the number of calculations per pixel at the cost of increased memory resources. Our implementation targets an imageprocessing pipeline in an FPGA platform which is short on LUTs and FF resources but has DSPs and BRAMs to spare. We buffer the absolute differences and average color components to be shared and re-used between neighboring pixels, on two levels: within the same row, and between different rows. The latter strategy reduces the number of absolute differences calculated every cycle from 32 to 4 and average color components from 32 to 6. However, the memory requirements are increased from storing 4 image rows to 18 image rows. We implement the solutions on an FPGA using high-level synthesis (HLS) and optimize it to further reduce resources.
Industry 4.0, the digitalization of manufacturing promises to lead to lowered cost, efficient processes and even discovery of new business models. However, many of the enterprises have huge investments in legacy machi...
详细信息
ISBN:
(纸本)9783031702587;9783031702594
Industry 4.0, the digitalization of manufacturing promises to lead to lowered cost, efficient processes and even discovery of new business models. However, many of the enterprises have huge investments in legacy machines which are not 'smart'. In this study, we thus designed a cost-efficient solution to retrofit a legacy conveyor belt-based cutlery washing machine with a commodity web camera. We then applied computer vision (using both traditional imageprocessing and deep learning techniques) to infer the speed and utilization of the machine. We detailed the algorithms that we designed for computing both speed and utilization. With the existing operational constraints of our client, frequent re-training of the deep learning model for object detection is not feasible. Thus, we compared the generalizability of the two techniques across 'unseen' cutleries and found traditional imageprocessing to be generalizable across 'unseen' images. Our proposed final solution uses traditional imageprocessing for computation of utilization but a hybrid of traditional imageprocessing and deep learning model for speed computation as it is more reliable. Our client has implemented our proposed solution for one conveyor belt-based cutlery washing machine and will be planning to scale this to multiple conveyor belt-based cutlery washing machines.
Super-resolution algorithms often struggle with images from surveillance environments due to adverse conditions such as unknown degradation, variations in pose, irregular illumination, and occlusions. However, acquiri...
详细信息
ISBN:
(纸本)9798350376043;9798350376036
Super-resolution algorithms often struggle with images from surveillance environments due to adverse conditions such as unknown degradation, variations in pose, irregular illumination, and occlusions. However, acquiring multiple images, even of low quality, is possible with surveillance cameras. In this work, we develop an algorithm based on diffusion models that utilize a low-resolution image combined with features extracted from multiple low-quality images to generate a super-resolved image while minimizing distortions in the individual's identity. Unlike other algorithms, our approach recovers facial features without explicitly providing attribute information or without the need to calculate a gradient of a function during the reconstruction process. To the best of our knowledge, this is the first time multi-features combined with low-resolution images are used as conditioners to generate more reliable super-resolution images using stochastic differential equations. The FFHQ dataset was employed for training, resulting in state-of-the-art performance in facial recognition and verification metrics when evaluated on the CelebA and Quis-Campi datasets. Our code is publicly available at https://***/marcelowds/fasr.
Improved fuzzy c-means (FCM) clustering algorithms have been widely used for image recognition and localization. However, in industrial assembly systems, the unsatisfactory pixel merging and segmentation results betwe...
详细信息
Manufacturing process (MP) selection systems require a large amount of labelled data, typically not provided as design outputs. This issue is made more severe with the continuous development of Additive Manufacturing ...
详细信息
One of the interesting fields in video processing is motion detection and human action detection (HAR) in video. In some applications where both objects in the scene and the camera may be moving, camera movement cance...
详细信息
ISBN:
(纸本)9783031456503;9783031456510
One of the interesting fields in video processing is motion detection and human action detection (HAR) in video. In some applications where both objects in the scene and the camera may be moving, camera movement cancellation is very important to increase accuracy in extracting motion features. HAR systems usually use image matching/registration algorithms to remove the camera movement. In these methods, the source (fixed) image frame is compared with moved image frame, and the best match is determined geometrically. In video processing, due to the existence of a set of frames, one can correct errors using previous data, but at the same time, it is needed a fast frame registration algorithm. According to the above explanations, this article proposes a method to detect and minimize camera movement in video using phase information. In addition to having the acceptable speed and the ability to be implemented online, the proposed method, by combining texture and phase congruency (PC), can significantly increase the accuracy of detecting the objects in the scene. The proposed method was implemented on a HAR dataset, which includes camera movement, and its ability to compensate for camera motion and pre-serve object motion was verified. Finally, the speed and accuracy of the proposed method were compared with a number of the latest image registration methods, and its efficiency in terms of camera movement cancellation and execution time is discussed.
The increase in amount of vehicles in the past few years have made traffic management a difficult job. Technologies play an important role in these systems to regulate the traffic. Number plates are distinguished by t...
详细信息
暂无评论