Understanding the driving environment is one of the key factors in achieving an autonomous vehicle. In particular, the detection of anomalies in the traffic lane is a high priority scenario, as it directly involves ve...
详细信息
ISBN:
(纸本)9781665462198
Understanding the driving environment is one of the key factors in achieving an autonomous vehicle. In particular, the detection of anomalies in the traffic lane is a high priority scenario, as it directly involves vehicle's safety. Recent state of the art imageprocessing techniques for anomaly detection are all based on deep learning of neural networks. These algorithms require a considerable amount of annotated data for training and test purposes. While many datasets exist in the field of autonomous road vehicles, such datasets are extremely rare in the railway domain. In this work, we present a new innovative dataset relevant for railway anomaly detection called RailSet. It consists of 6600 high-quality manually annotated images containing normal situations and 1100 images of railway defects such as hole anomaly and rails discontinuity. Due to the lack of anomaly samples in public images and difficulties to create anomalies in the railway environment, we generate artificially images of abnormal scenes, using a deep learning algorithm named StyleMapGAN. This dataset is created as a contribution to the development of autonomous trains able to perceive tracks damage in front of the train. The dataset is available at this link.
Musical descriptors play a very crucial role in automatic music genre classification. With the increase in digitized music, query and retrieval of musical data has always been subjective and controversial. More often,...
详细信息
Retinal prostheses are designed to aid individuals with retinal degenerative conditions such as Retinitis Pigmentosa (RP) and Age-related Macular Degeneration (AMD). These prostheses seek to restore vision and improve...
详细信息
Assessment of human movement is necessary for physical therapy management. This article presents a development of motion tracking system for human Upper Extremity (UE) function analysis. We proposed the optical motion...
详细信息
ISBN:
(纸本)9781665494755
Assessment of human movement is necessary for physical therapy management. This article presents a development of motion tracking system for human Upper Extremity (UE) function analysis. We proposed the optical motion capture system made by a single smart phone camera. It was used to capture the Reach-to-Grasp (RTG) movement of participants in sitting position. imageprocessing were used to detect color markers placed on chosen hand anatomical landmarks. With our simple camera calibration technique, the 3D coordinates of hand movement were obtained. Two clinical parameters, grasp aperture and hand transport velocity were computed. These results were compared with the outputs, collected at the same time, from the higher accuracy Electromagnetic Motion (EM) tracking system. Qualitatively, the result patterns from two systems were parallel to each other. Our ongoing work is to improve the algorithms according to the feedback from clinicians. This system may provide implication for physical therapist to assess the clients' movement in the clinical setting.
Cloud-based data processing latency mainly depends on the transmission delay of data to the cloud and the used data processing algorithm. To minimize the transmission delay, it is important to compress the transferred...
Cloud-based data processing latency mainly depends on the transmission delay of data to the cloud and the used data processing algorithm. To minimize the transmission delay, it is important to compress the transferred data without reducing the quality of the data. When using data compression algorithms, it is important to validate the impact of these algorithms on the detection quality. This work evaluates the effects of image compression and transmission over wireless interfaces on state of the art neural networks. Therefore, a modern imageprocessing platform for next generation automotive processing architectures, as used in software defined vehicles, is introduced. The impacts of different image encoders as well as data transmission parameters are investigated and discussed.
This study describes a novel way for improving automatic license plate recognition (ALPR) systems, with a focus on addressing obstacles associated with indistinct license plate photos. The suggested method combines ad...
详细信息
ISBN:
(数字)9798350378177
ISBN:
(纸本)9798350378184
This study describes a novel way for improving automatic license plate recognition (ALPR) systems, with a focus on addressing obstacles associated with indistinct license plate photos. The suggested method combines advanced picture deblurring algorithms with reliable information extraction methods to improve the accuracy and dependability of ALPR systems. The major goal is to create a comprehensive pipeline capable of efficiently extracting license plate information while improving image clarity. This entails using sophisticated deblurring algorithms to reduce distortion effects and hence improve overall image quality. Subsequently, advanced computer vision algorithms recognize relevant elements such as characters and patterns, allowing for accurate content recognition. Finally, post-processing techniques are used to thoroughly evaluate and refine the collected plate content. By greatly enhancing accuracy and dependability, particularly in practical circumstances, the proposed ALPR system promises to make major contributions to traffic management and security applications.
The applications include scene interpretation, medical imageprocessing, robotic perception, video based scrutiny systems, augmented and virtual reality, among many others the image segmentation is a being a key topic...
详细信息
The paper considers the main procedures of digital processing of endoscopic images aimed at increasing their diagnostic value. It is shown that the existing approaches, solving private problems, lead to the general in...
详细信息
ISBN:
(数字)9798331541460
ISBN:
(纸本)9798331541477
The paper considers the main procedures of digital processing of endoscopic images aimed at increasing their diagnostic value. It is shown that the existing approaches, solving private problems, lead to the general inconsistency of color transformations in the endoscopic video system. A new multi-objective quality criterion for the synthesis of color transformations, taking into account both the accuracy of color reproduction and the transmission of local color contrast, is proposed. Exemplary methodologies for target-specific color transformation synthesis are developed and validated, addressing critical challenges in endoscopic image enhancement. Experimental results underscore the efficacy of the approach in maintaining color integrity while significantly amplifying the diagnostic value through enhanced contrast articulation. The proposed algorithms were evaluated on diverse datasets of endoscopic images, demonstrating a substantial improvement in local contrast and image clarity compared to existing techniques. The virtual chromoendoscopy method achieved a balance between natural color representation and enhanced diagnostic features, while the bimodal visualization approach optimized the integration of fluorescence and white-light imaging. These results contribute to the evolution of visualization and perceptually optimized imaging solutions, marking a substantial advancement in medical imageprocessing. The study establishes a robust foundation for the development of context-aware imaging methodologies in modern medical systems.
Traffic scene text detection is an important technology in the field of imageprocessing, which has significant practical significance in the applications of assisted driving, automatic driving and intelligent transpo...
详细信息
ISBN:
(数字)9798350366204
ISBN:
(纸本)9798350366211
Traffic scene text detection is an important technology in the field of imageprocessing, which has significant practical significance in the applications of assisted driving, automatic driving and intelligent transportation. However, traffic scene text detection faces many challenges, such as background complexity and text Angle diversity. In this paper, the YOLOv8 model is applied to text detection tasks in traffic scenarios. In YOLOv8s model, attention mechanism is introduced to enhance feature extraction ability. At the same time, Circular Smooth Label (CSL) is used for Angle classification, so as to achieve accurate prediction of text with different oblique *** experimental results show that compared with DBNet algorithm, F-measure on self-made datasets CTST and public datasets ICDAR2015 is improved by 1.7% and 2.1%, respectively. In addition, the proposed algorithm is 2.03 times faster than the EAST algorithm in terms of imageprocessing speed, and the detection time of a single image is about 0.03 seconds, which effectively meets the needs of real-time detection during driving.
Retinal images have been used in the diagnosis of many ocular diseases such as glaucoma and diabetic retinopathy. Here, automatic detection of optic disk (OD) is essential in deriving clinical parameters to assist cli...
详细信息
ISBN:
(数字)9781510649484
ISBN:
(纸本)9781510649484;9781510649477
Retinal images have been used in the diagnosis of many ocular diseases such as glaucoma and diabetic retinopathy. Here, automatic detection of optic disk (OD) is essential in deriving clinical parameters to assist clinical diagnosis. In fact, detecting OD center and its boundary is the essential step of most vessel segmentation, disease diagnostic, and retinal recognition algorithms. In this study, we proposed a new approach for localizing OD by combining local histogram matching and the concept of deep learning. The algorithm is composed of 4 steps, image partitioning, Local histogram matching and validation, Convolutional Neural Network (CNN) classification, and OD detection. Here, we used OD of the five reference retinal images in each dataset to extract the histograms of each color channel. Then, we calculated the mean of histograms for each channel as template for creating some OD candidates. An AlexNet-like CNN was applied to classify candidates as ODs or nonODs. The candidates used as an input to feed the CNN for final classification. In this study, we worked on three databases (one rural, MUMS-DB, and two publicly available databases, DRIVE, STARE) including 520 retinal images to evaluate the proposed method. The accuracy of our algorithm was 100%, 90%, and 95% for the DRIVE, STARE, and MUMS-DB respectively. It is shown that this method provides higher detection rates than the existing methods that have reported.
暂无评论