Manufacturing process (MP) selection systems require a large amount of labelled data, typically not provided as design outputs. This issue is made more severe with the continuous development of Additive Manufacturing ...
详细信息
One of the interesting fields in video processing is motion detection and human action detection (HAR) in video. In some applications where both objects in the scene and the camera may be moving, camera movement cance...
详细信息
ISBN:
(纸本)9783031456503;9783031456510
One of the interesting fields in video processing is motion detection and human action detection (HAR) in video. In some applications where both objects in the scene and the camera may be moving, camera movement cancellation is very important to increase accuracy in extracting motion features. HAR systems usually use image matching/registration algorithms to remove the camera movement. In these methods, the source (fixed) image frame is compared with moved image frame, and the best match is determined geometrically. In video processing, due to the existence of a set of frames, one can correct errors using previous data, but at the same time, it is needed a fast frame registration algorithm. According to the above explanations, this article proposes a method to detect and minimize camera movement in video using phase information. In addition to having the acceptable speed and the ability to be implemented online, the proposed method, by combining texture and phase congruency (PC), can significantly increase the accuracy of detecting the objects in the scene. The proposed method was implemented on a HAR dataset, which includes camera movement, and its ability to compensate for camera motion and pre-serve object motion was verified. Finally, the speed and accuracy of the proposed method were compared with a number of the latest image registration methods, and its efficiency in terms of camera movement cancellation and execution time is discussed.
The increase in amount of vehicles in the past few years have made traffic management a difficult job. Technologies play an important role in these systems to regulate the traffic. Number plates are distinguished by t...
详细信息
Vision based runway identification using 'marked or unmarked terrain' image sequences captured from a fixed wing unmanned aerial vehicle through onboard stereovision sensor is presented in this paper. An innov...
详细信息
Vision based runway identification using 'marked or unmarked terrain' image sequences captured from a fixed wing unmanned aerial vehicle through onboard stereovision sensor is presented in this paper. An innovative convolutional neural netwok (CNN) based YOLO-V8 object detection algorithm is used to detect the runway during approach segment of UAV. This deep learning algorithm detects the region of interest in real time and in a computationally efficient manner. The captured unknown road segment or runway image frames are processed and examined for width, length, level and smoothness aspects to qualify as a suitable runway for UAV landings. Also, it is ensured that there are no obstacles, patches or holes on the detected road or runway. Runway start and end threshold lines and regions, touchdown point and runway edge lines are considered as the region of interest. imageprocessingalgorithms are applied on the captured runway or road images to detect strong features in the region of interest. Feature detector based imageprocessing algorithm with stereo vision constraint is used to establish the relation between unmanned aerial vehicle's center of gravity and detected runway feature points imageprocessingalgorithms like hough line detection, RANSAC, Oriented FAST and Rotated BRIEF (ORB), median filters, morphological methods are applied to extract terrain features. Based on the detected runway orientation and position with respect to UAV position. An automatic landing manoeuvre is performed by UAV autopilot to land the UAV on intended touchdown point on runway computed through detected feature points.
With the increasing integration of functional systems, nanoscale characterization has become crucial not only for material investigation but also for advancing the understanding of local behavior and optimizing perfor...
详细信息
The rise of mobile devices has spurred advancements in camera technology and image quality. However, mobile photography still faces issues like scattering and reflective flares. While previous research has acknowledge...
详细信息
This research paper presents a website that leverages the power of the image GPT engine for image generation. The website allows users to input a textual prompt and generate a corresponding image using image GPT's...
详细信息
Early detection and accurate prediction of liver disease play a crucial role in improving patient outcomes and reducing the burden on healthcare systems. Segmenting the liver and its tumors using computed tomography (...
详细信息
Early detection and accurate prediction of liver disease play a crucial role in improving patient outcomes and reducing the burden on healthcare systems. Segmenting the liver and its tumors using computed tomography (CT) images is an essential undertaking for the diagnosis and treatment of liver illnesses. Because of the uneven distribution, hazy boundaries, varied densities, forms, and sizes of lesions, segmenting the liver and associated tumor is a challenging task. Up until this point, our primary focus has been on developing deep learning algorithms that can separate the liver and its tumor from CT scan pictures of the abdomen, saving time and effort when diagnosing liver illnesses. A deep learning-based automatic segmentation method is presented that uses the improved densenet121 model to segment the liver and its tumor. In this model imageprocessing is used for the accurate automated segmentation of tumors, the proposed method demonstrates the ability to accurately segment the liver as well, as indicated by the confusion matrix obtained when comparing to the previous work on liver and tumor segmentation. The Densenet121 architecture serves as the foundation for the algorithm employed here, we introduced an autonomous technique to segment the liver from CT scans and lesions from the segmented liver region, based on semantic segmentation convolutional neural networks ( CNNs). For liver and tumor segmentations, respectively, the suggested system achieves an accuracy of 95.31% to 95.39%.
This paper delves into an innovative image recognition algorithm that merges deep learning techniques with Generative Adversarial Networks (GANs) and offers a comparative analysis against traditional image recognition...
详细信息
Artificial intelligence (AI) has been a key research area since the 1950s, initially focused on using logic and reasoning to create systems that understand language, control robots, and offer expert advice. With the r...
详细信息
暂无评论