Low-light image enhancement (LLIE) is essential for numerous computer vision tasks, including object detection, tracking, segmentation, and scene understanding. Despite substantial research on improving low-quality im...
详细信息
Only a few clinical procedures include the use of clinical methods for the early detection, observing, evaluation, and treatment evaluation of a range of medical illnesses. Knowing the analysis of medical images in co...
详细信息
Only a few clinical procedures include the use of clinical methods for the early detection, observing, evaluation, and treatment evaluation of a range of medical illnesses. Knowing the analysis of medical images in computer vision necessitates being acquainted with the core concepts and uses of deep learning and artificial neural networks. The A rapidly expanding area of study is the Deep Learning Approach (DLA) in medical imageprocessing. DLA is often used in medical imaging to determine if an ailment is present or not. By producing speedier, more accurate results in real time, deep learning algorithms may make the jobs of radiologists and orthopaedic surgeons easier. But the standard deep learning approach has reached its efficiencies. While offering an ideal solution known as boost-Net, we study numerous optimization strategies to increase the effectiveness of deep neural networks in this research. From a selection of well-known deep learning models, Champion-Net was selected as the deep learning model. The musculoskeletal radiograph-bone classification (MURA-BC) dataset is used in this investigation. Utilizing the train and test datasets, Enhance-Net's classification precision was evaluated.
This paper presents a comparative study on the application of drone-assisted infrared thermography coupled with state-of-the-art machine learning models, including vision Transformers (ViTs) and YOLOv8, for efficient ...
详细信息
The electrocardiogram signal of the heart is used to monitor the health status and function of the human heart and to a doctor in diagnosing the type of disease. For this purpose, first, the scalogram of the different...
详细信息
Mamba, a State Space Model (SSM), has recently shown competitive performance to Convolutional Neural Networks (CNNs) and Transformers in Natural Language processing and general sequence modeling. Various attempts have...
详细信息
CEPST algorithm is an important step in imageprocessing, which is widely used in computer vision, pattern recognition, and machine learning. In order to improve the efficiency and performance of CEPST algorithm, more...
详细信息
Understanding the distribution and characteristics of impact craters on planetary surfaces is essential for unraveling geological processes and the evolution of celestial bodies. Several machine learning and AI-based ...
详细信息
Understanding the distribution and characteristics of impact craters on planetary surfaces is essential for unraveling geological processes and the evolution of celestial bodies. Several machine learning and AI-based approaches have been proposed to detect craters on planetary surface images automatically. However, designing a robust tool for an entire complex planet such as Mars, is still an open problem. This article presents a novel approach using the Faster Region-based Convolutional Neural Network (Faster R-CNN) for such a detection. The proposed method involves the pre-processing, training and crater detection steps, which are especially designed for robustness regarding latitude and complex geomorphological features. The objectives of this studies are to (i) be robust at all latitudes and (ii) for >= 1 km diameter crater sizes. (iii) To propose an open-source and re-usable algorithm that (iv) only needs an image to run. Extensive experiments on high-resolution planetary imagery demonstrate excellent performances with an average precision AP(50)>0.82 with an intersection over union criterion IoU >= 0.5, irrespective of crater scale. For mid and high latitudes (higher than 48 degrees north and south), performance decreases down to AP(50)similar to 0.7, which is still better than the current state of the art. Loss of performance is mostly due to strong shadowing effects. Our results also highlight the versatility and potential of our robust model for automating the analysis of craters across different celestial bodies. The automated crater detection tool presented in this article is publicly available as open-source and holds great promise for future scientific research of space exploration missions.
Neural volume rendering methods, especially NeRF, have demonstrated remarkable performance in novel view synthesis. However, NeRF relies solely on image data and lacks explicit geometric information, necessitating a l...
详细信息
ISBN:
(纸本)9789819607730;9789819607747
Neural volume rendering methods, especially NeRF, have demonstrated remarkable performance in novel view synthesis. However, NeRF relies solely on image data and lacks explicit geometric information, necessitating a large number of posed images and a computationally intensive ray sampling strategy to learn accurate scene representations. This poses challenges and may result in incomplete or locally optimal scene geometry when views are sparse or incomplete, as the limited views may not provide sufficient constraints to determine a unique geometry solution for complex scenes. Meanwhile, sparse point clouds provide an attractive source of scene information, especially for geometry, to complement images in neural scene representations, particularly when input views are sparse. To overcome these limitations, we propose (SNeRF)-Ne-2, a novel Neural Radiance Field that simultaneously incorporates features from both point clouds and images for volume rendering. Specifically, (SNeRF)-Ne-2 extracts patch-wise point features from point clouds and raywise image features from adjacent views. Then the scene feature volume is constructed by implicitly fusing these point and image features through self-attention. Finally, the volume feature is utilized to render novel views of the scene. Experimental results on the challenging TartanAir dataset demonstrate that, thanks to the integration of feature volume from point clouds and images, (SNeRF)-Ne-2 achieves state-of-the-art performance in novel view synthesis.
Computer vision and Biometrics benefit from the recent advances in Pattern Recognition and Artificial Intelligence, which tends to make model-based face recognition more efficient. Also, deep learning combined with da...
详细信息
Computer vision and Biometrics benefit from the recent advances in Pattern Recognition and Artificial Intelligence, which tends to make model-based face recognition more efficient. Also, deep learning combined with data augmentation tends to enrich the training sets used for learning tasks. Nevertheless, face recognition still is challenging, especially because of imaging issues that occur in practice, such as changes in lighting, appearance, head posture and facial expression. In order to increase the reliability of face recognition, we propose a novel supervised appearance-based face recognition method which creates a low-dimensional orthogonal subspace that enforces the face class separability. The proposed approach uses data augmentation to mitigate the problem of training sample scarcity. Unlike most face recognition approaches, the proposed approach is capable of handling efficiently grayscale and color face images, as well as low and high-resolution face images. Moreover, proposed supervised method presents better class structure preservation than typical unsupervised approaches, and also provides better data preservation than typical supervised approaches as it obtains an orthogonal discriminating subspace that is not affected by the singularity problem that is common in such cases. Furthermore, a soft margins Support Vector machine classifier is learnt in the low-dimensional subspace and tends to be robust to noise and outliers commonly found in practical face recognition. To validate the proposed method, an extensive set of face identification experiments was conducted on three challenging public face databases, comparing the proposed method with methods representative of the state-of-the-art. The proposed method tends to present higher recognition rates in all databases. In addition, the experiments suggest that data augmentation also plays an essential role in the appearance-based face recognition, and that the CIELAB color space (L*a*b) is generally mor
This paper presents an agricultural robotics system designed to automate the detection and manipulation of male hemp plants, addressing the challenge of manually removing these to enhance crop quality. The solution us...
详细信息
This paper presents an agricultural robotics system designed to automate the detection and manipulation of male hemp plants, addressing the challenge of manually removing these to enhance crop quality. The solution uses an industry-standard, inexpensive RGB-D camera as the input data source to derive control signals controlling the robotic arm and end effector. Input image data processing is performed by a dedicated neural network model trained using a dataset created specifically for the described task to achieve detection by stalk segmentation and postprocessing. The research involved assessing various neural network models, including UNet, DeepLabV3+, and YOLOv8 in various variants, for their capability to detect stalks accurately and swiftly. Fast operation is necessary for effective real-time feedback in robotic grasping tasks. Among tested architectures, the integration of UNet with ResNet50 was found to provide a good trade-off between detection precision and operational speed on edge AI devices. The resulting solution offers good accuracy and significantly outperforms existing methods in terms of processing speed, promising substantial improvements in agricultural robotics by enabling on-line adaptive grasping using low-cost components. The applications can be extended beyond hemp tending to include various other crops, eliminating tedious manual labor. (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic). (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic),(sic)(sic)(sic)RGB-D(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic). (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(si
暂无评论