In this paper YOLOv8 deeplearning model is proposed for vehicle detection, classification, and counting for urban traffic surveillance applications on custom dataset. The model was trained with images and annotations...
详细信息
Accurate identification of fish species is critical for applications in fisheries management., biodiversity monitoring., and conservation efforts. Conventional manual identification techniques take a lot of time and a...
详细信息
Edible mushrooms are rich in nutrients;however,harvesting mainly relies on manual *** localization of each mushroom is necessary to enable a robotic arm to accurately pick edible *** studies used detection algorithms ...
详细信息
Edible mushrooms are rich in nutrients;however,harvesting mainly relies on manual *** localization of each mushroom is necessary to enable a robotic arm to accurately pick edible *** studies used detection algorithms that did not consider mushroom pixel-level *** these algorithms are combined with a depth map,the information is ***,in instance segmentation algorithms,convolutional neural network(CNN)-based methods are lightweight,and the extracted features are not *** guarantee real-time location detection and improve the accuracy of mushroom segmentation,this study proposed a new spatial-channel transformer network model based on Mask-CNN(SCT-Mask-RCNN).The fusion of Mask-RCNN with the self-attention mechanism extracts the global correlation outcomes of image features from the channel and spatial ***,Mask-RCNN was used to maintain a lightweight structure and extract local features using a spatial pooling pyramidal structure to achieve multiscale local feature fusion and improve detection *** results showed that the SCT-Mask-RCNN method achieved a segmentation accuracy of 0.750 on segm_Precision_mAP and detection accuracy of 0.638 on Bbox_Precision_*** to existing methods,the proposed method improved the accuracy of the evaluation metrics Bbox_Precision_mAP and segm_Precision_mAP by over 2%and 5%,respectively.
With the development of deeplearning, deep convolution neural networks for medical image segmentation tasks have become more and more complex in pursuit of higher accuracy. In most scenarios, medical image segmentati...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
With the development of deeplearning, deep convolution neural networks for medical image segmentation tasks have become more and more complex in pursuit of higher accuracy. In most scenarios, medical image segmentation pursues accuracy rather than speed, However, real-time performance is crucial in some scenarios, such as surgical navigation and diagnosis of acute stroke. So design of high-precision, lightweight and real-time medical image segmentation network has become an urgent need. To this end, a novel lightweight dual-domain network (LDD-Net) has been proposed in this paper. LDD-Net is comprised of two branches, learning respectively from the frequency domain and the spatial domain. In the frequency domain branch, the image spatial resolution is compressed via discrete cosine transform to have a large receptive field, so that better semantic context features can be learned. In the spatial domain branch, high-resolution feature representations with more details are learned. Finally, the learned features of these two branches are fused to yield high accuracy with low computational cost. The proposed method has been validated on two medical image segmentation datasets to yield the state-of-the-art performances with greatly reduced inference time and parameters of the learned models.
The automatic defect detection of casting X-ray inspection images based on deeplearning has the problems of single defect sample topography and poor network generalization. Therefore, the research on casting inspecti...
详细信息
Low probability of intercept (LPI) radar waveform recognition is a crucial branch in the field of electronic reconnaissance, serving as a vital method for acquiring information from non-cooperative radar sources. In r...
详细信息
Obesity rates have surged globally, necessitating effective weight management strategies. Documenting dietary intake is pivotal for weight loss management. This paper proposes a novel deeplearning-based approach, Foo...
详细信息
Due to the sparsity of point clouds obtained by LIDAR, the depth information is usually not complete and dense. The depth completion task is to recover dense depth information from sparse depth information. However, m...
详细信息
Due to the sparsity of point clouds obtained by LIDAR, the depth information is usually not complete and dense. The depth completion task is to recover dense depth information from sparse depth information. However, most of the current deep completion networks use RGB images as guidance, which are more like a processing method of information fusion. They are not valid when there is only sparse depth data and no other color information. Therefore, this paper proposes an information-reinforced completion network for a single sparse depth input. We use a multi-resolution dense progressive fusion structure to maximize the multi-scale information and optimize the global situation by point folding. At the same time, we re-aggregate the confidence and impose another depth constraint on the pixel depth to make the depth estimation closer to the ground trues. Our experimental results on KITTI and NYU Depth v2 datasets show that the proposed network achieves better results than other unguided deep completion methods. And it is excellent in both accuracy and real-time performance.
A fast scanner of optical-resolution photoacoustic microscopy is inherently vulnerable to perturbation, leading to severe image distortion and significant misalignment among multiple 2D or 3D images. Restoration and r...
详细信息
A fast scanner of optical-resolution photoacoustic microscopy is inherently vulnerable to perturbation, leading to severe image distortion and significant misalignment among multiple 2D or 3D images. Restoration and registration of these images is critical for accurately quantifying dynamic information in long-term imaging. However, traditional registration algorithms face a great challenge in computational throughput. Here, we develop an unsupervised deeplearning based registration network to achieve real-timeimage restoration and registration. This method can correct artifacts from B-scan distortion and remove misalignment among adjacent and repetitive images in realtime. Compared with conventional intensity based registration algorithms, the throughput of the developed algorithm is improved by 50 times. After training, the new deeplearning method performs better than conventional feature based image registration algorithms. The results show that the proposed method can accurately restore and register the images of fast-scanning photoacoustic microscopy in realtime, offering a powerful tool to extract dynamic vascular structural and functional information.
The contemporary landscape of sophisticated and intelligent jamming techniques presents significant challenges for traditional radar anti-jamming methods, necessitating advancements in radar operational capabilities. ...
详细信息
The contemporary landscape of sophisticated and intelligent jamming techniques presents significant challenges for traditional radar anti-jamming methods, necessitating advancements in radar operational capabilities. To address the intricate and dynamic nature of jamming scenarios, alongside the limitations in performance assurance of manually crafted anti-jamming strategies and the suboptimal real-time responsiveness of radar systems, this study introduces an intelligent decision-making model founded on deep reinforcement learning (DRL). This model is meticulously structured, comprising a defined action space, state space, and reward function. Concurrently, the paper advocates a novel radar anti-jamming strategy learning approach based on the deep Q-Network (DQN), adept at mitigating external malicious interference. This approach enhances the integration efficiency and the doppler frequency resolution in radar echo processing. Comparative simulation outcomes affirm the superiority of the proposed intelligent decision model and training methodology over established methods like Proximal Policy Optimization (PPO) and Q-learning. Notably, the model demonstrates enhanced jamming suppression, robust generalization capabilities, accelerated response times, and a significant augmentation in the radar's autonomous decision-making prowess.
暂无评论