The distribution of specific tree species and single tree information can shift down the research scale of geography and ecology, bridging research from regional to local scales. However, efficiently and finely extrac...
详细信息
Stereo dense image matching (DIM) is a key technique in generating dense 3D point clouds at low cost, among which semi-global matching (SGM) is one of the best compromise between the matching accuracy and the time cos...
Stereo dense image matching (DIM) is a key technique in generating dense 3D point clouds at low cost, among which semi-global matching (SGM) is one of the best compromise between the matching accuracy and the time cost. Most commercial or open-source DIM software packages therefore adopt SGM as the core algorithm for the 3D point generation, which computes matching results in 2D image space by simply aggregating the matching results of multi-directional 1D paths. However, such aggregations of SGM did not consider the disparity consistency between adjacent pixels in 2D image space, which will finally decrease the matching accuracy. To achieve higher-accuracy while keep the high time efficiency of SGM, this paper proposes an improved SGM with a novel matching aggregation optimization constraint. The core algorithm formulates the matching aggregation as the optimization of a global energy function, and a local solution of the energy function is utilized to impose the disparity consistency between adjacent pixels, which is capable of removing noises in the matching aggregation results and increasing the final matching accuracy at low time cost. Experiments on aerial image dataset show that the proposed method outperformed the traditional SGM method and another improved SGM method. Compared with the traditional SGM, our proposed method can increase the average matching accuracy by at most 11%. Therefore, our proposed method can applied in some smart 3D applications, e.g. 3D change detection, city-scale reconstruction, and global survey mapping.
Medical image segmentation has always been a challenging task. This paper proposes a new LUneXt medical image segmentation model based on the characteristics analysis of medical image data sets and testing of differen...
详细信息
Medical image segmentation has always been a challenging task. This paper proposes a new LUneXt medical image segmentation model based on the characteristics analysis of medical image data sets and testing of different nonlinear activation units. Both normalized activations for the original negative input image activation have good optimization capabilities for the tokenization module parameters proposed in the original UneXt model. Using different activation coefficients for different foreground and background areas has achieved better results. The experimental results of this paper on the Breast Ultrasound Images (BUSI) data set reached an intersection over union (IoU) value of 62.64%, a Dice value of 76.12%, and a single inference speed of 807.57 ms. The experimental IoU value of the International Skin Imaging Collaboration (ISIC 2018) data set reached 82.95%, and the Dice value reached 90.50%. The single inference speed reached 842.58 ms. The LUneXt model is more robust than other models. While improving model performance, it does not introduce higher computational complexity and does not have a major impact on the processing speed of a single image.
The application of the combination of unmanned aerial vehicles (UAVs) and artificial intelligence is a hot topic in the intelligent inspection of substations, and meter reading is a very challenging task. This paper p...
详细信息
The application of the combination of unmanned aerial vehicles (UAVs) and artificial intelligence is a hot topic in the intelligent inspection of substations, and meter reading is a very challenging task. This paper proposes a method based on the combination of YOLOv6n object detection and Deeplabv3 + image segmentation and performs post-processing on the segmented images to obtain meter readings. First, YOLOv6n is used to detect the meter area of the aerial image and classify the meters. Second, the detected meter images are fed into the image segmentation model. The backbone network of the Deeplabv3 + algorithm is improved by using the MobileNetv3 network, which not only effectively extracts pointers and scales, but also makes the model more lightweight. Third, License Plate Recognition Network (LPRNet) is used to recognize digital meter images. In order to solve the problem of inaccurate pointer meter readings, to begin with, the segmented image is corroded; in addition, the circular dial area is flattened into a rectangular area by concentric circle sampling method. Finally, the meter reading is calculated by the position of the pointer, the scale and the total range of the meter. The post-processing part uses numba to optimize the inference speed. The experimental results show that in two datasets, The mean average precision of 50 (mAP50) accuracy of the YOLOv6n model using this method reached 99.71% and 98.60%, respectively, and the inference speed of a single image was 17.1 ms and 13.2 ms, respectively. The mean intersection over union (mIoU) of the image segmentation model reached 82.00%, 74.73%, 73.50%, 82.26% and 73.20%, respectively, and the single segmentation speed reached 33.7 ms. The LPRNet model has a recognition accuracy of 99.17% and a single image inference speed of 14.7 ms. At the same time, several mainstream object detection and semantic segmentation algorithms are compared. The experimental results show that the method in this paper greatly im
暂无评论