Text similarity calculation is an important component of many text mining techniques. Most of the existing studies only consider model fusion of textual features or semantic features, which are not suitable for long t...
详细信息
For the current brightness gain and maximum output brightness adjustment method of image intensifier has low efficiency, time-consuming, low precision, and other problems, this paper firstly studies a communication pr...
详细信息
Generative AI has demonstrated significant capabilities in text-to-video synthesis, using advanced models characterized by extensive parameters. Given that current disaster alert services are mostly text-based and can...
详细信息
Convolution neural network is widely used in the field of computervision, which can complete the tasks of target detection, image segmentation, semantic generation and so on in complex scenes. As the main optical sen...
详细信息
Type-II diabetes is growing at an alarming rate worldwide. Eventually, it leads to visual impairment by damaging the retinal blood vessels, which is known as Diabetic Retinopathy (DR). A plethora of research is ongoin...
详细信息
The recent progress in artificial intelligence has led to an ever-increasing usage of images and videos by machine analysis algorithms, mainly neural networks. Nonetheless, compression, storage and transmission of med...
详细信息
Segmentation is a challenging and important part of image analysis. Various kinds of eye diseases are identified through the retinal blood vessels, so retinal image segmentation plays an important role in medical imag...
详细信息
Traffic congestion has become one of the major issues in Bangladesh. The vehicle density on the road is slowly becoming greater than the road capacity and resulting in difficult commutes. This traffic delay leads to w...
详细信息
This paper focuses on an accurate and fast interpolation approach for image transformation employed in the design of CNN architectures. Standard Spatial Transformer Networks (STNs) use bilinear or linear interpolation...
详细信息
ISBN:
(纸本)9781665488679
This paper focuses on an accurate and fast interpolation approach for image transformation employed in the design of CNN architectures. Standard Spatial Transformer Networks (STNs) use bilinear or linear interpolation as their interpolation, with unrealistic assumptions about the underlying data distributions, which leads to poor performance under scale variations. Moreover, STNs do not preserve the norm of gradients in propagation due to their dependency on sparse neighboring pixels. To address this problem, a novel Entropy STN (ESTN) is proposed that interpolates on the data manifold distributions. In particular, random samples are generated for each pixel in association with the tangent space of the data manifold, and construct a linear approximation of their intensity values with an entropy regularizer to compute the transformer parameters. A simple yet effective technique is also proposed to normalize the non-zero values of the convolution operation, to fine-tune the layers for gradients' norm-regularization during training. Experiments on challenging benchmarks show that the proposed ESTN can improve predictive accuracy over a range of computervision tasks, including image reconstruction, and classification, while reducing the computational cost.
Ore and mineral identification is an excellent obstacle for today's geological informatization popularization work. At the same time, identification plays a vital role in many research areas. Therefore, intelligen...
详细信息
ISBN:
(纸本)9798350339765
Ore and mineral identification is an excellent obstacle for today's geological informatization popularization work. At the same time, identification plays a vital role in many research areas. Therefore, intelligent identification of ores and minerals is the basis for pushing for the work to popularize geological science. Today's computervision technology and deep learning theory made the intelligent identification of ore minerals possible. This paper introduces artificial intelligence to the field of geological science by utilizing a combination of a cloud server, recognition model, convolutional neural network, and deep learning. This integration realizes the 'face ID' for minerals and rocks, which can help people in the industry and the public quickly identify minerals and rocks found in various environments. At the same time, it can widely promote geology's informatization and the work's *** the research and analysis of ore identification-related technologies, the author selected the Pytorch framework to optimize the ResNet50 deep learning model to identify ore samples. The main contents of the essay are as follows:(1)Fully studied and researched the related technologies in computervision technology and ore identification, developed an in-depth understanding of the identification framework, analyzed its shortcomings, and finally chose the Pytorch framework to identify ore samples.(2)Eleven ore samples were selected to make the dataset, including quartz, biotite, bornite, malachite, granite, limestone, and malachite.(3)The image is denoised. Then, combined with the preprocessed images, data augmentation is performed on the ore color image by rotation and translation to reduce the sample imbalance.(4)The ResNet50 model is optimized based on the Pytorch framework, and the optimized model is used to prepare, evaluate, and validate the data set. As a result, the resulting model achieved an impressive recognition accuracy rate of 98%.(5)Combining t
暂无评论