Segmentation of lung nodules is critical to computer-aided diagnosis systems for lung cancer diagnosis. In recent times, withthe application of deep learning in medical imageprocessing, novel architectures have been...
详细信息
this paper proposes a novel algorithm to recover blindly a sharp image from its degraded form by pixel pattern classification and filtering by using the deconvolution filters trained for the corresponding pixel patter...
详细信息
Numerous 3-D GIS are being developed today that are both commercially and freely available. A multicity web-based 3-D GIS (named as GLC3d) using open source and freely available software/packages exclusively has been ...
详细信息
ISBN:
(纸本)9789811021046;9789811021039
Numerous 3-D GIS are being developed today that are both commercially and freely available. A multicity web-based 3-D GIS (named as GLC3d) using open source and freely available software/packages exclusively has been developed. this paper presents the architecture, design, and overview of GLC3d. Open source tools and software's QGIS, Cesium, and MySQL are employed to develop this application. QGIS is utilized for data preparation such as raster, vector, and subsidiary data. MySQL is utilized as database store and data population. GLC3d is based on Cesium (a JavaScript library for creating 3-D globes and 2-D maps in a web browser) and WebGL (a JavaScript API for rendering interactive 3-D computergraphics and 2-D graphics) to display 3-D Globe on the web browser. 3-D visualization for the urban/city geodata is presented on an interactive 3-D Globe. Various city information are generated and incorporated in to the 3-D WebGIS for data representation and decision making.
Detecting and managing various types of defects that occur in the manufacturing process is important for product quality control. Detecting flaws in product presentation is an ongoing research topic in computervision...
详细信息
We consider the license plate re-identification task, treated here as a one-shot image retrieval problem. Our objective is to learn a feature representation for license plate images, such that a single training image ...
详细信息
ISBN:
(纸本)9781450366151
We consider the license plate re-identification task, treated here as a one-shot image retrieval problem. Our objective is to learn a feature representation for license plate images, such that a single training image of a given license plate (referred to as a template image) is sufficient to perform nearest-neighbour retrieval with high accuracy at test time. Also, the feature representation should ideally be generalisable across datasets and should be extractable in real-time on resource-constrained embedded hardware or a moderately powerful cellphone. We evaluate representations from person re-identification (re-id) literature, learned from a trained deep convolutional network as well withthose derived from a trained Fisher vector. While the convolutional network features perform better than the Fisher vector, we obtain comparable results from a hybrid model projecting the Fisher vector into a lower-dimensional space via two fully connected layers called f2nn using the triplet loss. the proposed hybrid model f2nn generates features which outperform and generalise better than convolutional features on datasets dissimilar to the training corpus. the model can be trained in stages and takes significantly less time to extract features. Further, it uses much smaller feature dimensions for license plate images resulting in faster re-identification, and is therefore well-suited for resource-constrained platforms such as mobile devices.
Few-shot image classification is a critical issue in the field of computervision, facing challenges related to data scarcity and model generalization. Transformer models, representing self-attention mechanisms, have ...
详细信息
ISBN:
(纸本)9798350349122;9798350349115
Few-shot image classification is a critical issue in the field of computervision, facing challenges related to data scarcity and model generalization. Transformer models, representing self-attention mechanisms, have made significant strides in recent years in the domain of few-shot classification. this paper commences with an introduction to the background and challenges of few-shot classification, along with a description of the principles and structure of the Transformer model. Subsequently, the paper categorizes Transformer-based few-shot image classification methods into meta-learning-based, metric-learning-based, fine-tuning-based, and feature-enhancement-based approaches, whose theoretical foundations of each method are expounded and the comparative analysis of representative algorithms are also provided. Furthermore, the paper delves into prospective research directions in this field.
this book constitutes the refereed proceedings of the 6th International conference on Recent Trends in imageprocessing and Pattern Recognition, RTIP2R 2023, held in Derby, UK, during December 2023, in collaboration w...
详细信息
ISBN:
(数字)9783031530852
ISBN:
(纸本)9783031530845
this book constitutes the refereed proceedings of the 6th International conference on Recent Trends in imageprocessing and Pattern Recognition, RTIP2R 2023, held in Derby, UK, during December 2023, in collaboration withthe Applied AI Research Lab at the University of South Dakota.;the 62 full papers included in this book were carefully reviewed and selected from 216 submissions. the papers are organized in the following topical sections:;Volume I:;Artificial intelligence and applied machine learning; applied imageprocessing and pattern recognition; and biometrics and applications.;Volume II:;Healthcare informatics; pattern recognition in blockchain, IOT, cyber plus network security, and cryptography.
Automatic techniques to recognize and evaluate digital logic circuits are more efficient and require less human intervention, as compared to, traditional pen and paper methods. In this paper, we propose LEONARDO (Logi...
详细信息
images acquired in hazy conditions have degradations induced in them. Dehazing such images is a vexed and ill-posed problem. Scores of prior-based and learning-based approaches have been proposed to mitigate the effec...
详细信息
ISBN:
(数字)9783031581816
ISBN:
(纸本)9783031581809;9783031581816
images acquired in hazy conditions have degradations induced in them. Dehazing such images is a vexed and ill-posed problem. Scores of prior-based and learning-based approaches have been proposed to mitigate the effect of haze and generate haze-free images. Many conventional methods are constrained by their lack of awareness regarding scene depth and their incapacity to capture long-range dependencies. In this paper, a method that uses residual learning and vision transformers in an attention module is proposed. It essentially comprises two networks: In the first one, the network takes the ratio of a hazy image and the approximated transmission matrix to estimate a residual map. the second network takes this residual image as input and passes it through convolution layers before superposing it on the generated feature maps. It is then passed through global context and depth-aware transformer encoders to obtain channel attention. the attention module then infers the spatial attention map before generating the final haze-free image. Experimental results including several quantitative metrics demonstrate the efficiency and scalability of the suggested methodology.
Fingerprint-based human authentication being most widely deployed systems are also exposed to many security threats. Among all, spoofing attacks is widely attempted that involves circumventing the sensor module of the...
详细信息
ISBN:
(纸本)9781665483285;9781665483292
Fingerprint-based human authentication being most widely deployed systems are also exposed to many security threats. Among all, spoofing attacks is widely attempted that involves circumventing the sensor module of the system by presenting a fake replica of the original trait of a genuine user. A counter measuring mechanism is deployed in these systems that intelligently measure the vitality characteristics of a presented fingerprint image. these sub-modules are integrated besides sensing device of the system and popularly known as anti-spoofing methods or fingerprint spoof detection (FSD) mechanisms. the evaluation of data-driven enabled computervision paradigm has facilitated researchers to design intelligent FSD. Pre-processing data to train these FSD models is required to improve the overall quality of the input images. this paper presents a focused review of pre-processing approaches used in deep learning-based FSD models. the proposed research study has clearly observed that image pre-processing is a vital step to significantly improve the overall accuracy and efficiency of FSD approaches.
暂无评论