the proceedings contain 8 papers. the special focus in this conference is on patternrecognitionapplications and methods. the topics include: Real-World Indoor Location Assessment with Unmodified RFID Antennas;M...
ISBN:
(纸本)9783031547256
the proceedings contain 8 papers. the special focus in this conference is on patternrecognitionapplications and methods. the topics include: Real-World Indoor Location Assessment with Unmodified RFID Antennas;MSAA-Net: Multi-Scale Attention Assembler Network Based on Multiple Instance Learning for Pathological Image Analysis;analysis of Generative Data Augmentation for Face Antispoofing;improving Person Re-identification through Low-Light Image Enhancement;gender-Aware Speech Emotion recognition in Multiple Languages;patternrecognition Techniques in Image-Based Material Classification of Ancient Manuscripts.
the proceedings contain 8 papers. the special focus in this conference is on patternrecognitionapplications and methods. the topics include: Retinotopic Image Encoding by Samples of Counts;gesture recognition and...
ISBN:
(纸本)9783031245374
the proceedings contain 8 papers. the special focus in this conference is on patternrecognitionapplications and methods. the topics include: Retinotopic Image Encoding by Samples of Counts;gesture recognition and Multi-modal Fusion on a New Hand Gesture Dataset;Reduced Precision Research of a GAN Image Generation Use-case;similarity Constrained Conditional Generative Auto-encoder with Generalized Dilated Networks;perusal of Camera Trap Sequences Across Locations;preface.
this article presents a solution for Speech Emotion recognition (SER) in multilingual setting using a hierarchical approach. the approach involves two levels, the first level identifies the gender of the speaker, whil...
详细信息
ISBN:
(纸本)9783031547256;9783031547263
this article presents a solution for Speech Emotion recognition (SER) in multilingual setting using a hierarchical approach. the approach involves two levels, the first level identifies the gender of the speaker, while the second level predicts their emotional state. We evaluate the performance of three classifiers of increasing complexity: k-NN, transfer learning based on YAMNet, and Bidirectional Long Short-Term Memory neural networks. the models were trained, validated, and tested on a dataset that includes the big-six emotions and was collected from well-known SER datasets representing six different languages. Our results indicate that there are differences in classification accuracy when considering all data versus only female or male data, across all classifiers. Interestingly, prior knowledge of the speaker's gender can improve the overall classification performance.
the distance transform (DT) serves as a crucial operation in numerous image processing and patternrecognitionmethods, finding broad applications in areas such as skeletonization, map-matching robot self-localization...
详细信息
ISBN:
(纸本)9783031547256;9783031547263
the distance transform (DT) serves as a crucial operation in numerous image processing and patternrecognitionmethods, finding broad applications in areas such as skeletonization, map-matching robot self-localization, biomedical imaging, and analysis of binary images. the concept of DT computation can also be extended to non-grid structures and graphs for the calculation of the shortest paths within a graph. this paper introduces two distinct algorithms: the first calculates the DT within a connected plane graph, while the second is designed to compute the DT in a binary image. Both algorithms demonstrate parallel logarithmic complexity of O(log(n)), with n representing the maximum diameter of the largest region in either the connected plane graph or the binary image. To attain this level of complexity, we make the assumption that a sufficient number of independent processing elements are available to facilitate massively parallel processing. Bothmethods utilize the hierarchical irregular pyramid structure, thereby retaining topological information across regions. these algorithms operate entirely on a local level, making them conducive to parallel implementations. the GPU implementation of these algorithms showcases high performance, with memory bandwidth posing the only significant constraint. the logarithmic complexity of the algorithms boosts execution speed, making them particularly suited to handling large images.
this paper describes a method of real-time facial-feature extraction which is based on matching techniques. the method is composed of facial-area extraction and mouth-area extraction using colour histogram matching, a...
详细信息
pattern classification has been successfully applied in many problem domains, such as biometric recognition, document classification or medical diagnosis. Missing or unknown data are a common drawback that pattern rec...
详细信息
pattern classification has been successfully applied in many problem domains, such as biometric recognition, document classification or medical diagnosis. Missing or unknown data are a common drawback that patternrecognition techniques need to deal with when solving real-life classification tasks. Machine learning approaches and methods imported from statistical learning theory have been most intensively studied and used in this subject. the aim of this work is to analyze the missing data problem in pattern classification tasks, and to summarize and compare some of the well-known methods used for handling missing values.
作者:
Pham, Tuan D.Univ Aizu
Aizu Res Cluster Med Engn & Informat Ctr Adv Informat Sci & Technol Aizu Wakamatsu Fukushima 9658580 Japan
Artificial-intelligence (AI)-base patternrecognition is of particular interests to many scientific disciplines ranging from life science to engineering. Practical applications of pattern or object recognitionmethods...
详细信息
ISBN:
(纸本)9781479926053;9781479926046
Artificial-intelligence (AI)-base patternrecognition is of particular interests to many scientific disciplines ranging from life science to engineering. Practical applications of pattern or object recognitionmethods are numerous but still encountering many problems including the inherent difficulty in computerized feature extraction and classification. this paper proposes a strategy for object recognition resembling the active template matching strategy in birds. Experimental results on several databases suggest that using the active vision processing can improve classification rates implemented with various classifiers.
Classifying ancient manuscripts based on their writing surfaces often becomes essential for palaeographic research, including writer identification, manuscript localization, date estimation, and, occasionally, forgery...
详细信息
ISBN:
(纸本)9783031547256;9783031547263
Classifying ancient manuscripts based on their writing surfaces often becomes essential for palaeographic research, including writer identification, manuscript localization, date estimation, and, occasionally, forgery detection. Researchers continually perform corroborative tests to classify manuscripts based on physical materials. However, these tests, often performed on-site, require actual access to the manuscript objects. these procedures involve specific expertise in manuscript handling, a considerable amount of time, and cost. Additionally, any physical inspection can accidentally damage ancient manuscripts that already suffer degradation due to aging, natural corrosion, and damaging chemical treatments. Developing a technique to classify such documents using noninvasive techniques with only digital images can be extremely valuable and efficient. this study uses images from a famous historical collection, the Dead Sea Scrolls, to propose a novel method to classify the materials of the manuscripts. the proposed classifier uses the two-dimensional Fourier transform to identify patterns within the manuscript surfaces. Combining a binary classification system employing the transform with a majority voting process is adequate for this classification task. this initial study shows a classification percentage of up to 97% for a confined amount of manuscripts produced from either parchment or papyrus material. In the extended work, this study proposes a hierarchical k-means clustering method to group image fragments that are highly likely to originate from a similar source using color and texture features calculated on the image patches, achieving 77% and 68% for color and texture clustering with 100% accuracy on primary material classification. Finally, this study explores a convolutional neural network model in a self-supervised Siamese setup with a large number of images that obtains an accuracy of 85% on the pretext task and an accuracy of 66% on the goal task to
the covariance analysis of linear predictive coding has wide applications, especially in speech recognition and speech signal processing. Real-time applications demand very high processing speed for linear predictive ...
详细信息
暂无评论