the proceedings contain 8 papers. the special focus in this conference is on patternrecognitionapplications and methods. the topics include: Retinotopic Image Encoding by Samples of Counts;gesture recognition and...
ISBN:
(纸本)9783031245374
the proceedings contain 8 papers. the special focus in this conference is on patternrecognitionapplications and methods. the topics include: Retinotopic Image Encoding by Samples of Counts;gesture recognition and Multi-modal Fusion on a New Hand Gesture Dataset;Reduced Precision Research of a GAN Image Generation Use-case;similarity Constrained Conditional Generative Auto-encoder with Generalized Dilated Networks;perusal of Camera Trap Sequences Across Locations;preface.
the proceedings contain 74 papers. the topics discussed include: multi-level feature selection for oriented object detection;demonstrating the vulnerability of RGB-D based face recognition to GAN-generated depth-map i...
ISBN:
(纸本)9789897584862
the proceedings contain 74 papers. the topics discussed include: multi-level feature selection for oriented object detection;demonstrating the vulnerability of RGB-D based face recognition to GAN-generated depth-map injection;speech recognition using deep canonical correlation analysis in noisy environments;capsule networks with intersection over union loss for binary image segmentation;generalized dilation structures in convolutional neural networks;exploring motion boundaries in an end-to-end network for vision-based Parkinson’s severity assessment;converting image labels to meaningful and information-rich embeddings;exploring slow feature analysis for extracting generative latent factors;state tracking in the presence of heavy-tailed observations;and active output selection strategies for multiple learning regression models.
the proceedings contain 10 papers. the special focus in this conference is on patternrecognitionapplications and methods. the topics include: Interactive Design Support for Architecture Projects During Early Phases ...
ISBN:
(纸本)9783030054984
the proceedings contain 10 papers. the special focus in this conference is on patternrecognitionapplications and methods. the topics include: Interactive Design Support for Architecture Projects During Early Phases Based on Recurrent Neural Networks;CNN-Based Deep Spatial Pyramid Match Kernel for Classification of Varying Size Images;earth Mover’s Distance Between Rooted Labeled Unordered Trees Formulated from Complete Subtrees;TIMIT and NTIMIT Phone recognition Using Convolutional Neural Networks;An Efficient Hashing Algorithm for NN Problem in HD Spaces;stochastic Analysis of Time-Difference and Doppler Estimates for Audio Signals;detection and Classification of Faulty Weft threads Using Both Feature-Based and Deep Convolutional Machine Learning methods;video Activity recognition Using Sequence Kernel Based Support Vector Machines.
How to do patternrecognition without artificial neural networks, Bayesian classifiers, vector support machines and other mechanisms that are widely used for machine learning? the problem withpatternrecognition mach...
详细信息
ISBN:
(纸本)9789897584862
How to do patternrecognition without artificial neural networks, Bayesian classifiers, vector support machines and other mechanisms that are widely used for machine learning? the problem withpatternrecognition machines is time and energy demanding training because lots of coefficients need to be worked out. the paper introduces an indexing model that performs training by memorizing inverse patterns mostly avoiding any calculations. the computational experiments indicate the potential of the indexing model for artificial intelligence applications and, possibly, its relevance to neurobiological studies as well.
the identification of source cameras from videos, though it is a highly relevant forensic analysis topic, has been studied much less than its counterpart that uses images. In this work we propose a method to identify ...
详细信息
ISBN:
(纸本)9789897584862
the identification of source cameras from videos, though it is a highly relevant forensic analysis topic, has been studied much less than its counterpart that uses images. In this work we propose a method to identify the source camera of a video based on camera specific noise patterns that we extract from video frames. For the extraction of noise pattern features, we propose an extended version of a constrained convolutional layer capable of processing color inputs. Our system is designed to classify individual video frames which are in turn combined by a majority vote to identify the source camera. We evaluated this approach on the benchmark VISION data set consisting of 1539 videos from 28 different cameras. To the best of our knowledge, this is the first work that addresses the challenge of video camera identification on a device level. the experiments show that our approach is very promising, achieving up to 93:1% accuracy while being robust to the WhatsApp and YouTube compression techniques. this work is part of the EU-funded project 4NSEEK focused on forensics against child sexual abuse.
In this paper, we propose a blended Attention-Connectionist Temporal Classification (CTC) network architecture for a unique script, Amharic, text-image recognition. Amharic is an indigenous Ethiopic script that uses 3...
详细信息
ISBN:
(纸本)9789897584862
In this paper, we propose a blended Attention-Connectionist Temporal Classification (CTC) network architecture for a unique script, Amharic, text-image recognition. Amharic is an indigenous Ethiopic script that uses 34 consonant characters withtheir 7 vowel variants of each and 50 labialized characters which are derived, with a small change, from the 34 consonant characters. the change involves modifying the structure of these characters by adding a straight line, or shortening and/or elongating one of its main legs including the addition of small diacritics to the right, left, top or bottom of the character. Such a small change affects orthographic identities of character and results in shape similarly among characters which are interesting, but challenging task, for OCR research. Motivated withthe recent success of attention mechanism on neural machine translation tasks, we propose an attention-based CTC approach which is designed by blending attention mechanism directly within the CTC network. the proposed model consists of an encoder module, attention module and transcription module in a unified framework. the efficacy of the proposed model on the Amharic language shows that attention mechanism allows learning powerful representations by integrating information from different time steps. Our method outperforms state-of-the-art methods and achieves 1.04% and 0.93% of the character error rate on ADOCR test datasets.
this book contains revised and extended versions of selected papers from the 10th and 11;internationalconference on patternrecognition, icpram 2021 and 2022, held in February 2021 and 2022. Due to COVID-19 pandemic ...
详细信息
ISBN:
(数字)9783031245381
ISBN:
(纸本)9783031245374
this book contains revised and extended versions of selected papers from the 10th and 11;internationalconference on patternrecognition, icpram 2021 and 2022, held in February 2021 and 2022. Due to COVID-19 pandemic the conferences were held virtually. Bothconferences received in total 204 submissions from which 8 full papers were carefully reviewed and selected for presentation in this volume. the papers span a wide range of investigation as well as development lines, which of course always reflect the last trends of research in the patternrecognition community.
A directional wideband microstrip line fed rectangular patch antenna has been proposed for the 28 GHz 5G applications. Initially, a conventional rectangular microstrip patch antenna has been designed thereafter to enh...
详细信息
the proceedings contain 44 papers. the topics discussed include: a wideband rectangular microstrip patch antenna with partial ground plane for 5g applications;development of a Bangla speech to text conversion system u...
ISBN:
(纸本)9781665449212
the proceedings contain 44 papers. the topics discussed include: a wideband rectangular microstrip patch antenna with partial ground plane for 5g applications;development of a Bangla speech to text conversion system using deep learning;automatic identification of mice social behavior through multi-modal latent space clustering;a deep learning approach for Bangla image captioning system;semantic representation of sentences employing an automated threshold;a deep learning approach for Bangla speech to text conversion;a systematic review on the chronological development of Bangla sign language recognition systems;static output feedback stabilizing control for Takagi-Sugeno fuzzy systems;improved transfer learning architecture to classify covid-19 affected chest x-rays using noisy student pre-training;human action recognition based on a sequential deep learning model;and a vision-based lane detection approach for autonomous vehicles using a convolutional neural network architecture.
暂无评论