This article presents a first phase’s results of research project, dedicated to developing prognostic models of User’s image with the automated Social Networks data processing methods. A pilot study carried out a co...
详细信息
Due to the vast developments in the media communication field and the quality of the visual imaging, image data compression has been one of the most interesting field. The main purpose of the image compression is to p...
详细信息
With the increasing demand for indoor localization service in our daily life, vision-based indoor localization has become a hot topic since image recording and application are very popular in the indoor environment. B...
详细信息
ISBN:
(纸本)9789811065712;9789811065705
With the increasing demand for indoor localization service in our daily life, vision-based indoor localization has become a hot topic since image recording and application are very popular in the indoor environment. Based on the epipolar geometry algorithm, more images are required in the database to achieve better localization performance, which would inevitably lead to high time consuming for image retrieval. Therefore, in this paper we propose a vision-based indoor localization method by using the BoVW (Bag of visual Word)- based image retrieval method, which could achieve less time consuming and good localization performance. The experiment results show that the localization error of the system by utilizing our proposed method could achieve an accuracy of less than 2 meters by a chance of 75%, while the time for localization sharply decreases by 60%. Compared with the traditional localization system, the proposed method could make a balance between the localization accuracy and efficiency in practice.
Single-slice CT is still widely used in many hospitals because it prolongs the life cycle of the CT device components, especially that of x-ray. Using CT 3D image, it can be very helpful for a doctor in diagnosing med...
详细信息
ISBN:
(纸本)9781728133393
Single-slice CT is still widely used in many hospitals because it prolongs the life cycle of the CT device components, especially that of x-ray. Using CT 3D image, it can be very helpful for a doctor in diagnosing medical information. In the development of 3D image reconstruction and computer technology, it is possible for the doctor to communicate with the patient through gadget media. This paper presents the determination of the threshold in single-slice Computerized Tomography (CT) for interactive 3D image reconstruction without user interface. The 3D image reconstruction method is the improved marching cube algorithm. The medical image object that consists of skull bone and sternum-pelvis in Digital Imaging and communications in Medicine (DICOM) format is taken from single-slice CT. The resulting threshold for bone 3D image object is 200, except for the 3D image with quantity slice less than 10 can not be reconstructed. The difference in the surface volume and surface area between the 3D image reconstruction output from InVesalius software and project, for skull bone and sternum-pelvis is less than 0.5%. The justification of the visual shape match from three radiology doctors for skull bone and sternum-pelvis is approximately 99% match. processing time to reconstruct the 3D image is around five minutes.
Existing Neural Style Transfer (NST) algorithms do not migrate styles well to a reasonable location where the output image can render the correct spatial structure of the object being painted. We propose a deep semant...
详细信息
ISBN:
(纸本)9789811379864;9789811379857
Existing Neural Style Transfer (NST) algorithms do not migrate styles well to a reasonable location where the output image can render the correct spatial structure of the object being painted. We propose a deep semantic matching-based multi-scale (DSM-MS) neural style transfer method, which can achieve the reasonable transfer of styles guided by the prior spatial segmentation and illumination information of input images. First, according to real drawing process, before an artist decides how to paint a stroke, he/she needs to observe and then understand subjects, segmenting space into different regions, objects and structures and analyzing the illumination conditions on each object. To simulate the two visual cognition processes, we define a deep semantic space (DSS) and propose a method for calculating DSSs using manual image segmentation, automatic illumination estimation and convolutional neural network (CNN). Second, we define a loss function, named deep semantic loss, which uses DSS to guide reasonable style transfer. Third, we propose a multi-scale optimization strategy for improving the efficiency of our method. Finally, we achieve an interdisciplinary application of our method for the first time-painterly rendering 3D scenes by neural style transfer. The experimental results show that our method can synthesize images in better original structures, with more reasonable placement of each styles and visual aesthetic feeling.
Robust visual tracking is a challenging task due to factors motion blur, fast motion, partial occlusion and illumination variation. Existing tracking algorithms represent a target candidate by templates or a linear co...
详细信息
Video anomaly detection is a very challenging task because of the rarity, openness, and the definition of the anomalies. Researchers pay more attention to the characteristics of anomalies and have proposed a variety o...
详细信息
ISBN:
(数字)9781728180687
ISBN:
(纸本)9781728180694
Video anomaly detection is a very challenging task because of the rarity, openness, and the definition of the anomalies. Researchers pay more attention to the characteristics of anomalies and have proposed a variety of anomaly detection models. However, most existing methods only use normal events to construct anomaly detection models and ignore the diversity and openness of normal events. Actually, because real-world video data often have an open-ended distribution, some normal patterns hardly ever appeared in the training data. In addition, analogous to human experience in identifying anomalies, rare abnormal events can play a certain role in the detection of similar abnormal events in the dataset. Therefore, assuming that a small number of abnormal events are known, we propose a novel supervised anomaly detection model which explicitly detects open normal events and open abnormal events in the dataset and treats open data and seen data with different classifiers. First, we use the training video to train an imbalanced classifier as the seen data classifier. Then, during the testing phase, an open data filter module isused to divide the test data into seen data and open data. Finally, we directly use the seen data classifier to generate anomaly scores for the seen test data. For the open test data, we adopt a domain adaptation method to reduce the distribution difference between it and the training data and train a new classifier to score for it. Extensive experimental results prove the effectiveness of our model.
When using traditional image search engines, smartphone users often complain about their poor user interface including poor user experience, and weak interaction. Moreover, users are unable to find a desired picture p...
详细信息
The proceedings contain 25 papers. The special focus in this conference is on Pattern Recognition and Information processing. The topics include: Brands and Caps Labeling Recognition in images Using Deep Learning;infl...
ISBN:
(纸本)9783030354299
The proceedings contain 25 papers. The special focus in this conference is on Pattern Recognition and Information processing. The topics include: Brands and Caps Labeling Recognition in images Using Deep Learning;influence of Control Parameters and the Size of Biomedical image Datasets on the Success of Adversarial Attacks;performance of Sequential Tests for Random Data Monitoring Under Distortion;equipment Condition Identification Based on Telemetry Signal Clustering;robots’ Vision Humanization Through Machine-Learning Based Artificial visual Attention;automatic Analysis of Moving Particles by Total Internal Reflection Fluorescence Microscopy;fuzzy Morphological Filters for processing of Printed Circuit Board images;Detection of Bulbar Dysfunction in ALS Patients Based on Running Speech Test;thresholding Neural Network image Enhancement Based on 2-D Non-separable Quaternionic Filter Bank;method of Creating the 3D Face Model of Character Based on Textures Maps Module;shadow Detection in Satellite images by Computing Its Characteristics;nearest Convex Hull Classifier with Simplified Proximity Measurement;image Semantic Segmentation Based on Convolutional Neural Networks for Monitoring Agricultural Vegetation;reliability Analysis Based on Incompletely Specified Data;semantic-Based Linguistic Platform for Big Data processing;cell Nuclei Counting and Segmentation for Histological image Analysis;robust Person Tracking Algorithm Based on Convolutional Neural Network for Indoor Video Surveillance Systems;modeling of Intelligent Systems Architecture Based on the Brain Topology;temporal Convolutional and Recurrent Networks for image Captioning;Video-Based Content Extraction Algorithm from Bank Cards for iOS Mobile Devices;FPGA Based Arbiter Physical Unclonable Function Implementation with Reduced Hardware Overhead;preface.
暂无评论