Recent research on face analysis has demonstrated the richness of information embedded in feature vectors extracted from a deep convolutional neural network. Even though deep learning achieved a very high performance ...
Drawing tests have long been used by practitioners for early screening of a number of psychological and neurological impairments. These brain functioning tests are used by psychologists to understand the feelings, personality and reactions of individuals to different circumstances. Among these, the Human Figure Drawing Test (HFDT) is a popular instrument for the assessment of the cognitive functioning of individuals. While the HFDT has various dimensions, the focus of this study lies on the face of the drawn figure. A computerized system is proposed that analyzes hand-drawn facial images to extract the expression in the image. A sketch of a human face drawn by the subject is fed to the system; the image is then binarized and segmented into different facial components. Features (based on local binary patterns, gray-level co-occurrence matrices and histograms of oriented gradients) computed from the facial components are used to train an SVM classifier to distinguish between four expression classes: 'happy', 'sad', 'angry' and 'neutral'. The system, evaluated on a custom-developed database of sketches, yielded promising results. The developed system could serve as a useful module toward the development of a complete automated system for scoring the human figure drawing test.
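The feature pipeline this abstract describes can be sketched in a few lines. The sketch below is purely illustrative, not the authors' implementation: the LBP and GLCM-contrast computations are minimal NumPy versions of those descriptors (HOG is omitted for brevity), the two synthetic "sketch patch" classes are stand-ins for real facial components, and all parameter values are assumptions.

```python
import numpy as np
from sklearn.svm import SVC  # linear SVM, as named in the abstract

def lbp_hist(img):
    """8-neighbour local-binary-pattern codes as a normalized 256-bin histogram."""
    H, W = img.shape
    c = img[1:-1, 1:-1]
    code = np.zeros_like(c, dtype=np.int32)
    shifts = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]
    for bit, (dy, dx) in enumerate(shifts):
        nb = img[1 + dy:H - 1 + dy, 1 + dx:W - 1 + dx]
        code |= (nb >= c).astype(np.int32) << bit  # compare each neighbour to centre
    h, _ = np.histogram(code, bins=256, range=(0, 256))
    return h / h.sum()

def glcm_contrast(img):
    """GLCM contrast for offset (0, 1): mean squared difference of adjacent pixels."""
    d = np.diff(img.astype(float), axis=1)
    return np.mean(d * d)

def features(img):
    return np.concatenate([lbp_hist(img), [glcm_contrast(img)]])

# Toy training set: smooth vs. strongly textured patches as two "classes".
rng = np.random.default_rng(0)
X, y = [], []
for _ in range(10):
    smooth = 100 + rng.integers(-3, 4, (32, 32))
    stripes = 255 * ((np.add.outer(np.arange(32), np.arange(32)) // 4) % 2)
    stripes = stripes + rng.integers(-3, 4, (32, 32))
    X += [features(smooth), features(stripes)]
    y += [0, 1]
clf = SVC(kernel="linear").fit(np.array(X), np.array(y))
print(clf.score(np.array(X), np.array(y)))
```

In the described system, each segmented facial component would yield one such feature vector, and the concatenated vectors would be labeled with one of the four expression classes instead of these toy labels.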
The automatic transcription of unconstrained continuous handwritten text requires well trained recognition systems. The semi-supervised paradigm introduces the concept of not only using labeled data but also unlabeled...
An approach to tracking the trajectories of 3-D human motion from image sequences, based on adaptive foreground segmentation and particle filtering, is proposed in this paper. First, a Gaussian model for image pixels is presented. Based on this, adaptive segmentation of the human body is performed using the information of the difference image and the prior distribution of pixel density. Then, the tracking model under perspective imaging for the body plane is established. Because the image function is nonlinear and the distribution of the noise in the images is unknown, particle-filter-based tracking is used. Finally, the 3-D trajectory of the body plane is obtained. Experimental results show that the 3-D trajectory of the body plane can be effectively tracked, and that the tracking results of the particle filter are better than those of the extended Kalman filter for this human motion tracking problem.
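The core predict–weight–resample loop of a bootstrap particle filter, which the abstract motivates by the nonlinear image function and unknown noise distribution, can be sketched on a 1-D toy trajectory. This is a generic textbook filter, not the paper's perspective-imaging tracking model; the random-walk motion model and Gaussian likelihood are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
T, N = 50, 500
true_x = np.cumsum(rng.normal(0, 0.5, T))   # latent 1-D trajectory
obs = true_x + rng.normal(0, 1.0, T)        # noisy observations of it

particles = rng.normal(0, 1.0, N)
est = []
for z in obs:
    particles = particles + rng.normal(0, 0.5, N)     # predict: random-walk motion
    w = np.exp(-0.5 * ((z - particles) / 1.0) ** 2)   # weight: Gaussian likelihood
    w /= w.sum()
    est.append(np.sum(w * particles))                 # posterior-mean estimate
    particles = particles[rng.choice(N, N, p=w)]      # multinomial resampling

rmse = np.sqrt(np.mean((np.array(est) - true_x) ** 2))
print(rmse)  # well below the observation-noise level
```

In the paper's setting the state would be the 3-D body-plane parameters and the likelihood would come from the segmented foreground, but the loop structure is the same.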
The problem of egomotion recovery has been treated by using local image motion as input, with published algorithms utilizing the geometric constraint relating 2-D local image motion (optical flow, correspondence, derivatives of the image flow) to 3-D motion and structure. Since it has proved very difficult to achieve accurate input (local image motion), a lot of effort has been devoted to the development of robust techniques. A new approach to the problem of egomotion estimation is taken, based on constraints of a global nature. It is proved that local normal flow measurements form global patterns in the image plane. The position of these patterns is related to the three-dimensional motion parameters. By locating some of these patterns, which depend only on subsets of the motion parameters, through a simple search technique, the 3-D motion parameters can be found. The proposed algorithmic procedure is very robust, since it is not affected by small perturbations in the normal flow measurements. In fact, since only the sign of the normal flow measurement is employed, the direction of translation and the axis of rotation can be estimated even with up to 100% error in the image measurements.
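The sign-only global search can be illustrated for the purely translational case, where the translation direction projects to a focus of expansion (FOE) in the image: each candidate FOE predicts the sign of every normal flow measurement, and the candidate whose predicted sign pattern matches best wins. The synthetic flow field, grid search, and scene depths below are illustrative assumptions, not the paper's algorithm as published.

```python
import numpy as np

rng = np.random.default_rng(2)
H = W = 32
ys, xs = np.mgrid[0:H, 0:W].astype(float)
foe = np.array([20.0, 12.0])                 # true focus of expansion (x, y)

# Translational image motion radiates from the FOE, scaled by unknown depth.
depth = rng.uniform(1.0, 5.0, (H, W))
u, v = (xs - foe[0]) / depth, (ys - foe[1]) / depth

# Normal flow: projection of the flow onto a random gradient direction per pixel.
theta = rng.uniform(0, 2 * np.pi, (H, W))
nx, ny = np.cos(theta), np.sin(theta)
sign = np.sign(u * nx + v * ny)              # only the SIGN is kept

# Search: each candidate FOE predicts a sign pattern; keep the best match.
best, best_score = None, -1.0
for cy in range(H):
    for cx in range(W):
        pred = np.sign((xs - cx) * nx + (ys - cy) * ny)
        score = np.mean(pred == sign)
        if score > best_score:
            best, best_score = (cx, cy), score
print(best)  # recovers (20, 12)
```

Because depth is positive, it never flips the sign of the normal flow, which is why the sign pattern alone pins down the FOE regardless of scene structure and of the measured flow magnitudes.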
ISBN: (Print) 9781728132945
Semantic segmentation has achieved huge progress by adopting deep Fully Convolutional Networks (FCN). However, the performance of FCN-based models relies heavily on large amounts of pixel-level annotations, which are expensive and time-consuming to obtain. To address this problem, it is a good choice to learn to segment with weak supervision from bounding boxes. How to make full use of the class-level and region-level supervision from bounding boxes is the critical challenge for this weakly supervised learning task. In this paper, we first introduce a box-driven class-wise masking model (BCM) to remove irrelevant regions of each class. Moreover, based on the pixel-level segment proposals generated from the bounding box supervision, we can calculate the mean filling rate of each class to serve as an important prior cue; we then propose a filling rate guided adaptive loss (FR-Loss) to help the model ignore wrongly labeled pixels in proposals. Unlike previous methods that directly train models with fixed individual segment proposals, our method can adjust the model learning with global statistical information, which helps reduce the negative impact of wrongly labeled proposals. We evaluate the proposed method on the challenging PASCAL VOC 2012 benchmark and compare it with other methods. Extensive experimental results show that the proposed method is effective and achieves state-of-the-art results.
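The filling-rate prior can be made concrete with a toy computation: the mean filling rate of a class is the average fraction of foreground pixels inside that class's boxes, and proposals whose filling rate deviates strongly from the class mean are treated as unreliable. The hard 0/1 weighting and the tolerance value below are simplifications for illustration; the paper's FR-Loss adapts the loss rather than applying a binary mask like this.

```python
# Toy segment proposals: (class_id, box_area, foreground_pixel_count).
# The third class-0 proposal has a suspiciously low filling rate.
proposals = [(0, 100, 60), (0, 120, 66), (0, 100, 10),
             (1, 200, 150), (1, 180, 140)]

# Mean filling rate per class: average foreground fraction inside the boxes.
rates = {}
for c in {p[0] for p in proposals}:
    rs = [fg / area for cc, area, fg in proposals if cc == c]
    rates[c] = sum(rs) / len(rs)

def weight(c, area, fg, tol=0.3):
    """Keep a proposal only if its filling rate is near the class mean."""
    return 1.0 if abs(fg / area - rates[c]) <= tol else 0.0

weights = [weight(*p) for p in proposals]
print(weights)  # [1.0, 1.0, 0.0, 1.0, 1.0]
```

The global statistic (the per-class mean) is what lets the outlier proposal be down-weighted even though, viewed in isolation, it looks like any other labeled box.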
Recent works have shown that the computational efficiency of 3D medical image (e.g. CT and MRI) segmentation can be impressively improved by dynamic inference based on slice-wise complexity. As a pioneering work, a dynamic architecture network for medical volumetric segmentation (i.e. Med-DANet [44]) has achieved a favorable accuracy-efficiency trade-off by dynamically selecting a suitable 2D candidate model from a pre-defined model bank for different slices. However, the issues of incomplete data analysis, high training costs, and the two-stage pipeline in Med-DANet call for further improvement. To this end, this paper further explores a unified formulation of the dynamic inference framework from the perspective of both the data itself and the model structure. For each slice of the input volume, our proposed method dynamically selects an important foreground region for segmentation based on the policy generated by our Decision Network and Crop Position Network. Besides, we propose to insert a stage-wise quantization selector into the employed segmentation model (e.g. U-Net) for dynamic architecture adaptation. Extensive experiments on BraTS 2019 and 2020 show that our method achieves comparable or better performance than previous state-of-the-art methods with much less model complexity. Compared with the previous methods Med-DANet and TransBTS, with dynamic and static architectures respectively, our framework improves model efficiency by up to nearly 4.1 and 17.3 times with comparable segmentation results on BraTS 2019. Code will be available at https://***/Rubics-Xuan/Med-DANet.
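The slice-wise routing idea can be sketched as a simple policy that assigns each slice a processing path by its estimated complexity. In the paper this decision is made by learned Decision and Crop Position networks with stage-wise quantization choices; the hand-written thresholds and path names below are purely illustrative stand-ins.

```python
def route(slice_fg_fraction):
    """Route a slice by its foreground fraction (a crude complexity proxy)."""
    if slice_fg_fraction == 0.0:
        return "skip"            # empty slice: no segmentation needed at all
    if slice_fg_fraction < 0.05:
        return "cropped-light"   # small target: crop the region, cheap model
    return "full-heavy"          # large target: full slice, full-capacity model

fractions = [0.0, 0.02, 0.30]
paths = [route(f) for f in fractions]
print(paths)  # ['skip', 'cropped-light', 'full-heavy']
```

The efficiency gains reported in the abstract come precisely from most slices taking the cheap paths while only hard slices pay for the heavy model.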
ISBN: (Print) 9781479918065
Drawing tests have long been used by practitioners and researchers for early detection of psychological and neurological impairments. These tests allow subjects to express themselves naturally, as opposed to an interview or a written assessment. The Bender Gestalt Test (BGT) is a well-known and established neurological test designed to detect signs of perceptual distortion. Subjects are shown a number of geometric patterns for reconstruction, and assessments are made by observing properties like rotation, angulation, simplification and closure difficulty. The manual scoring of the test, however, is a lengthy and time-consuming procedure, especially when a large number of subjects is to be analyzed. This paper proposes the application of image analysis techniques to automatically score a subset of the hand-drawn images in the BGT. A comparison of the scores reported by the automated system with those assigned by psychologists not only reveals the effectiveness of the proposed system but also reflects the huge research potential this area possesses.
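One of the properties listed above, rotation, is a natural candidate for image analysis: the principal-axis angle of a binary drawing can be read off its second-order central moments and compared against the template's angle. The sketch below is a generic moments-based illustration under assumed toy drawings, not the scoring rules of the proposed system.

```python
import numpy as np

def orientation(img):
    """Principal-axis angle (radians) of a binary drawing via central moments."""
    ys, xs = np.nonzero(img)
    x, y = xs - xs.mean(), ys - ys.mean()
    mu11, mu20, mu02 = (x * y).mean(), (x * x).mean(), (y * y).mean()
    return 0.5 * np.arctan2(2 * mu11, mu20 - mu02)

# Template: a horizontal bar; reconstruction: the same stroke drawn diagonally.
template = np.zeros((64, 64), dtype=np.uint8)
template[30:34, 10:54] = 1
drawn = np.zeros((64, 64), dtype=np.uint8)
for t in range(10, 54):          # a 45-degree line
    drawn[t, t] = 1

rot_error_deg = np.degrees(abs(orientation(drawn) - orientation(template)))
print(rot_error_deg)  # ~45: a large rotation, flagged as a distortion
```

A scoring rule would then threshold this angular deviation (and analogous measurements for the other properties) to produce a per-figure score comparable with the psychologists' manual one.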
The color and distribution of illuminants can significantly alter the appearance of a scene. The goal of color constancy (CC) is to remove the color bias introduced by the illuminants. Most existing CC algorithms assu...
The color and distribution of illuminants can significantly alter the appearance of a scene. The goal of color constancy (CC) is to remove the color bias introduced by the illuminants. Most existing CC algorithms assume a uniformly illuminated scene. However, more often than not, this assumption is an insufficient approximation of real-world illumination conditions (multiple light sources, shadows, interreflections, etc.). Thus, illumination should be determined locally, taking into consideration that multiple illuminants may be present. In this paper we investigate the suitability of adapting five state-of-the-art color constancy methods so that they can be used for local illuminant estimation. Given an arbitrary image, we segment it into superpixels of approximately similar color. Each of the methods is applied independently on every superpixel. For improved accuracy, these independent estimates are combined into a single illuminant-color value per superpixel. We evaluated different fusion methodologies. Our experiments indicate that the best performance is obtained by fusion strategies that combine the outputs of the estimators using regression.
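The per-superpixel estimate-then-fuse pipeline can be sketched with two classic estimators. Everything here is a simplified stand-in: regular grid patches replace real superpixels, only gray-world and max-RGB are run (the paper adapts five state-of-the-art methods), and the fusion is a componentwise median rather than the learned regression the abstract reports as best.

```python
import numpy as np

rng = np.random.default_rng(3)
H, W = 32, 32
illum = np.array([1.0, 0.8, 0.6])            # simulated non-white illuminant
img = rng.uniform(0.2, 0.8, (H, W, 3)) * illum

ests = []
for by in range(0, H, 8):                    # 4x4 grid of patches as "superpixels"
    for bx in range(0, W, 8):
        patch = img[by:by + 8, bx:bx + 8].reshape(-1, 3)
        gw = patch.mean(0)                   # gray-world estimate
        mp = patch.max(0)                    # max-RGB (white-patch) estimate
        for e in (gw, mp):
            ests.append(e / np.linalg.norm(e))

# Fusion: combine the independent estimates into one value per location.
fused = np.median(np.array(ests), axis=0)
fused /= np.linalg.norm(fused)
gt = illum / np.linalg.norm(illum)
angle = np.degrees(np.arccos(np.clip(fused @ gt, -1.0, 1.0)))
print(angle)  # small angular error w.r.t. the simulated illuminant
```

The angular error between the fused estimate and the ground-truth illuminant direction is the standard evaluation measure in this setting, which is why the sketch reports it.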