检索结果-内蒙古大学图书馆

International Workshop on Artificial Neural Networks and Intelligent Information processing, ANNIIP 2014 - In Conjuction with the International Conference on Informatics in Control, Automation and Robotics, ICINCO 2014

作者： Bodnár, Péter Grósz, Tamás Tóth, László Nyúl, László G. University of Szeged Department of Image Processing and Computer Graphics Szeged Hungary MTA-SZTE Research Group on Artificial Intelligence Hungarian Academy of Sciences University of Szeged Szeged Hungary

ISBN: (纸本)9789897580413

The reading process of visual codes consists of two steps, localization and data decoding. This paper presents a novel method for QR code localization using deep rectifier neural networks, trained directly in the JPEG DCT domain, thus making image decompression unnecessary. This approach is efficient with respect to both storage and computation cost, being convenient, since camera hardware can provide JPEG stream as their output in many cases. The structure of the neural networks, regularization, and training data parameters, like input vector length and compression level, are evaluated and discussed. The proposed approach is not exclusively for QR codes, but can be adapted to Data Matrix codes or other two-dimensional code types as well.

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

The 5th International Conference on Biomedical Engineering and Biotechnology (ICBEB 2016) Abstracts

引用

BMC MEDICAL IMAGING 2016年第SUPPL 1期16卷 65-65页

作者： [Anonymous] Department of MRI Shandong Medical Imaging Research Institute Affiliated to Shandong University Jinan Shandong 250021 People’s Republic of China Department of Interventional Radiology Shandong Provincial Hospital Affiliated to Shandong University Jinan Shandong 250021 People’s Republic of China College of Information Science and Technology Engr. Research Center of Digitized Textile & Fashion Tech. for Ministry of Education Donghua University Shanghai 201620 China Intelligent multimedia information processing Lab College of Software Northeastern University Shenyang Liaoning Province 110004 China Institute of Biomedical and Health Engineering Shenzhen Institutes of Advanced Technology Chinese Academy of Sciences Shenzhen 518055 China School of Computer Science and Technology Nanjing Normal University Nanjing China Department of Electrical Engineering The City College of New York CUNY New York USA Jiangsu Key Laboratory of 3D Printing Equipment and Manufacturing Nanjing China School of Electronic Science and Engineering Nanjing University Nanjing Jiangsu 210046 China College of Engineering Nanyang Technological University Singapore 639798 Singapore School of Electronic Information Shanghai Dianji University Shanghai China School of Natural Sciences and Mathematics Shepherd University Shepherdstown WV 25443 USA Davis College of Agriculture Natural Resources and Design West Virginia University Morgantown WV 26505 USA State Key Laboratory of Millimeter Waves Southeast University Nanjing 210096 China Center of Medical Physics and Technology Hefei Institutes of Physical Science Chinese Academy of Sciences Hefei China College of Agricultural and Life Sciences University of Florida Gainesville FL 32611 USA Courant Institute of Mathematical Sciences New York University New York NY 10012 USA Translational Imaging Division & MRI Unit Columbia University and New York State Psychiatric Institute New York NY 10032 USA Guangxi Key Laboratory of Manufacturing System & Adv

来源：评论

学校读者我要写书评

暂无评论

Visual simulation of cardiac beating motion with shape matching dynamics

Transactions of Japanese Society for Medical and Biological ...

引用

Transactions of Japanese Society for Medical and Biological Engineering 2015年第3期53卷 130-137页

作者： Ijiri, Takashi Ashihara, Takashi Umetani, Nobuyuki Koyama, Yuki Igarashi, Takeo Haraguchi, Ryo Yokota, Hideo Nakazawa, Kazuo Department of Media Technology Ritsumeikan University Japan Image Processing Research Team RIKEN Japan Department of Cardiovascular Medicine Heart Rhythm Center Shiga University of Medical Science Japan Autodesk Research Japan Department of Computer Science University of Tokyo Japan Department of Medical Informatics National Cerebral and Cardiovascular Center Japan Laboratory of Biomedical Science and Information Management National Cerebral and Cardiovascular Center Research Institute Japan

A shape matching dynamics (SMD) is a robust and efficient elastic model based on geometric constraints. This article introduces our study [1] that adopts SMD to visual simulation of cardiac beating motion. In our technique, a heart is represented by a tetrahedral mesh model and a local region is defined at each vertex by connecting its immediate neighbors. During the simulation, we first contract all local regions depending on predefined muscle fiber orientations and contraction rate. Then using SMD, we compute the global shape of the heart model so that it satisfies the contracted local regions. Our technique introduces a fiber-orientation-dependent weighting function to emulate an anisotropic stiffness of myocardium. Since our technique is based on SMD, it is possible to compute cardiac motion in real-time on a commercially available PC. © 2015, Japan Soc. of Med. Electronics and Biol. Engineering. All rights reserved.

关键词： Dynamics

来源：评论

学校读者我要写书评

暂无评论

Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection 15

Intrinsic spectral analysis based on temporal context featur...

引用

15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014

作者： Yang, Peng Leung, Cheung-Chi Xie, Lei Ma, Bin Li, Haizhou Shaanxi Provincial Key Laboratory of Speech and Image Information Processing School of Computer Science Northwestern Polytechnical University Xi'an China Institute for Infocomm Research ASTAR Singapore

We investigate the use of intrinsic spectral analysis (ISA) for query-by-example spoken term detection (QbE-STD). In the task, spoken queries and test utterances in an audio archive are converted to ISA features, and dynamic time warping is applied to match the feature sequence in each query with those in test utterances. Motivated by manifold learning, ISA has been pro- posed to recover from untranscribed utterances a set of nonlin- ear basis functions for the speech manifold, and shown with improved phonetic separability and inherent speaker indepen- dence. Due to the coarticulation phenomenon in speech, we propose to use temporal context information to obtain the ISA features. Gaussian posteriorgram, as an efficient acoustic rep- resentation usually used in QbE-STD, is considered a baseline feature. Experimental results on the TIMIT speech corpus show that the ISA features can provide a relative 13.5% improvement in mean average precision over the baseline features, when the temporal context information is used. Copyright © 2014 ISCA.

关键词： Spectrum analysis

来源：评论

学校读者我要写书评

暂无评论

QR code localization using deep neural networks

QR code localization using deep neural networks

引用

IEEE Workshop on Machine Learning for Signal processing

作者： Tamás Grósz Péter Bodnár László Tóth László G. Nyúl MTA-SZTE Research Group on Artificial Intelligence Hungarian Academy of Sciences and University of Szeged Department of Image Processing and Computer Graphics University of Szeged

Usage of computer-readable visual codes became common in our everyday life at industrial environments and private use. The reading process of visual codes consists of two steps, localization and data decoding. This paper introduces a new method for QR code localization using conventional and deep rectifier neural networks. The structure of the neural networks, regularization, and training parameters, like input vector properties, amount of overlapping at samples, and effect of different block sizes are evaluated and discussed. Results are compared to localization algorithms of the literature.

关键词： Training Vectors Artificial neural networks Neurons Visualization Discrete cosine transforms

来源：评论

学校读者我要写书评

暂无评论

A deep neural network approach for sentence boundary detection in broadcast news 15

A deep neural network approach for sentence boundary detecti...

引用

15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014

作者： Xu, Chenglin Xie, Lei Huang, Guangpu Xiao, Xiong Chng, Eng Siong Li, Haizhou Shaanxi Provincial Key Laboratory of Speech and Image Information Processing School of Computer Science Northwestern Polytechnical University China Temasek Laboratories NTU Nanyang Technological University Singapore Singapore School of Computer Engineering Nanyang Technological University Singapore Singapore Institute for Infocomm Research ASTAR Singapore Singapore

This paper presents a deep neural network (DNN) approach to sentence boundary detection in broadcast news. We extract prosodic and lexical features at each inter-word position in the transcripts and learn a sequential classifier to label these positions as either boundary or non-boundary. This work is realized by a hybrid DNN-CRF (conditional random field) architecture. The DNN accepts prosodic feature inputs and non-linearly maps them into boundary/non-boundary posterior probability outputs. Subsequently, the posterior probabilities are combined with lexical features and the integrated features are modeled by a linear-chain CRF. The CRF finally labels the inter-word positions as boundary or non-boundary by Viterbi decoding. Experiments show that, as compared with the state-of-the-art DTCRF approach [1], the proposed DNN-CRF approach achieves 16.7% and 4.1% reduction in NIST boundary detection error in reference and speech recognition transcripts, respectively. Copyright © 2014 ISCA.

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

Maximum likelihood thresholding algorithm based on four-parameter gamma distributions

Maximum likelihood thresholding algorithm based on four-para...

引用

International Conference on Electrical Engineering, Computing Science and Automatic Control (CCE)

作者： Peter De-Ford Geovanni Martinez Image Processing and Computer Vision Research Laboratory (IPCV-LAB) Universidad de Costa Rica San José Costa Rica

In this contribution, we present a segmentation algorithm based on thresholding to subdivide an intensity image in the regions of object and background. The optimal threshold is found by maximizing a likelihood function derived from a novel intensity probability density function model, which consists of the sum of two weighted four-parameter gamma distributions, as a more flexible alternative to currently used models consisting of the sum of two weighted two-parameter Gaussian distributions. According to our experiments with 132 images, the proposed algorithm is in average slightly better than the best found in the scientific literature, performing particularly good in low contrast images. The additional parameters and complexity of its likelihood function resulted in an increase of the processing time by a factor of 3, from 0.003 sec/image to 0.009 sec/image.

关键词： image segmentation Probability density function Approximation algorithms Gaussian distribution Histograms Shape Pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Fusion of iris and palmprint for multimodal biometric authentication

Fusion of iris and palmprint for multimodal biometric authen...

引用

Workshops on image processing Theory, Tools and Applications, IPTA

作者： Nassima Kihal Salim Chitroub Jean Meunier Department of Computer Science and Operations Research (DIRO) University of Montreal Montreal QC Canada Electronics and Computer Science Faculty Signal and Image Processing Laboratory Algiers Algeria

This paper presents a multimodal biometric system for authentication, based on the fusion of iris and palmprint. We propose an approach for feature extraction of each modality by using wavelet packet decomposition at four levels. This gives 256 packets which can generate a compact binary code. It is obtained from the first three highest energy peaks to compute an adapted threshold that enable to affect 0 or 1 to each wavelet packet. Different fusion strategies were tested at different levels: feature level, score level and error level. The first fusion is a simple concatenation of iris and palmprint codes. The second employs a weighted sum rule to matching scores. The third applies the Hamacher t-norm to the errors. The proposed approach and each fusion strategy were tested for their accuracy on the Casia iris database fused with the Casia palmprint database, and then with the PolyU database. The proposed approach for multimodal biometric system achieves a recognition improvement with each fusion method.

关键词： Iris recognition Wavelet packets Databases Feature extraction Iris Vectors

来源：评论

学校读者我要写书评

暂无评论

A Probabilistic Framework for Multitarget Tracking with Mutual Occlusions

A Probabilistic Framework for Multitarget Tracking with Mutu...

引用

IEEE Conference on computer Vision and Pattern Recognition

作者： Menglong Yang Yiguang Liu Longyin Wen Zhisheng You Stan Z. Li Key Laboratory of Fundamental Synthetic Vision Graphics and Image for National Defense School of Aeronautics and Astronautics & Computer Science Sichuan University Center for Biometrics and Security Research & National Laboratory of Pattern Recognition Institute of Automation Chinese Academy of Sciences

ISBN: (纸本)9781479951192

Mutual occlusions among targets can cause track loss or target position deviation, because the observation likelihood of an occluded target may vanish even when we have the estimated location of the target. This paper presents a novel probability framework for multitarget tracking with mutual occlusions. The primary contribution of this work is the introduction of a vectorial occlusion variable as part of the solution. The occlusion variable describes occlusion states of the targets. This forms the basis of the proposed probability framework, with the following further contributions: 1) Likelihood: A new observation likelihood model is presented, in which the likelihood of an occluded target is computed by referring to both of the occluded and occluding targets. 2) Priori: Markov random field (MRF) is used to model the occlusion priori such that less likely "circular" or "cascading" types of occlusions have lower priori probabilities. Both the occlusion priori and the motion priori take into consideration the state of occlusion. 3) Optimization: A realtime RJMCMC-based algorithm with a new move type called "occlusion state update" ispresented. Experimental results show that the proposed framework can handle occlusions well, even including long-duration full occlusions, which may cause tracking failures in the traditional methods.

关键词： Target tracking Probabilistic logic Approximation algorithms Cameras Proposals Computational modeling

来源：评论

学校读者我要写书评

暂无评论

An adaptive multiplicative decomposition of non stationary multi-temporal satellite images: Application to urban changes detection

An adaptive multiplicative decomposition of non stationary m...

引用

International image processing, Applications and Systems Conference (IPAS)

作者： Ali Ben Abbes Houcine Essid Imed Riadh Farah Vincent Barra Research Labortaory in Computer Integrated Documentation and Arabized-Documentiel Genuises and Software National School of Computer Science Manouba-Tunisia Department Image and Information Processing Telecom Bretagne Technopôle France Laboratory of Computer Modeling and System Optimization UMR CNRS 6158 Univcrsité Blaise Pascal AUBIERE CEDEX

Nowadays, the process of change detection is regarded as an outstanding way for urban planning and design. The major concern of this paper is to investigate the non-stationary character of multi-temporal time series. To overcome this problem, we propose an adaptive multiplicative decomposition of non-stationary multi-temporal satellite image, which allows to decompose the series into three components: trend, seasonal and random, to properly model the evolution of land cover. We carried several experiments to validate our approach based on Landsat images covering the region of “Tres Cantos-Madrid” in Spain. The obtained results show the effectiveness of our proposed method comparing to some conventional methods.

关键词： Time series analysis Market research Satellites Urban areas Additives Standards image processing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：