Search Results - Inner Mongolia University Library

World Haptics Conference

Author: C.W. Borst. Affiliation: Advanced Computer Studies, University of Louisiana, Lafayette, USA

We investigate predictive coding for reducing the amount of data communicated between a haptic controller and a host. This allows increased update rate, which potentially improves quality even if coding is lossy. A low-order predictive coding is investigated for a pneumatic force display. Due to human and device characteristics, some compression is possible without loss, although the technique is lossy in general. Lossy uniform and nonuniform quantizers are also investigated. An experiment was conducted to determine how much data reduction is possible before compression artifacts become detectable to users.

Keywords: predictive coding; Haptic interfaces; Decoding; Low pass filters; Computer displays; Bandwidth; Force sensors; Encoding; Pistons; Force control
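
The low-order predictive coding with a uniform quantizer mentioned in the abstract above can be illustrated with a minimal first-order sketch. This is not the author's implementation: the predictor order, the zero initial state, the `step` value, and the sample force readings are all assumptions chosen for illustration.

```python
def dpcm_encode(samples, step):
    """Encode samples as quantized prediction residuals (first-order DPCM)."""
    codes = []
    prediction = 0.0  # assumed initial predictor state, shared with the decoder
    for x in samples:
        residual = x - prediction
        index = round(residual / step)  # uniform quantizer: nearest multiple of step
        codes.append(index)
        # Predict from the reconstructed value, not the true one, so encoder
        # and decoder stay in sync and quantization error cannot accumulate.
        prediction += index * step
    return codes


def dpcm_decode(codes, step):
    """Rebuild the signal by accumulating the dequantized residuals."""
    reconstructed = []
    value = 0.0
    for index in codes:
        value += index * step
        reconstructed.append(value)
    return reconstructed


force = [0.0, 0.4, 1.1, 1.0, 0.2]  # hypothetical force readings
step = 0.5                          # hypothetical quantizer step
codes = dpcm_encode(force, step)
decoded = dpcm_decode(codes, step)
```

Because the encoder predicts from its own reconstruction, each decoded sample stays within half a quantizer step of the input; a larger `step` sends smaller indices at the cost of a larger error.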

BIOLOGICALLY PLAUSIBLE BSDT RECOGNITION OF COMPLEX IMAGES: THE CASE OF HUMAN FACES

INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2008, vol. 18, no. 6, pp. 527-545

Author: Gopych, Petro. Affiliation: Univ Power Syst USA Ukraine LLC, UA-61012 Kharkov, Ukraine

On the basis of recent binary signal detection theory (BSDT), optimal recognition algorithms for complex images are constructed and their optimal performance is calculated. A methodology for comparing BSDT predictions and measured human performance is developed and applied to explaining a particular face recognition experiment. The BSDT makes possible computer codes with recognition performance better than that in humans, and its fundamental discreteness is consistent with the experiment. Related neurobiological and behavioral effects are briefly discussed.

Keywords: Neural networks; predictive coding; hierarchical algorithms; psychometric functions; Neyman-Pearson objective; decision confidence; face recognition; human errors

On the minimum entropy of a mixture of unimodal and symmetric distributions

IEEE TRANSACTIONS ON INFORMATION THEORY, 2008, vol. 54, no. 7, pp. 3166-3174

Authors: Chen, Ting-Li; Geman, Stuart. Affiliations: Acad Sinica, Inst Stat Sci, Taipei 115, Taiwan; Brown Univ, Div Appl Math, Providence, RI 02912, USA

Progressive encoding of a signal generally involves an estimation step, designed to reduce the entropy of the residual of an observation over the entropy of the observation itself. Oftentimes the conditional distributions of an observation, given already-encoded observations, are well fit within a class of symmetric and unimodal distributions (e.g., the two-sided geometric distributions in images of natural scenes, or symmetric Paretian distributions in models of financial data). It is common practice to choose an estimator that centers, or aligns, the modes of the conditional distributions, since it is common sense that this will minimize the entropy, and hence the coding cost of the residuals. But with the exception of a special case, there has been no rigorous proof. Here we prove that the entropy of an arbitrary mixture of symmetric and unimodal distributions is minimized by aligning the modes. The result generalizes to unimodal and rotation-invariant distributions in R^n. We illustrate the result through some experiments with natural images.

Keywords: entropy coding; LOCO; lossless image compression; mixture distributions; predictive coding; symmetric distributions; unimodal distributions
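
The claim in the abstract above, that the entropy of a mixture of symmetric unimodal distributions is smallest when the modes are aligned, can be checked numerically on a small case. The sketch below is an illustration only, not the paper's proof or experiments: it mixes two two-sided geometric distributions on a truncated integer support, and the parameters `theta_a`, `theta_b`, `weight`, and `half_width` are arbitrary assumptions.

```python
import math


def two_sided_geometric(theta, center, support):
    """Probabilities proportional to theta**|k - center|, normalized on the support."""
    weights = [theta ** abs(k - center) for k in support]
    total = sum(weights)
    return [w / total for w in weights]


def entropy_bits(probabilities):
    """Shannon entropy in bits, skipping zero-probability cells."""
    return -sum(p * math.log2(p) for p in probabilities if p > 0)


def mixture_entropy(offset, theta_a=0.5, theta_b=0.8, weight=0.5, half_width=200):
    """Entropy of a two-component mixture whose modes are `offset` apart."""
    support = range(-half_width, half_width + 1)
    a = two_sided_geometric(theta_a, 0, support)
    b = two_sided_geometric(theta_b, offset, support)
    mix = [weight * pa + (1 - weight) * pb for pa, pb in zip(a, b)]
    return entropy_bits(mix)


aligned = mixture_entropy(0)
shifted = [mixture_entropy(d) for d in range(1, 11)]
print(aligned, min(shifted))
```

In this setup every shifted mixture has higher entropy than the aligned one, which matches the practice of choosing an estimator that centers the conditional modes before coding the residuals.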

Predicting dc using ac coefficients for JPEG coding

OPTICAL ENGINEERING, 2008, vol. 47, no. 2, pp. 027004-1 to 027004-5

Author: Lakhani, Gopal. Affiliation: Texas Tech Univ, Lubbock, TX 79409, USA

The JPEG baseline algorithm codes the dc of a block by giving its difference with the dc of the previous block. We propose to use ac coefficients for this purpose. Our method computes the difference of the sum of pixels of two boundary columns (or rows), one belonging to the current block and the other to a previous block, and then manipulates it in the discrete cosine transform (DCT) domain so that the average of the coded differences for the whole image is near zero. Experimental results show that our method reduces the average JPEG dc residual by about 75% for images compressed at the default quality level. The reduction is even higher for unquantized DCT blocks. (c) 2008 Society of Photo-optical Instrumentation Engineers.

Keywords: JPEG; discrete cosine transform; predictive coding
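
For context, the baseline dc coding that the abstract above sets out to improve can be sketched as follows. This shows only the previous-block differencing of the JPEG baseline, not the proposed ac-based predictor, whose details the abstract does not give. The function names and the sample dc values are hypothetical.

```python
def dc_differences(dc_values):
    """Baseline-style dc coding: each block's dc minus the previous block's dc."""
    differences = []
    previous = 0  # the dc predictor starts at zero
    for dc in dc_values:
        differences.append(dc - previous)
        previous = dc
    return differences


def dc_from_differences(differences):
    """Invert the differencing by accumulating the residuals."""
    dc_values = []
    previous = 0
    for diff in differences:
        previous += diff
        dc_values.append(previous)
    return dc_values


block_dc = [52, 55, 61, 59]  # hypothetical quantized dc coefficients
residuals = dc_differences(block_dc)
restored = dc_from_differences(residuals)
```

Neighboring blocks tend to have similar dc values, so the residuals are usually much smaller than the dc values themselves; a better predictor, such as the one the abstract proposes, aims to shrink them further.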

The effect of prior visual information on recognition of speech and sounds

CEREBRAL CORTEX, 2008, vol. 18, no. 3, pp. 598-609

Authors: Noppeney, Uta; Josephs, Oliver; Hocking, Julia; Price, Cathy J.; Friston, Karl J. Affiliations: Max Planck Inst Biol Cybernet, D-72076 Tubingen, Germany; Inst Neurol, Wellcome Dept Imaging Neurosci, London WC1N 3BG, England

To identify and categorize complex stimuli such as familiar objects or speech, the human brain integrates information that is abstracted at multiple levels from its sensory inputs. Using cross-modal priming for spoken words and sounds, this functional magnetic resonance imaging study identified 3 distinct classes of visuoauditory incongruency effects: visuoauditory incongruency effects were selective for 1) spoken words in the left superior temporal sulcus (STS), 2) environmental sounds in the left angular gyrus (AG), and 3) both words and sounds in the lateral and medial prefrontal cortices (IFS/mPFC). From a cognitive perspective, these incongruency effects suggest that prior visual information influences the neural processes underlying speech and sound recognition at multiple levels, with the STS being involved in phonological, AG in semantic, and mPFC/IFS in higher conceptual processing. In terms of neural mechanisms, effective connectivity analyses (dynamic causal modeling) suggest that these incongruency effects may emerge via greater bottom-up effects from early auditory regions to intermediate multisensory integration areas (i.e., STS and AG). This is consistent with a predictive coding perspective on hierarchical Bayesian inference in the cortex where the domain of the prediction error (phonological vs. semantic) determines its regional expression (middle temporal gyrus/STS vs. AG/intraparietal sulcus).

Keywords: cross-modal priming; dynamic causal modeling; effective connectivity; multisensory integration; predictive coding; semantics

Simulation of talking faces in the human brain improves auditory speech recognition

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, vol. 105, no. 18, pp. 6747-6752

Authors: von Kriegstein, Katharina; Dogan, Oezguer; Grueter, Martina; Giraud, Anne-Lise; Kell, Christian A.; Grueter, Thomas; Kleinschmidt, Andreas; Kiebel, Stefan J. Affiliations: UCL, Wellcome Trust Ctr Neuroimaging, London WC1N 3BG, England; Univ Newcastle, Sch Med, Newcastle Upon Tyne NE2 4HH, Tyne & Wear, England; Goethe Univ Frankfurt, Dept Neurol, D-60528 Frankfurt, Germany; Univ Vienna, Dept Psychol Basic Res, A-1010 Vienna, Austria; Ecole Normale Super, Dept Etud Cognit, F-75005 Paris, France; CEA, NeuroSpin, F-91401 Gif Sur Yvette, France; Inst Natl Sante & Rech Med, F-91401 Gif Sur Yvette, France

Human face-to-face communication is essentially audiovisual. Typically, people talk to us face-to-face, providing concurrent auditory and visual input. Understanding someone is easier when there is visual input, because visual cues like mouth and tongue movements provide complementary information about speech content. Here, we hypothesized that, even in the absence of visual input, the brain optimizes both auditory-only speech and speaker recognition by harvesting speaker-specific predictions and constraints from distinct visual face-processing areas. To test this hypothesis, we performed behavioral and neuroimaging experiments in two groups: subjects with a face recognition deficit (prosopagnosia) and matched controls. The results show that observing a specific person talking for 2 min improves subsequent auditory-only speech and speaker recognition for this person. In both prosopagnosics and controls, behavioral improvement in auditory-only speech recognition was based on an area typically involved in face-movement processing. Improvement in speaker recognition was only present in controls and was based on an area involved in face-identity processing. These findings challenge current unisensory models of speech processing, because they show that, in auditory-only speech, the brain exploits previously encoded audiovisual correlations to optimize communication. We suggest that this optimization is based on speaker-specific audiovisual internal models, which are used to simulate a talking face.

Keywords: fMRI; multisensory; predictive coding; prosopagnosia

Natural vision reveals regional specialization to local motion and to contrast-invariant, global flow in the human brain

CEREBRAL CORTEX, 2008, vol. 18, no. 3, pp. 705-717

Authors: Bartels, A.; Zeki, S.; Logothetis, N. K. Affiliations: Max Planck Inst Biol Cybernet, Dept Psychol Cognit Proc, D-72076 Tubingen, Germany; UCL, Dept Anat, Neurobiol Lab, London WC1E 6BT, England

Visual changes in feature movies, as in real life, can be partitioned into global flow due to self/camera motion, local/differential flow due to object motion, and residuals, for example, due to illumination changes. We correlated these measures with brain responses of human volunteers viewing movies in an fMRI scanner. Early visual areas responded only to residual changes, thus lacking responses to equally large motion-induced changes, consistent with predictive coding. Motion activated V5+ (MT+), V3A, medial posterior parietal cortex (mPPC) and, weakly, lateral occipital cortex (LOC). V5+ responded to local/differential motion and depended on visual contrast, whereas mPPC responded to global flow spanning the whole visual field and was contrast independent. mPPC thus codes for flow compatible with unbiased heading estimation in natural scenes and for the comparison of visual flow with nonretinal, multimodal motion cues in it or downstream. mPPC was functionally connected to anterior portions of V5+, whereas the laterally neighboring putative homologue of the lateral intraparietal area (LIP) connected with frontal eye fields. Our results demonstrate a progression of selectivity from local and contrast-dependent motion processing in V5+ toward global and contrast-independent motion processing in mPPC. The function, connectivity, and anatomical neighborhood of mPPC imply several parallels to monkey ventral intraparietal area (VIP).

Keywords: contrast; heading; LIP; motion; natural scenes; objects; predictive coding; V5/MT; VIP

IMAGE COMPRESSION USING HIGH ORDER WEDGELETS IN A GENERALIZED QUAD-TREE

15th IEEE International Conference on Image Processing (ICIP 2008)

Authors: Rahimi, Azar; Kassim, Ashraf A. Affiliation: Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore

ISBN: (print) 9781424417650

Edges provide critical information which enables viewers to better discern objects in images. Although transform-based image compression schemes have been successful, they are unable to efficiently represent 2D edges. Wedgelets capture geometrical structures in images by explicitly defining an edge. In this paper, we introduce high order wedgelets in a more generalized form of quad-tree partitioning to realize improved compression performance compared to existing compression methods.

Keywords: high order wedgelets; quadratic spline; quad-tree partitioning; predictive coding

Multiple-description predictive-vector quantization with applications to low bit-rate speech coding over networks

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, vol. 15, no. 3, pp. 749-755

Authors: Yahampath, Pradeepa; Rondeau, Paul. Affiliation: Univ Manitoba, Dept Elect & Comp Engn, Winnipeg, MB R3T 5V6, Canada

An algorithm for designing linear prediction-based two-channel multiple-description predictive-vector quantizers (MD-PVQs) for packet-loss channels is presented. This algorithm iteratively improves the encoder partition, the set of multiple description codebooks, and the linear predictor for a given channel loss probability, based on a training set of source data. The effectiveness of the designs obtained with the given algorithm is demonstrated using a waveform coding example involving a Markov source as well as vector quantization of speech line spectral pairs.

Keywords: multiple-description coding; predictive coding; speech coding; vector quantization

Selective Compression Algorithm of Facial Images Based on DWT

International Conference on Advanced Computer Theory and Engineering (ICACTE 2008)

Authors: Lu Xiaoqi; Zhang Baohua. Affiliation: Inner Mongolia Univ Sci & Technol, Sch Informat Engn, Baotou City 014010, Mongolia

ISBN: (print) 9780769534893

Image compression reduces time and cost in image storage without significant reduction of the image quality. This paper puts forward a wavelet-based predictive image coding algorithm, which has a higher coding rate than traditional coding algorithms. Building on this algorithm, the paper applies a selective image compression technique to facial images. The algorithm attains compression ratios from about ten to several tens and handles the transmission and storage problem well.

Keywords: Compression algorithms; Discrete wavelet transforms; Image coding; Frequency; Wavelet transforms; predictive coding; Wavelet analysis; Image storage; Humans; Approximation algorithms
