检索结果-内蒙古大学图书馆

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

作者： Ali, Hussnain Hong, Feng Hansen, John H. L. Tobey, Emily Univ Texas Dallas Ctr Robust Speech Syst Cochlear Implant Lab Erik Jonsson Sch Engn & Comp Sci Richardson TX 75083 USA

ISBN: (纸本)9781479928934

Spectral maxima sound coding algorithms, for example n-ofm strategies, used in commercial cochlear implant devices rely on selecting channels with the highest energy in each frequency band. This technique works well in quiet, but is inherently problematic in noisy conditions when noise dominates the target, and noise-dominant channels are mistakenly selected for stimulation. A new channel selection criterion is proposed to addresses this shortcoming which adaptively assigns weights to each time-frequency unit based on the formant location of speech and instantaneous signal to noise ratio. The performance of the proposed technique is evaluated acutely with three cochlear implant users in different noise scenarios. Results indicate that the proposed technique improves speech intelligibility and perception quality, particularly at low signal-to-noise ratio. Significance of the proposed technique lies in its ability to be integrated with the existing sound coding framework employed within commercial cochlear implant processors, making it easier to adapt for resource-limited and time critical devices.

关键词： Cochlear implants sound coding algorithms

来源：评论

学校读者我要写书评

暂无评论

A review of speech and audio coding standards (ITU-T, ETSI and ISO/MPEG)

引用

ANNALS OF TELECOMMUNICATIONS 2000年第9-10期55卷 425-441页

作者： Le Guyader, A Philippe, P Rault, JB France Telecom R&D DIH DIPS F-22307 Lannion France France Telecom R&D DIH HDM F-35512 Cesson Sevigne France

Speech and audio coding standards al-E defined in international organizations having wide range of activities, only part of them dealing with bit rate compression of speech and audio. The UIT (international Telecommunication Union) deals with interactive communication standards, the ETSI (European Telecommunications Standards Institute) with mobile communication standards in Europe while multimedia communication standard's are under the responsability of the ISO (International Organization Sor Standardization). After a brief description of the standardization mechanism, we will review the features of speech and audio compression schemes (bit rates, quality complexity and delay) and the main applications of these standards. A list of these compression standards, already;adopted or in the course of definition, will be provided for each normalization organization. Finally: this payer gives the orientations or trends which are emerging in the field of audio compression standardization.

关键词： review standardization standardization institution international institution international standard UIT ISO ETSI speech coding sound coding passband compression sound quality application telecommunication audiovisual

来源：评论

学校读者我要写书评

暂无评论

The MPEG-2 AAC coder explained to the signal processing experts

引用

ANNALS OF TELECOMMUNICATIONS 2000年第9-10期55卷 442-461页

作者： Derrien, O Larbi, S Guimares, MP Moreau, N Ecole Natl Super Telecommun Bretagne F-75634 Paris 13 France ENIT Lab Syst Commun Tunis 1002 Tunisia Univ Paris 05 F-75270 Paris 06 France

The MPEG-2 Advanced Audio Coder is the latest issue of the MPEG audio encoders/decoders family whose most popular version is known as MP3. It gathers many of the latest highly efficient sound compression techniques in a quite classically structured coder. The main part is based or a Discrete Cosine Transform with variable resolution. The output from this filterbank is compressed by the combination of an adaptive bit allocation module, according to frequency subbands, and a set of noiseless Huffman codebooks. Bit allocation is controlled by a psychoacoustic model which determines an audibility threshold far signal distortion in the frequency domain. This article intends to explain the ISO standard without replacing it, and also to be a general introduction to perceptual audio coding.

关键词： speech coding sound coding passband compression didactic paper psychoacoustics coder standardization system architecture cosine transformation discrete transformation hearing signal quantization variable length code

来源：评论

学校读者我要写书评

暂无评论

Low delay coder (<25 ms) of wideband audio (20 Hz-15 kHz) scalable from 64 to 32 kbit/s

引用

ANNALS OF TELECOMMUNICATIONS 2000年第9-10期55卷 493-506页

作者： Moreau, N Dymarski, P Ecole Natl Super Telecommun Bretagne TSI F-75634 Paris 13 France Warsaw Univ Technol PL-00665 Warsaw Poland

A low delay coder for speech and music signals sampled at 32 kHz is described. Its algorithmic delay does not exceed 25 ms which enables audio conferencing applications without echo cancellation. Its bot rate is scalable between 64 and 32 kbits/s by steps of 8 kbits/s. The transmitter issues the binary code at 64 kbits/s with lower bit rate codes embedded in it. The receiver may operate at lower bit rates with gradual loss of quality. The proposed coder is based on a mixed scheme: the adopted solution contains elements from the CELP speech coder and frequency domain where bit allocation is calculated and transform coefficients are quantized. A first solution based on the DFT is discussed, then a second solution based on a MDCT with small overlap is applied. The quantization of these coefficients is done in the following way. First, a prediction of the whole spectrum is applied. Then, a mean-removed gain-shape split VQ is used for amplitude spectrum quantization and a hierarchical 2-dimensional VQ is used for phase spectrum quantization stage, each codeword describing the selected vector index is split into parts corresponding to different bit rates. Due to the hierarchical codebook structure, truncated indices may be used, without much affecting the signal quality. Simulation results are presented and the robustness of the proposed coder is examined.

关键词： speech coding sound coding hierarchical coding sound quality perception passband compression coder music signal spectrum signal quantizing scability

来源：评论

学校读者我要写书评

暂无评论

A scalable three bit-rates 8-14.1-24 kbit/s audio coder

引用

ANNALES DES TELECOMMUNICATIONS-ANNALS OF TELECOMMUNICATIONS 2000年第9-10期55卷 483-492页

作者： Taddei, H Massaloux, D Le Guyader, A France Telecom R&D DIH DIPS F-22307 Lannion France

This paper presents a scalable three bit-rates (8, 14.1 and 24 il kbit/s) coder: For the two embedded lowest ones, operating in the telephone bandwidth, CELP coding techniques are used. For the highest rate, that both improves narrowband quality and extends the band to [50-7000Hz], transform coding techniques are used. The main applications deal with transmission over network,with no guaranteed QoS.

关键词： speech coding sound coding coder decoder scalability hierarchical coding passband compression sound quality

来源：评论

学校读者我要写书评

暂无评论

Modified synaptic dynamics predict neural activity patterns in an auditory field within the frontal cortex

引用

EUROPEAN JOURNAL OF NEUROSCIENCE 2020年第4期51卷 1011-1025页

作者： Lopez-Jury, Luciana Mannel, Adrian Garcia-Rosales, Francisco Hechavarria, Julio C. Goethe Univ Inst Zellbiol & Neurowissensch Max von Laue Str 13 D-60438 Frankfurt Germany

Frontal areas of the mammalian cortex are thought to be important for cognitive control and complex behaviour. These areas have been studied mostly in humans, non-human primates and rodents. In this article, we present a quantitative characterization of response properties of a frontal auditory area responsive to sound in the brain of Carollia perspicillata, the frontal auditory field (FAF). Bats are highly vocal animals, and they constitute an important experimental model for studying the auditory system. We combined electrophysiology experiments and computational simulations to compare the response properties of auditory neurons found in the bat FAF and auditory cortex (AC) to simple sounds (pure tones). Anatomical studies have shown that the latter provides feedforward inputs to the former. Our results show that bat FAF neurons are responsive to sounds, and however, when compared to AC neurons, they presented sparser, less precise spiking and longer-lasting responses. Based on the results of an integrate-and-fire neuronal model, we suggest that slow, subthreshold, synaptic dynamics can account for the activity pattern of neurons in the FAF. These properties reflect the general function of the frontal cortex and likely result from its connections with multiple brain regions, including cortico-cortical projections from the AC to the FAF.

关键词： auditory cortex bats integrate-and-fire prefrontal cortex sound coding

来源：评论

学校读者我要写书评

暂无评论

HCN channels in the mammalian cochlea: Expression pattern, subcellular location, and age-dependent changes

引用

JOURNAL OF NEUROSCIENCE RESEARCH 2021年第2期99卷 699-728页

作者： Luque, Maria Schrott-Fischer, Anneliese Dudas, Jozsef Pechriggl, Elisabeth Brenner, Erich Rask-Andersen, Helge Liu, Wei Glueckert, Rudolf Med Univ Innsbruck Dept Otorhinolaryngol Anichstr 35 A-6020 Innsbruck Austria Med Univ Innsbruck Div Clin & Funct Anat Dept Anat Histol & Embryol Innsbruck Austria Uppsala Univ Hosp Sect Otolaryngol Dept Surg Sci Head & Neck Surg Uppsala Sweden Univ Clin Innsbruck Tirol Kliniken Innsbruck Austria

Neuronal diversity in the cochlea is largely determined by ion channels. Among voltage-gated channels, hyperpolarization-activated cyclic nucleotide-gated (HCN) channels open with hyperpolarization and depolarize the cell until the resting membrane potential. The functions for hearing are not well elucidated and knowledge about localization is controversial. We created a detailed map of subcellular location and co-expression of all four HCN subunits across different mammalian species including CBA/J, C57Bl/6N, Ly5.1 mice, guinea pigs, cats, and human subjects. We correlated age-related hearing deterioration in CBA/J and C57Bl/6N with expression levels of HCN1, -2, and -4 in individual auditory neurons from the same cohort. Spatiotemporal expression during murine postnatal development exposed HCN2 and HCN4 involvement in a critical phase of hair cell innervation. The huge diversity of subunit composition, but lack of relevant heteromeric pairing along the perisomatic membrane and axon initial segments, highlighted an active role for auditory neurons. Neuron clusters were found to be the hot spots of HCN1, -2, and -4 immunostaining. HCN channels were also located in afferent and efferent fibers of the sensory epithelium. Age-related changes on HCN subtype expression were not uniform among mice and could not be directly correlated with audiometric data. The oldest mice groups revealed HCN channel up- or downregulation, depending on the mouse strain. The unexpected involvement of HCN channels in outer hair cell function where HCN3 overlaps prestin location emphasized the importance for auditory function. A better understanding may open up new possibilities to tune neuronal responses evoked through electrical stimulation by cochlear implants.

关键词： auditory development auditory neuron diversity axon initial segment HCN channels prestin RRID AB_90725 RRID AB_2039906 RRID AB_2302038 RRID AB_2313584 RRID AB_2313726 RRID AB_2336419 RRID AB_2336420 RRID AB_2336790 RRID AB_2340477 RRID AB_2340452 RRID AB_2340593 RRID AB_2341028 RRID AB_2617143 RRID AB_2756625 RRID AB_2756742 RRID SCR_002865 RRID SCR_013652 RRID SCR_014823 sound coding spiral ganglion neurons voltage gated

来源：评论

学校读者我要写书评

暂无评论

Testing the methods of sound signal compression 4

Testing the methods of sound signal compression

引用

4th EURASIP Conference on Video, Image Processing and Multimedia Communications

作者： Zagursky, V Riekstinch, A Zarumba, I Latvian State Univ Inst Elect & Comp Sci LV-1006 Riga Latvia

ISBN: (纸本)9531840547

Three sound signal compression methods are being offered allowing (depending on the application aim) finding the best trade-off between the compression efficiency and the realization complexity. The testing of the methods proposed was done on a half-nature simulation system including personal computers 10 Mbit/sec Ethernet LAN and the sound signal input/output means.

关键词： sound signal signal compression sound coding

来源：评论

学校读者我要写书评

暂无评论

Spatial release from masking in bilateral cochlear implant users listening to the temporal limits encoder strategy 23

Spatial release from masking in bilateral cochlear implant u...

引用

23rd International Congress on Acoustics: Integrating 4th EAA Euroregio, ICA 2019

作者： Kan, Alan Meng, Qinglin University of Wisconsin-Madison United States South China University of Technology China

ISBN: (纸本)9783939296157

Binaural hearing benefits with bilateral cochlear implants (CI) are usually smaller than with normal hearing (NH) in the same tasks. This gap in performance has typically been attributed to a lack of coordinated stimulation between ears and the high stimulation rates used in clinical processors. These factors hinder sensitivity to interaural timing differences (ITDs);an important binaural cue for NH listeners. The Temporal Limits Encoder (TLE) strategy was originally designed to encode unilateral low-frequency temporal fine structure pitch cues into the signal envelope of CIs. However, TLE also lowers the stimulation rate on some channels which may potentially provide useable ITD cues. Here, we measured spatial release from masking (SRM) in bilateral CI users listening to TLE vs Advanced Combinational Encoder (ACE) strategy to determine if TLE provides a benefit. The CCiMobile research platform was used for testing. Results from eight listeners showed comparable word recognition performance in quiet and co-located conditions for TLE and ACE, even with a short acclimatization period with TLE. In the spatially-separated condition, performance across the group was more similar with TLE than ACE, and more listeners showed SRM. These results indicate that TLE has the potential for improving binaural hearing benefits for CI users. © 2019 Proceedings of the International Congress on Acoustics. All rights reserved.

关键词： Cochlear Implants sound coding Spatial Release from Masking

来源：评论

学校读者我要写书评

暂无评论

Image-guided customization of frequency-place mapping in cochlear implants 40

Image-guided customization of frequency-place mapping in coc...

引用

40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015

作者： Ali, Hussnain Noble, Jack H. Gifford, Rene H. Labadie, Robert F. Dawant, Benoit M. Hansen, John H.L. Tobey, Emily Department of Electrical Engineering University of Texas at Dallas Richardson United States Department of Electrical Engineering and Computer Science Vanderbilt University Nashville United States Department of Hearing and Speech Sciences Vanderbilt University Medical Center Nashville United States Department of Otolaryngology Vanderbilt University Medical Center Nashville United States Department of Behavioral and Brain Sciences University of Texas at Dallas Richardson United States

ISBN: (纸本)9781467369978

Multi-channel cochlear implants (CI) leverage frequency based cochlear tonotopic mapping to map acoustic information to the cochlear place of stimulation which is primarily determined by electrode locations. Despite the fact that electrode locations within the cochlea are unique to each patient, the acoustic frequencies assigned to the electrodes by the CI processor are determined generically, resulting in a mismatch between intended and actual pitch perception. This is known to be a limiting factor for hearing outcomes with CIs. In this study, we propose a novel, image-guided CI processor programming strategy to select more optimal, patient-customized frequency assignments. The performance of the proposed strategy was evaluated using vocoder-based simulations with ten normal hearing listeners. In our simulations, our strategy results in significantly better speech recognition scores than the standard clinical strategy. © 2015 IEEE.

关键词： algorithms Cochlear implants sound coding

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：