Search Results - Inner Mongolia University Library

2005 International Conference on Auditory-Visual Speech Processing, AVSP 2005

Authors: Lucey, Simon; Lucey, Patrick. Advanced Multimedia Processing Laboratory, Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh PA 15213, United States; Speech Audio Image and Video Research Laboratory, Queensland University of Technology, GPO Box 2424, Brisbane 4001, Australia

Motivated by the success of free-parts based representations in face recognition [1], we have attempted to address some of the problems associated with applying such a philosophy to the task of speaker-independent automatic speech reading. Hitherto, a major problem with canonical area-based approaches in automatic speech reading has been the intrinsic lack of training observations due to the visual speech modality’s low sample rate and large variability in appearance. We believe a free-parts representation can overcome many of these limitations due to its natural ability to generalize by producing many observations from a single mouth image, whilst still preserving the ability to discriminate between various visual-speech units. This approach additionally requires a modification to traditional techniques employed for the estimation of hidden Markov models (HMMs), whose resultant models we currently refer to as free-parts HMMs (FP-HMMs). Results will be presented on the CUAVE audiovisual speech database. © AVSP 2005. All rights reserved.

Keywords: Hidden Markov models

DSA image registration based on multiscale Gabor filters and mutual information

ICIA 2005: 2005 International Conference on Information Acquisition

Authors: Cao, Zhiguo; Liu, Xiaoxiao; Peng, Bo; Moon, Yiu-Sang. Key Laboratory Huazhong University of Science and Technology Ministry of Education for Image Processing and Intelligent Control, Wuhan 430074, China; Department of Computer Science and Engineering, Chinese University of Hong Kong, Shatin, N.T., Hong Kong

ISBN: (print) 0780393031

In clinical practice, digital subtraction angiography (DSA) is a powerful technique for the visualization of blood vessels in X-ray image sequences. Unlike traditional DSA image registration processes, our proposed method selects the control points from the vessel centerlines using multiscale Gabor filters, and mutual information (MI) is then taken as the similarity criterion to find the correspondences. Experimental results demonstrate that our algorithm efficiently yields satisfactory registration results for DSA images. © 2005 IEEE.
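The abstract above uses mutual information as the similarity criterion between image regions. As a minimal sketch of what such a score looks like (not the authors' implementation), the following computes MI from a joint intensity histogram of two equally sized patches. The function name, the bin count, and the flat-list patch format are all assumptions made for illustration; control-point selection with Gabor filters is not shown.

```python
from collections import Counter
from math import log2


def mutual_information(a, b, bins=16, levels=256):
    """Mutual information (in bits) between two equally sized grayscale patches.

    a, b   : flat sequences of integer intensities in [0, levels)
    bins   : number of histogram bins per patch (an illustrative choice)
    levels : number of possible intensity values
    """
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("patches must be non-empty and the same size")

    # Quantize intensities into coarse bins so the joint histogram is not too sparse.
    width = max(1, levels // bins)
    qa = [min(v // width, bins - 1) for v in a]
    qb = [min(v // width, bins - 1) for v in b]

    n = len(qa)
    joint = Counter(zip(qa, qb))   # joint histogram counts
    count_a = Counter(qa)          # marginal counts for patch a
    count_b = Counter(qb)          # marginal counts for patch b

    mi = 0.0
    for (x, y), c in joint.items():
        # p(x, y) * log2( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * log2(c * n / (count_a[x] * count_b[y]))
    return mi


# Two patches whose intensities differ but vary together score high ...
aligned = mutual_information([0, 0, 255, 255], [10, 10, 200, 200], bins=2)
# ... while patches with unrelated structure score zero.
unrelated = mutual_information([0, 0, 255, 255], [0, 255, 0, 255], bins=2)
```

In a registration loop, a score like this would be evaluated for each candidate displacement around a control point, and the displacement with the highest MI taken as the correspondence.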

Keywords: Medical imaging

Feature selection for high dimensional face image using self-organizing maps

9th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2005

Authors: Tan, Xiaoyang; Chen, Songcan; Zhou, Zhi-Hua; Zhang, Fuyan. National Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China; Department of Computer Science and Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China; Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai 200433, China

ISBN: (print) 3540260765

While feature selection is very difficult for high-dimensional, unstructured data such as face images, it may be much easier if the data can be faithfully transformed into a lower-dimensional space. In this paper, a new method is proposed to transform high-dimensional face images into a low-dimensional SOM topological space, and then to identify important local features of face images for face recognition automatically, using simple statistics computed from the class distribution of the face image data. The effectiveness of the proposed method is demonstrated by experiments on the AR face database, which reveal that up to 80% of local features can be pruned with only a slight loss of classification accuracy. © Springer-Verlag Berlin Heidelberg 2005.

Keywords: Image processing

Using a Free-Parts Representation for Visual Speech Recognition

Proceedings of the Digital Image Computing: Techniques and Applications (DICTA)

Authors: P. Lucey; S. Lucey; S. Sridharan. Queensland University of Technology, Speech Audio Image and Video Research Laboratory, Brisbane, Australia; Advanced Multimedia Processing Laboratory, Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, USA

Motivated by the success of free-parts based representations in face recognition, we have attempted to address some of the problems associated with applying such a philosophy to the task of speaker-independent visual speech recognition. A major problem with canonical area-based approaches in automatic visual speech recognition is their dependence on correctly locating and tracking the speaker’s region of interest (ROI). By employing a free-parts representation, we assume that the position/structure of patches within the mouth image can be relaxed so they can "freely" move to varying extents, hence reducing the influence of the front-end effect. In this paper, we show that by using a free-parts representation we gain some robustness against the problem of ROI localisation and tracking compared to current area-based feature extraction techniques such as the discrete cosine transform (DCT). We also expose the importance of representation for the task of visual speech recognition, as highlighted by the poor results that current representations yield.

Keywords: Speech recognition; Mouth; Hidden Markov models; Laboratories; Face recognition; Feature extraction; Discrete cosine transforms; Robustness; Automatic speech recognition; Pixel

Investigating manifold learning algorithms based on magnification factors and principal spread directions

Jisuanji Xuebao/Chinese Journal of Computers, 2005, vol. 28, no. 12, pp. 2000-2009

Authors: He, Li; Zhang, Jun-Ping; Zhou, Zhi-Hua. Shanghai Key Laboratory of Intelligent Information Processing, Department of Computer Science and Engineering, Fudan University, Shanghai 200433, China; Key Laboratory of Complex Systems and Intelligence Science, Chinese Academy of Sciences, Shanghai 200433, China; Department of Computer Science and Engineering, School of Mathematical Sciences, Fudan University, Shanghai 200433, China; National Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China

As a new unsupervised learning technique, manifold learning has captured the attention of many researchers in machine learning and the cognitive sciences. The major algorithms include Isometric Mapping (ISOMAP) and Locally Linear Embedding (LLE). These approaches can effectively discover the intrinsic dimensions of nonlinear high-dimensional data and help researchers analyze the data better. However, few reports address how to quantitatively analyze the relationship between the intrinsic dimensions and the observation space, and this gap may have hindered further work in manifold learning. This paper focuses on two manifold learning algorithms (ISOMAP and LLE) and discusses magnification factors and principal spread directions from the observation space to the intrinsic low-dimensional space. A corresponding algorithm is also proposed. Experiments show the effectiveness and advantages of the approach.

Keywords: Learning algorithms

DSA image registration based on multiscale Gabor filters and mutual information

International Conference on Information and Automation (ICIA)

Authors: Zhiguo Cao; Xiaoxiao Liu; Bo Peng; Yiu-Sang Moon. Key Laboratory of Ministry of Education for Image Processing and Intelligent Control, Institute for Pattern Recognition and Artificial Intelligence, Huazhong University of Science and Technology, Wuhan, China; Department of Computer Science and Engineering, Chinese University of Hong Kong, New Territories, Hong Kong, China

Keywords: Image registration; Gabor filters; Mutual information; Biomedical imaging; Angiography; Visualization; Displacement control; Blood vessels; Image sequences; Pixel

A computer-aided system for mass detection and classification in digitized mammograms

Biomedical Engineering - Applications, Basis and Communications, 2005, vol. 17, no. 5, pp. 215-228

Authors: Yang, Sheng-Chih; Wang, Chuin-Mu; Chung, Yi-Nung; Hsu, Giu-Cheng; Lee, San-Kan; Chung, Pau-Choo; Chang, Chein-I. Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan; Department of Computer Science and Information Engineering, National Chin Yi Institute of Technology, Taichung, Taiwan; Department of Electrical Engineering, Da-Yeh University, Chunghua, Taiwan; Department of Radiology, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan; Administrative Office, Suao Veterans Hospital, Yilan, Taiwan; Remote Sensing Signal and Image Processing Laboratory, Department of Computer Science and Electrical Engineering, University of Maryland, Baltimore, MD, United States

This paper presents a computer-assisted diagnostic system for mass detection and classification, which performs mass detection on regions of interest followed by benign-malignant classification of the detected masses. For mass detection to be effective, a sequence of preprocessing steps is designed to enhance the intensity of a region of interest, remove noise effects, and locate suspicious masses using five texture features generated from the spatial gray level difference matrix (SGLDM) and fractal dimension. Finally, a probabilistic neural network (PNN) coupled with entropic thresholding techniques is developed for mass extraction. Since the shapes of masses are crucial in distinguishing benign from malignant cases, four shape features are further generated and combined with the five features previously used in mass detection, and the combined features are fed to another PNN for mass classification. To evaluate the system, a data set collected at the Taichung Veterans General Hospital, Taiwan, R.O.C. was used. The results are encouraging and show the promise of our system.

Keywords: Computer aided analysis

An automated seamless mosaicing system of multi-charge coupled devices of panchromatic data

Journal of the Indian Society of Remote Sensing, 2004, vol. 32, no. 1, pp. 103-111

Authors: Ramakrishnan, R.; Manthira Moorthi, S.; Padmanabhan, N.; Gupta, P. Data Products Software Division, Signal and Image Processing Group, Space Applications Centre (ISRO), Ahmedabad-380 015, India; Department of Computer Science and Engineering, Indian Institute of Technology Kanpur, Kanpur 208 016, India

Panchromatic data with a pixel resolution of 5.8 m obtained from the IRS-1C and IRS-1D satellites have proved very useful for mapping purposes. One of the popular data products is the 70 km swath mosaic, which is covered by a combination of 3 CCD line sensors, each with 4096 pixels. Because each CCD line sensor images at a different time, mosaicing the three strips together raises geometric problems. In this paper, we present the design elements of a system that caters to the need for accurate and automatic multi-strip image registration without any second resampling of the data. The systematic geometric correction grid mapping is improved to facilitate accurate mosaicing through an automatic image registration task that makes use of the overlap data within image strips, and image registration is achieved to sub-pixel level.

The Cook projection index estimation using the wavelet kernel function

International Symposium on Neural Networks, ISNN 2004

Authors: Lin, Wei; Zheng, Tian; He, Fan; Wen, Xian-Bin. Department of Applied Mathematics, Northwestern Polytechnical University, Xi’an 710072, China; Department of Computer Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China; Key Laboratory of Education Ministry for Image Processing and Intelligent Control, Huazhong University of Science and Technology, Wuhan 430074, China

ISBN: (print) 3540228411

The key procedure of exploratory projection pursuit is to optimize a criterion function, which is called the projection pursuit index. This paper gives the Cook family index estimated by the wavelet kernel function. The asymptotic unbiasedness and the convergence property of the projection index are proved. Because this kind of projection index can be computed quickly, it is suited to processing large data sets. Some results of the projection index based on wavelet kernel estimation are compared with those of Gaussian kernel estimation. © Springer-Verlag Berlin Heidelberg 2004.

Keywords: Artificial intelligence

An efficient algorithm and implementation for residue to binary number conversion

International Conference on Communications, Circuits and Systems (ICCCAS)

Authors: Chengyi Xiong; Zhirong Gao; Jinwen Tian. Key Laboratory of Education Commission for Image Processing and Intelligent Control, Institute of Pattern Recognition & Artificial Intelligence, Huazhong University of Science and Technology, Wuhan, China; Department of Computer Science, Wuhan University of Science & Engineering, Wuhan, China; Key Laboratory of Education Commission for Image Processing, Huazhong University of Science & Technology, China

The residue number system (RNS) has computational advantages in addition and multiplication compared with weighted number systems, such as the binary number system (BNS), since operations on residue digits are performed independently and can therefore run in parallel. It is thus widely used in digital signal processing and related areas. Since residue-to-binary conversion is critical and difficult for the practicality of RNS, this paper presents a novel residue-to-binary (R/B) conversion algorithm for the restricted moduli set $(2^n - 1,\ 2^n,\ 2^n + 1)$, based on exploiting the periodicity of modulo $(2^n \pm 1)$ operations. A new 2n-bit adder based R/B converter is also proposed. The performance comparison demonstrates that the new converter is faster and requires less area than those reported in the previous literature.
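To make the moduli set and the periodicity property in the abstract above concrete, here is a small software sketch. It is not the paper's adder-based converter: the reverse conversion shown is the textbook Chinese Remainder Theorem, used only as a reference, and all function names are hypothetical. The folding helper illustrates why modulo 2^n - 1 is cheap: since 2^n ≡ 1, an integer can be reduced by adding its n-bit chunks.

```python
def moduli_set(n):
    """Return the restricted moduli set (2**n - 1, 2**n, 2**n + 1)."""
    if n < 2:
        raise ValueError("n must be at least 2 so that all moduli exceed 1")
    return (1 << n) - 1, 1 << n, (1 << n) + 1


def to_residues(x, n):
    """Binary-to-residue conversion: one residue per modulus."""
    return tuple(x % m for m in moduli_set(n))


def from_residues(residues, n):
    """Residue-to-binary conversion via the textbook CRT (reference only)."""
    moduli = moduli_set(n)
    dynamic_range = moduli[0] * moduli[1] * moduli[2]
    x = 0
    for r_i, m_i in zip(residues, moduli):
        partial = dynamic_range // m_i
        # pow(partial, -1, m_i) is the modular inverse (Python 3.8+).
        x += r_i * partial * pow(partial, -1, m_i)
    return x % dynamic_range


def fold_mod_2n_minus_1(x, n):
    """Reduce x modulo 2**n - 1 by repeatedly adding n-bit chunks.

    Works because 2**n is congruent to 1 modulo 2**n - 1.
    """
    mask = (1 << n) - 1
    while x > mask:
        x = (x & mask) + (x >> n)
    return 0 if x == mask else x


residues = to_residues(1234, 4)          # moduli are 15, 16, 17
recovered = from_residues(residues, 4)   # round-trips back to 1234
```

For n = 4 the dynamic range is 15 × 16 × 17 = 4080, so every integer from 0 to 4079 has a unique residue triple; 1234 maps to (4, 2, 10).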

Keywords: Signal processing algorithms; Image converters; Educational institutions; Computer science; Concurrent computing; Digital signal processing; Arithmetic; Laboratories; Image processing; Intelligent control
