Search Results - Inner Mongolia University Library

A novel monochromatic cue for detecting regions of visual interest

IMAGE AND VISION COMPUTING, 2014, Vol. 32, No. 6-7, pp. 405-413

Authors: Jung, Chanho; Kim, Wonjun; Yoo, Seungwoo; Kim, Changick. Affiliations: ETRI IT Convergence Technol Res Lab Taejon 305700 South Korea; SAIT Future IT Res Ctr Adv Media Lab Seoul South Korea; Qualcomm Res Korea Seoul South Korea; Korea Adv Inst Sci & Technol Dept Elect Engn Taejon 305732 South Korea

Finding regions of interest (ROIs) is a fundamentally important problem in the area of computer vision and image processing. Previous studies addressing this issue have mainly focused on investigating chromatic cues to characterize visually salient image regions, while less attention has been devoted to monochromatic cues. The purpose of this paper is to study monochromatic cues, which have the potential to complement chromatic cues, for the detection of ROIs in an image. This paper first presents a taxonomy of existing ROI detection approaches using monochromatic cues, ranging from well-known algorithms to the most recently published techniques. We then propose a novel monochromatic cue for ROI detection. Finally, a comparative evaluation has been conducted on large-scale, challenging test sets of real-world natural scenes. Experimental results demonstrate that the use of our proposed monochromatic cue yields a more accurate identification of ROIs. This paper serves as a benchmark for future research on this particular topic and a stepping stone for developers and practitioners interested in adopting monochromatic cues in ROI detection systems and methodologies. (C) 2014 Elsevier B.V. All rights reserved.

Keywords: Regions of interest (ROIs); Monochromatic cues; Visual attention; Taxonomy; Performance comparison of algorithms and systems


Research and Development of Algorithms for Objects Detection on Images of Industrial Materials


International Conference on Mechanical Engineering, Automation and Control Systems (MEACS)

Authors: Antonov, L. V.; Orlov, A. A. Affiliation: Vladimir State Univ Dept Phys & Appl Math Murom Inst Branch Fed State Budgetary Educ Inst H Murom Russia

ISBN: (print) 9781479962211

The paper outlines the main current research directions in the processing of highly detailed nanoscale images. A system of algorithms has been developed for solving a wide range of tasks in the structural analysis of images of industrial materials. The article presents experimental results for the developed algorithms, along with the results of processing images of industrial materials.

Keywords: Image processing; Industrial materials


GPU-Accelerated Interactive Visualization and Planning of Neurosurgical Interventions


IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2014, Vol. 34, No. 1, pp. 22-31

Authors: Rincon-Nigro, Mario; Navkar, Nikhil V.; Tsekos, Nikolaos V.; Deng, Zhigang. Affiliations: Univ Houston Dept Comp Sci Houston TX 77004 USA; Univ Houston Houston TX 77004 USA

Advances in computational methods and hardware platforms provide efficient processing of medical-imaging datasets for surgical planning. For neurosurgical interventions employing a straight access path, planning entails selecting a path from the scalp to the target area that's of minimal risk to the patient. A proposed GPU-accelerated method enables interactive quantitative estimation of the risk for a particular path. It exploits acceleration spatial data structures and efficient implementation of algorithms on GPUs. In evaluations of its computational efficiency and scalability, it achieved interactive rates even for high-resolution meshes. A user study and feedback from neurosurgeons identified this method's potential benefits for preoperative planning and intraoperative replanning.

Keywords: Medical image processing; Neurosurgery; Graphics processing units; Instruction sets; Data structures; Interactive visualizations; GPU acceleration; Neurosurgical interventions; Risk maps; Straight access; Computer graphics; Visualizations


No-reference image quality assessment in curvelet domain


SIGNAL PROCESSING-IMAGE COMMUNICATION, 2014, Vol. 29, No. 4, pp. 494-505

Authors: Liu, Lixiong; Dong, Hongping; Huang, Hua; Bovik, Alan C. Affiliations: Beijing Inst Technol Sch Comp Sci & Technol Beijing Lab Intelligent Informat Technol Beijing 100081 Peoples R China; Univ Texas Austin Lab Image & Video Engn Dept Elect & Comp Engn Austin TX 78712 USA

We study the efficacy of utilizing a powerful image descriptor, the curvelet transform, to learn a no-reference (NR) image quality assessment (IQA) model. A set of statistical features is extracted from a computed image curvelet representation, including the coordinates of the maxima of the log-histograms of the curvelet coefficient values, and the energy distributions of both orientation and scale in the curvelet domain. Our results indicate that these features are sensitive to the presence and severity of image distortion. Operating within a 2-stage framework of distortion classification followed by quality assessment, we train an image distortion and quality prediction engine using a support vector machine (SVM). The resulting algorithm, dubbed CurveletQA for short, was tested on the LIVE IQA database and compared to state-of-the-art NR/FR IQA algorithms. We found that CurveletQA correlates well with human subjective opinions of image quality, delivering performance that is competitive with popular full-reference (FR) IQA algorithms such as SSIM, and with top-performing NR IQA models. At the same time, CurveletQA has a relatively low complexity. (C) 2014 Elsevier B.V. All rights reserved.

Keywords: Image quality assessment (IQA); No reference (NR); Curvelet; Natural scene statistics (NSS); Support vector machine (SVM)


Engraved character recognition using computer vision to recognize engine and chassis numbers: Computer vision technique to identify engraved numbers


International Conference on Information Processing (ICIP)

Authors: Aniket V. Patil; Mrinai M. Dhanvijay. Affiliation: Electronics and Telecommunication Department Modern Education Society's College of Engineering Pune Pune India

ISBN: (print) 9781467377591

Optical character recognition (OCR) systems have been effectively developed for the recognition of printed characters. One such application is identifying the engine number and chassis number engraved on machine parts. Manual logging of serial numbers in industry is tedious and time-consuming. Our proposed system is robust under poor illumination conditions. Our overall system is efficient and can be applied in real-time applications. Since OCR is a well-studied area in which powerful algorithms already exist, such as the Zidouri algorithm for letter segmentation, the Blob detection algorithm for removal of unwanted areas and character extraction, and the Hilditch algorithm for Arabic character recognition, our OCR-based engraved character recognition yields more accurate results, with up to 99.99% accuracy. The paper explains how the optical character recognition technique, along with computer vision, can be applied to identify the engine number and chassis number engraved on two- and four-wheeler vehicles.

Keywords: Vehicles; Character recognition; Engines; Optical character recognition software; Optical imaging; Image segmentation; Java


Automatic emotion recognition in compressed speech using acoustic and non-linear features


Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)

Authors: N. García; J.C. Vásquez-Correa; J.D. Arias-Londoño; J.F. Várgas-Bonilla; J.R. Orozco-Arroyave. Affiliations: Faculty of Engineering Universidad de Antioquia UdeA Medellin Colombia; Faculty of Engineering Universidad de Antioquia UdeA Calle 70 No. 52-21 Medellín Colombia; Faculty of Engineering Friedrich Alexander Universitat Erlangen Germany

Automatic recognition of emotions in speech has attracted the attention of the research community in recent years. Some of its most relevant proposed applications are in call centers. In these scenarios the speech is distorted by compression algorithms. The effects of such distortion on the performance of systems for automatic recognition of emotions must be assessed. In this study these effects are evaluated independently of any other distortions generated by the communications channel. Several state-of-the-art codecs are used to compress the speech signals of two emotional speech databases. The databases used are the Berlin Database of Emotional Speech and eNTERFACE'05. The methodology considers voiced and unvoiced segments of the speech separately. Spectral, cepstral, noise and Non-Linear Dynamics (NLD) measures are used to characterize the segments. Finally, a classifier based on a Gaussian Mixture Model (GMM) is used to identify the emotion. The results indicate that voiced segments are less affected by the compression than unvoiced ones in terms of classification accuracy. They also show that the bandwidth of the analyzed signals is an important factor in the classification results.

Keywords: Codecs; Speech; Databases; Emotion recognition; Speech recognition; Accuracy; Frequency measurement


Progress Towards Automated Early Stage Detection of Diabetic Retinopathy: Image Analysis Systems and Potential


JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2014, Vol. 34, No. 6, pp. 520-527

Authors: Mane, Vijay M.; Jadhav, Dattatray V. Affiliations: Savitribai Phule Pune Univ JSPMS Rajarshi Shahu Coll Engn Dept Elect Engn Pune 411033 Maharashtra India; TSSM Bhivarabai Sawant Coll Engn & Res Pune 411041 Maharashtra India

Captured retina images enable important parts of the visual system to be analyzed. Automated retinal image processing is becoming a primary screening tool for the detection of diseases such as diabetic retinopathy (DR). An automated system reduces human error and also reduces the burden on ophthalmologists. The accurate detection of microaneurysms (MAs) is an important step for the early detection of DR. MAs appear as a first sign of DR and can be seen on retina images. This paper discusses some of the current techniques used to automatically detect MAs from retinal digital fundus images. This review outlines the general principle upon which retinal digital image analysis is based for the detection of MAs. The algorithms are categorized according to four processing steps (preprocessing, candidate MA detection, feature extraction, and classification). Various gold standard or ground truth databases, data sample size, and the use of image databases are discussed. The variety of outcome measures and flaws in the literature are discussed. The challenges and future potential for research are discussed to provide guidance to algorithm designers of the early detection of DR.

Keywords: Diabetic retinopathy; Microaneurysms; Preprocessing; Candidate detection; Classification


Combating bad weather part I: Rain removal from video


Synthesis Lectures on Image, Video, and Multimedia Processing, 2014, Vol. 16, No. 2, pp. 1-92

Authors: Mukhopadhyay, Sudipta; Tripathi, Abhishek Kumar. Affiliations: IIT Kharagpur India; Uurmi Systems India

ISBN: (print) 9781627055765

Current vision systems are designed to perform in normal weather conditions. However, no one can escape from severe weather conditions. Bad weather reduces scene contrast and visibility, which results in degradation in the performance of various computer vision algorithms such as object tracking, segmentation and recognition. Thus, current vision systems must include some mechanisms that enable them to perform up to the mark in bad weather conditions such as rain and fog. Rain causes spatial and temporal intensity variations in images or video frames. These intensity changes are due to the random distribution and high velocities of the raindrops. Fog causes low contrast and whiteness in the image and leads to a shift in the color. This book has studied rain and fog from the perspective of vision. The book has two main goals: 1) removal of rain from videos captured by a moving and static camera, 2) removal of the fog from images and videos captured by a moving single uncalibrated camera system. The book begins with a literature survey. Pros and cons of the selected prior art algorithms are described, and a general framework for the development of an efficient rain removal algorithm is explored. Temporal and spatiotemporal properties of rain pixels are analyzed and, using these properties, two rain removal algorithms for videos captured by a static camera are developed. For the removal of rain, the temporal and spatiotemporal algorithms require fewer consecutive frames, which reduces buffer size and delay. These algorithms do not assume the shape, size and velocity of raindrops, which makes them robust to different rain conditions (i.e., heavy rain, light rain and moderate rain). In a practical situation, there is no ground truth available for rain video. Thus, a no-reference quality metric is very useful in measuring the efficacy of the rain removal algorithms. Temporal variance and spatiotemporal variance are presented in this book as no-reference quality metr

Keywords: Rain
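
The abstract above names temporal variance as a no-reference quality metric, but it is cut off before defining it. The sketch below shows one plausible reading, assumed here rather than taken from the book: the population variance of each pixel's intensity across frames, averaged over all pixels. The function name `temporal_variance` and the list-of-lists frame layout are illustrative choices.

```python
def temporal_variance(frames):
    """Mean over pixels of the per-pixel intensity variance across frames.

    `frames` is a list of grayscale frames, each a list of rows of numbers.
    Assumption: population variance, averaged uniformly over pixels.
    """
    n = len(frames)
    if n == 0:
        raise ValueError("at least one frame is required")
    height, width = len(frames[0]), len(frames[0][0])
    total = 0.0
    for i in range(height):
        for j in range(width):
            # Intensity of this pixel in every frame.
            values = [frame[i][j] for frame in frames]
            mean = sum(values) / n
            total += sum((v - mean) ** 2 for v in values) / n
    return total / (height * width)


# A static scene has no temporal variation; a flickering pixel raises the score.
static = [[[5, 5]], [[5, 5]]]
flicker = [[[0, 5]], [[10, 5]]]
print(temporal_variance(static))
print(temporal_variance(flicker))
```

Under this reading, a lower value after rain removal would suggest less frame-to-frame flicker in a scene with little true motion.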


Robust target tracking algorithm for MAV navigation system


International Conference on Industrial Instrumentation and Control (ICIC)

Authors: S. Sankarasrinivasan; E. Balasubramanian; F. Y. Hsiao; L. J. Yang. Affiliations: Center for Autonomous System Research Vel Tech University Chennai India; Department of Mechanical and Electromechanical Engineering Tamkang University Tamsui Taiwan

Micro Aerial Vehicles (MAVs) are becoming ubiquitous, with ever-increasing applications in the defense, space and environmental sectors. In real-time scenarios, MAVs are expected to perform autonomously, and the development of intelligent algorithms for pattern recognition and object tracking is in high demand. This work concentrates on the development of a vision-based navigation system for real-time target tracking using MAVs. The target is identified based on its color feature, and various color models, namely RGB, Normalized RGB, HSI, YUV, YIQ, YCbCr, CIELAB and CIELUV, are considered for thresholding analysis. The idea is to frame an effective image processing algorithm with respect to thresholding time and accuracy. In addition, the robustness of the color models to various distortions, such as fast fading, Gaussian blur, JPEG, JP2K and white noise, is also investigated. Simulation results suggest that Y-based color models exhibit less thresholding time, good accuracy and robustness to noise. The target tracking algorithm is developed using the optimum color model and justified through real-time experimentation. A MATLAB (Matrix Laboratory) based navigation system is developed encompassing a micro camera, an A/V transmitter and receiver unit, a flight controller, an image processing system and other interfacing circuits. The navigation system is successfully tested in our lab environments and is shown to be a realizable and cost-effective solution.

Keywords: Image color analysis; Real-time systems; Colored noise; Navigation; Target tracking; Robustness
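
The color-based thresholding described in the abstract above can be illustrated with a minimal sketch. It is not the authors' MATLAB implementation. It assumes BT.601 luma weights for the Y channel and an arbitrary threshold range, and the helper names `luma`, `threshold_luma` and `centroid` are hypothetical.

```python
def luma(r, g, b):
    """Return the Y (luma) value of 8-bit RGB components, BT.601 weights."""
    return 0.299 * r + 0.587 * g + 0.114 * b


def threshold_luma(image, low, high):
    """Return a binary mask: 1 where low <= Y <= high, else 0.

    `image` is a list of rows, each row a list of (r, g, b) tuples.
    """
    return [[1 if low <= luma(*pixel) <= high else 0 for pixel in row]
            for row in image]


def centroid(mask):
    """Return the (row, col) centroid of the set pixels, or None if empty."""
    points = [(i, j) for i, row in enumerate(mask)
              for j, value in enumerate(row) if value]
    if not points:
        return None
    count = len(points)
    return (sum(p[0] for p in points) / count,
            sum(p[1] for p in points) / count)


# Toy 2x2 frame: a dark left column and a bright right column (the "target").
frame = [
    [(10, 10, 10), (250, 250, 250)],
    [(12, 8, 9), (240, 245, 250)],
]
mask = threshold_luma(frame, 200, 255)
print(mask)
print(centroid(mask))
```

Tracking the mask centroid from frame to frame is one simple way to turn such a threshold into a target position estimate. The abstract does not say which tracking rule the authors used.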


ISAR MOTION COMPENSATION BASED ON A NEW DOPPLER PARAMETERS ESTIMATION PROCEDURE


IEEE International Geoscience and Remote Sensing Symposium

Authors: Carlo Noviello; Gianfranco Fornaro; Paolo Braca; Marco Martorella. Affiliations: Institute for Electromagnetic Sensing of the Environment (IREA-CNR); NATO Science & Technology Organization Centre for Maritime Research and Experimentation (CMRE); Department of Ingegneria dell'Informazione University of Pisa

ISBN: (print) 9781479979301

The work addresses the problem of compensating the distortion effects induced by the translational motion of moving targets in Inverse Synthetic Aperture Radar (ISAR) imaging systems. ISAR motion compensation is the most crucial step in the ISAR autofocusing technique; this task is typically solved by implementing exhaustive search algorithms that adopt suitable functionals based, for instance, on image entropy or image contrast. In this work, we discuss an innovative and fast motion compensation procedure that is based on the estimation of two key Doppler parameters: the Doppler Centroid and the Doppler Rate, which are related to the target motion parameters. The effectiveness of the proposed method is tested on real data acquired by a static Frequency Modulated Continuous Wave radar with a wide azimuth beamwidth; the radar is installed near the inner harbor of La Spezia (Italy) and is owned by the Centre for Maritime Research and Experimentation of the North Atlantic Treaty Organization (CMRE-NATO).

Keywords: ISAR; ISAR Motion Compensation; Doppler processing; ISAR Autofocusing; FMCW radar
