ISBN: 9781622761234 (print)
General human-computer interaction (HCI) based on a monocular camera is one of the most important areas in which image sensors can bring rapid and comprehensive intelligence to network terminals and natural interaction systems. Currently, sensor networks require many expensive, single-function sensors, and interactions between humans and objects, as well as among objects, remain complex. Under these circumstances, the development of intelligent network terminals based on image sensors is becoming increasingly significant. In this paper, we propose an interaction system based on HCI technology that uses the most widely available image sensors, such as PC cameras, phone cameras, and surveillance cameras in public places. The system applies simple but fast image processing techniques, such as image segmentation, tracking, and recognition, together with some improved methods.
We present a beamspace realization of the generalized sidelobe canceller (GSC) applied to audio and speech capture, and show that it has some specific advantages compared to the standard GSC implementation for certain noise assumptions. Specifically, we demonstrate that for this application the proposed implementation is less prone to signal cancellation due to source positioning errors, and that it exhibits better attenuation of low frequencies when broadband audio signals are captured by uniform linear arrays. Results are shown for simulations with small uniform linear arrays of five and eleven elements, as well as for actual data recorded by a 300-element microphone array at the Staples Center sports arena.
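For orientation, the standard (element-space) GSC that the abstract compares against can be sketched as below. This is not the beamspace realization proposed in the paper. It is a minimal Griffiths-Jim style sketch that assumes the channels are already time-aligned to the target, uses adjacent-channel differences as the blocking matrix, and adapts with NLMS. The function name `gsc_broadside` and the parameters `num_taps`, `mu`, and `eps` are illustrative assumptions.

```python
import numpy as np

def gsc_broadside(x, num_taps=8, mu=0.1, eps=1e-8):
    """Standard GSC sketch. x has shape (num_mics, num_samples) and is
    assumed to be time-aligned to the target direction (hypothetical setup)."""
    num_mics, num_samples = x.shape
    # Fixed beamformer: delay-and-sum, a plain average for aligned channels.
    d = x.mean(axis=0)
    # Blocking matrix: adjacent-channel differences cancel the aligned target.
    u = x[:-1] - x[1:]                        # (num_mics - 1, num_samples)
    w = np.zeros((num_mics - 1, num_taps))    # adaptive FIR weights
    buf = np.zeros((num_mics - 1, num_taps))  # tap-delay lines
    y = np.zeros(num_samples)
    for n in range(num_samples):
        # Shift the delay lines and push in the newest blocked samples.
        buf = np.roll(buf, 1, axis=1)
        buf[:, 0] = u[:, n]
        # Subtract the adaptive noise estimate from the fixed beam.
        y[n] = d[n] - np.sum(w * buf)
        # NLMS update driven by the output, which serves as the error signal.
        w += mu * y[n] * buf / (eps + np.sum(buf * buf))
    return y
```

When every channel carries the same aligned signal, the blocking branch is zero and the output equals the fixed beam. With source positioning errors the target leaks into the blocking branch, which is the signal cancellation problem the abstract refers to.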
This paper deals with segmentation methods and the determination of fatigue features for camera-based visual systems that monitor driver vigilance. Generally, visual monitoring systems have to analyse a set of computed fatigue features and recognize driver inattention or sleepiness. The paper focuses mostly on the segmentation methods used for reliable eye tracking, because eye features are the most significant features for determining driver fatigue. Fundamental segmentation methods, such as simple colour segmentation and the Hough transform, are introduced first. A more complex Haar-like features approach and a symmetry detection approach are then briefly introduced. Finally, several of the leading fatigue features are listed and described. All the presented segmentation methods were tested on both laboratory and real images.
ISBN: 9781612848792 (print)
This paper reports a personalized model of an information retrieval system based on a three-layer agent architecture. The system includes a personalized user agent, an information retrieval agent, and an information filtering agent. It provides personalized services to users through intelligent agents that cooperate with one another.
At present, pumping station measurement and control systems suffer from difficult networking, poor expansibility, and low data transfer rates. To solve these problems, a new type of pumping station measurement and control system is designed using ZigBee and a 3G network. In this system, the local measurement and control unit uses ZigBee to wirelessly acquire information from, and remotely control, each remote measurement and control node, and it uses the 3G network for two-way data transfer with the monitoring center. Preliminary experimental results show that, by exploiting the technical advantages of ZigBee and 3G networks, the system not only accomplishes the wireless measurement and control tasks but also overcomes many defects of current measurement and control systems.
We present a new model-based monaural speech separation technique for separating two speech signals when only a single recording of their linear mixture is available. Two important aspects of model-based monaural speech separation are the modeling technique and the estimation technique. In this approach, we introduce a sub-section vector quantization technique and use it as the modeling technique instead of the conventional vector quantization (VQ) method. The separated speech signals are then estimated using a simple soft mask filter whose states are controlled by the components of the codevectors. In the speech separation experiments, the proposed method is shown to improve SNR by 1.8 dB compared to a system that uses the conventional VQ technique as the modeling technique and a binary mask filter as the estimator.
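The difference between a binary mask and a soft mask can be illustrated with a short sketch. The abstract does not give the exact form of its soft mask filter, so the Wiener-like power ratio below is an assumption, and `mag1` and `mag2` stand in for the per-bin magnitude estimates that the selected codevectors would supply. All function names are illustrative.

```python
import numpy as np

def soft_mask(mag1, mag2, eps=1e-12):
    """Wiener-like soft mask for source 1 (assumed form, not the paper's)."""
    p1, p2 = mag1 ** 2, mag2 ** 2
    return p1 / (p1 + p2 + eps)

def binary_mask(mag1, mag2):
    """Binary mask: each bin goes entirely to the stronger source estimate."""
    return (mag1 >= mag2).astype(float)

def separate(mix_spec, mag1, mag2, soft=True):
    """Apply the mask for source 1 and its complement to the mixture spectrum."""
    m = soft_mask(mag1, mag2) if soft else binary_mask(mag1, mag2)
    return m * mix_spec, (1.0 - m) * mix_spec
```

A soft mask splits each bin in proportion to the estimated source powers, so bins where both speakers are active are shared rather than assigned wholly to one of them.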
ISBN: 9781457701726 (print)
In this paper, a compressive sensing (CS) photoacoustic imaging scheme based on a Digital Micromirror Device (DMD) is built. In compressive sensing photoacoustic imaging, the DMD is used as an optical mask. The mask is placed between a short-pulsed laser and the biological tissue to realize coded illumination. To realize random illumination, the coded pattern of the mask should be changed for each laser pulse. With the DMD, the random code patterns of the mask can be changed quickly by controlling a digital logic circuit. The illuminated tissue absorbs the optical energy and generates ultrasonic waves. The ultrasonic waves generated along the same arc are compressed and detected by an unfocused ultrasonic transducer. After a certain number of measurements, the photoacoustic image can be reconstructed by a suitable CS reconstruction algorithm.
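The measurement and reconstruction steps can be sketched in a simplified form. The sketch assumes each random binary mask yields a single scalar measurement (the sum of the illuminated absorbers), which ignores the arc geometry and time-resolved detection described above. It uses orthogonal matching pursuit (OMP) as a stand-in, since the abstract does not name its reconstruction algorithm. The names `dmd_measure` and `omp` are illustrative.

```python
import numpy as np

def dmd_measure(image, num_patterns, rng):
    """Simulate one scalar measurement per random binary (0/1) mask."""
    x = image.ravel()
    phi = rng.integers(0, 2, size=(num_patterns, x.size)).astype(float)
    return phi, phi @ x

def omp(phi, y, sparsity):
    """Orthogonal matching pursuit: greedily pick columns, refit by least squares."""
    residual = y.copy()
    support = []
    col_norms = np.linalg.norm(phi, axis=0) + 1e-12
    for _ in range(sparsity):
        # Score each column by its normalized correlation with the residual.
        corr = np.abs(phi.T @ residual) / col_norms
        if support:
            corr[support] = 0.0  # never pick the same column twice
        support.append(int(np.argmax(corr)))
        # Refit all selected coefficients jointly.
        coef, *_ = np.linalg.lstsq(phi[:, support], y, rcond=None)
        residual = y - phi[:, support] @ coef
    x_hat = np.zeros(phi.shape[1])
    x_hat[support] = coef
    return x_hat
```

Exact recovery is not guaranteed here. It depends on the sparsity of the image, the number of patterns, and the properties of the 0/1 measurement matrix.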
Edges are among the most fundamental and significant features of an image. Edge detection is a classical research topic in computer vision and image processing, and it is the first step of image analysis and understanding. With the continuous improvement of remote sensing imagery, and especially the appearance of digital aerial images, edge detection has become a necessary step in extracting information from such images. The purpose of edge detection is to discover information about the shapes and the reflectance or transmittance in an image. The correctness and reliability of its results directly affect a machine vision system's understanding of the objective world. In this paper, an FPGA-based architecture for edge detection algorithms is proposed. Implementing edge detection algorithms on a field programmable gate array (FPGA) has the advantage of access to large memory and embedded multipliers. FPGAs provide a platform for processing real-time algorithms on application-specific hardware with substantially higher performance than programmable digital signal processors (DSPs). The proposed architecture can be used as a building block of aerial imaging systems for navigation and pattern recognition. Hardware implementation results are presented for the Sobel and Prewitt operators.
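As a software reference for the two operators named above, the sketch below applies the standard 3x3 Sobel and Prewitt kernels to a grayscale image. It is not the FPGA architecture from the paper. The helper names and the use of |gx| + |gy| as the gradient magnitude (an approximation that avoids a square root) are illustrative choices.

```python
import numpy as np

# Horizontal-gradient kernels; the vertical ones are their transposes.
SOBEL_X = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)
PREWITT_X = np.array([[-1, 0, 1], [-1, 0, 1], [-1, 0, 1]], dtype=float)

def filter3x3(img, kernel):
    """Correlate img with a 3x3 kernel over the valid region (no padding)."""
    h, w = img.shape
    out = np.zeros((h - 2, w - 2))
    for i in range(3):
        for j in range(3):
            out += kernel[i, j] * img[i:i + h - 2, j:j + w - 2]
    return out

def edge_magnitude(img, kernel_x):
    """Approximate gradient magnitude as |gx| + |gy|."""
    gx = filter3x3(img, kernel_x)
    gy = filter3x3(img, kernel_x.T)
    return np.abs(gx) + np.abs(gy)
```

On an image with a vertical step edge, both operators respond only in the columns that straddle the step. Sobel weights the centre row twice as heavily as Prewitt does.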
ISBN: 9781457700293 (print)
Conventional microphone array near-field Fourier acoustic holography, which uses the Discrete Fourier Transform (DFT), can efficiently reconstruct the sound field and produce an image of the noise distribution. However, the Fourier transform introduces measurement error in practical applications, and the primary frequency for observing the sound field hologram has to be selected from the spectrum of the source signal. In this paper, we use empirical mode decomposition (EMD), which, owing to its completeness, orthogonality, and adaptiveness, can decompose multiple sound sources in the spatial domain and acquire instantaneous frequencies via intrinsic mode functions (IMFs). This approach requires no prior information about the primary frequency, which makes simultaneous observation of each source possible. In addition, the EMD sound source imaging approach may be integrated into a near-field equivalent source imaging (NESI) system, which includes a virtual microphone technique generally used for sound field image enhancement. We have implemented and compared the constituent 1D EMD and 2D EMD spatial transform systems and the EMD-based NESI approach in LabVIEW. Several experimental results and detailed discussions are also provided to verify the characteristics of multiple sound sources.
Most work on activity recognition focuses on 2D image properties, holistic spatiotemporal representations, or space-time shapes in the image domain rather than on 3D pose in a body-centric or world frame. Such techniques rely on advanced pattern recognition algorithms and on interpreting complex behavioral patterns. In this work we posit that it is possible to achieve 3D pose tracking using videos recorded in multi-camera surveillance systems. We show experimental results obtained on the PETS 2009 datasets. The estimation of the 3D articulated motion is achieved using a modified particle swarm optimization.
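The abstract does not describe its modifications to particle swarm optimization, so the sketch below shows only the canonical global-best PSO. In pose tracking the cost would score how well a candidate 3D pose matches the multi-camera observations; here `cost` is any user-supplied function, and the name `pso_minimize` and all parameter values are illustrative assumptions.

```python
import numpy as np

def pso_minimize(cost, dim, num_particles=30, iters=100,
                 w=0.7, c1=1.5, c2=1.5, bound=5.0, seed=0):
    """Canonical global-best PSO (not the paper's modified variant)."""
    rng = np.random.default_rng(seed)
    pos = rng.uniform(-bound, bound, size=(num_particles, dim))
    vel = np.zeros_like(pos)
    pbest = pos.copy()                                  # per-particle best positions
    pbest_cost = np.array([cost(p) for p in pos])
    g = int(np.argmin(pbest_cost))
    gbest, gbest_cost = pbest[g].copy(), pbest_cost[g]  # swarm-wide best
    history = [gbest_cost]
    for _ in range(iters):
        r1 = rng.random(pos.shape)
        r2 = rng.random(pos.shape)
        # Inertia plus pulls toward the personal and global bests.
        vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
        pos = pos + vel
        costs = np.array([cost(p) for p in pos])
        improved = costs < pbest_cost
        pbest[improved] = pos[improved]
        pbest_cost[improved] = costs[improved]
        g = int(np.argmin(pbest_cost))
        if pbest_cost[g] < gbest_cost:
            gbest, gbest_cost = pbest[g].copy(), pbest_cost[g]
        history.append(gbest_cost)
    return gbest, gbest_cost, history
```

For tracking, a swarm of this kind would typically be re-initialized around the previous frame's pose estimate for each new frame, although the abstract does not say whether the paper does so.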