In 1991 Kodak released a set of 24 digital color images derived from a variety of film source materials. Since then, most image processing algorithms have been developed, optimized, tested and compared using this set....
详细信息
ISBN:
(纸本)9781479923427
In 1991 Kodak released a set of 24 digital color images derived from a variety of film source materials. Since then, most image processing algorithms have been developed, optimized, tested and compared using this set. Until a few years ago it was considered "the" image set;however, today it shows its limitations. Researches have expressed their need for better, more up-to-date material. We present a new set of high quality color image sequences captured with our professional digital cinema camera. This camera stores uncompressed raw sensor data and the set is freely available for FTP at password: image set.
In this paper we present the contactless active optical sensor for driver fatigue detection in variable weather and lighting conditions. The fatigue was determined by monitoring activity of the eyes. In our case the b...
详细信息
ISBN:
(纸本)9781467346412
In this paper we present the contactless active optical sensor for driver fatigue detection in variable weather and lighting conditions. The fatigue was determined by monitoring activity of the eyes. In our case the bright pupil effect was used. The brightness of the pupil was increased by developing a new keying technique of controlling the IR emitter signal. This technique caused the brightness to increase in comparison to the continuous signal by 50%on average. The new developed sensor, as well as, the image processing algorithm was used during the construction of the driver fatigue monitoring system. The system detects fatigue by analyzing the variability of the following signals: PERCLOSE, PEROPEN, blink frequency, activity of the eyes and pupil diameter.
Stochastic encoding represents a value using the probability of ones in a random bit stream. Computation based on this encoding has good fault-tolerance and low hardware cost. However, one of its major issues is long ...
详细信息
ISBN:
(纸本)9781479904945
Stochastic encoding represents a value using the probability of ones in a random bit stream. Computation based on this encoding has good fault-tolerance and low hardware cost. However, one of its major issues is long processing time. We have to use a long enough bit stream to represent a value to guarantee that random fluctuations introduce only small errors to final computation results. For example, for most digital image processing algorithms, we need a 512-bit stream to represent an 8-bit pixel value stochastically to guarantee that the final computation error is less than 5%. To solve this issue, this paper proposes to share bits between adjacent bit streams to represent adjacent deterministic values. For example, in imageprocessing applications, the bit stream which represents the current pixel value can share parts of the bits in the bit stream which represents the previous pixel value. We use an image contrast stretching algorithm to evaluate this method. Our experimental results show that the proposed methods can improve the performance by 90%.
Physicians use ultrasound scans to obtain real-time images of internal organs, because such scans are safe and inexpensive. However, people in remote areas face difficulties to be scanned due to aging society and phys...
详细信息
ISBN:
(纸本)9781457702150
Physicians use ultrasound scans to obtain real-time images of internal organs, because such scans are safe and inexpensive. However, people in remote areas face difficulties to be scanned due to aging society and physician's shortage. Hence, it is important to develop an autonomous robotic system to perform remote ultrasound scans. Previously, we developed a robotic system for automatic ultrasound scan focusing on human's liver. In order to make it a completely autonomous system, we present in this paper a way to autonomously localize the epigastric region as the starting position for the automatic ultrasound scan. An image processing algorithm marks the umbilicus and mammary papillae on a digital photograph of the patient's abdomen. Then, we made estimation for the location of the epigastric region using the distances between these landmarks. A supporting algorithm distinguishes rib position from epigastrium using the relationship between force and displacement. We implemented these algorithms with the automatic scanning system into an apparatus: a Mitsubishi Electric's MELFA RV-1 six axis manipulator. Tests on 14 healthy male subjects showed the apparatus located the epigastric region with a success rate of 94%. The results suggest that image recognition was effective in localizing a human body part.
Temporal lobe epilepsy is the most common cause of drug-resistant seizures. Electrical stimulation seems to be a promising alternative to the classical anteromesial temporal resection. Our work aims at improving and v...
详细信息
ISBN:
(纸本)9781467319676
Temporal lobe epilepsy is the most common cause of drug-resistant seizures. Electrical stimulation seems to be a promising alternative to the classical anteromesial temporal resection. Our work aims at improving and validating therapeutic approaches for treating temporal lobe epilepsy using high-frequency stimulation. The relative phase clustering index (rPCI) is used to describe the epileptogenicity of a brain region and seizure prediction. An online image processing algorithm using a 3D camera is presented for facilitating the tedious and time consuming visual image inspection by traditional long term video footage required to determine optimal electrical stimulation parameters. See our companion paper describing the 2D solution in this journal.
As it is well known, multimodality is a very common task in the human-robot communication. Human conversation is also considered usually multimodal, and a great amount of research is done worldwide to engine novel rob...
详细信息
ISBN:
(纸本)9781479908271
As it is well known, multimodality is a very common task in the human-robot communication. Human conversation is also considered usually multimodal, and a great amount of research is done worldwide to engine novel robotic systems embedding more and more intelligence for human gestures or speech recognition abilities enhancement. This paper presents a hardware configuration developed for multimodal communication skills implementation on the NI SbRIO-9631 prototype robot. The standard hardware structure of this mobile device has been gradually extended with voice recognition and real-time imageprocessing units. By using the Lab View graphical software toolkit, these modules integrates through specific software drivers various sensor modality including vision and speech signals analyzing. The robot endowed with the mentioned capabilities becomes a very flexible and powerful development tool for human-robot multimodal communication research and implementation. In addition to this, is also suitable for a large amount of voice signals and image processing algorithms testing, through that the robot displays intelligence and cooperativeness in its behavior.
Compression is increasingly used in medical applications to enable efficient and universally accessible electronic health records. However, lossy compression introduces artifacts that can alter diagnostic accuracy, in...
详细信息
ISBN:
(纸本)9781457702150
Compression is increasingly used in medical applications to enable efficient and universally accessible electronic health records. However, lossy compression introduces artifacts that can alter diagnostic accuracy, interfere with image processing algorithms and cause liability issues in cases of diagnostic errors. Compression guidelines were introduced to mitigate these issues and foster the use of modern compression algorithms with diagnostic imaging. However, these guidelines are usually defined as maximum compression ratios for each imaging protocol and do not take compressibility variations due to image content into account. In this paper we have evaluated the compressibility of thousands of computed tomography slices of an anthropomorphic thoracic phantom acquired with different parameters. We have shown that exposure, slice thickness and reconstruction filters have a significant impact on compressibility suggesting that guidelines based solely on compression ratios may be inadequate.
We often need to consider the influence on human perception when we develop an image processing algorithm or design parameters for imageprocessing. The perception of color noise is also important for understanding th...
详细信息
ISBN:
(纸本)9781479923427
We often need to consider the influence on human perception when we develop an image processing algorithm or design parameters for imageprocessing. The perception of color noise is also important for understanding the human vision system (HVS). Although we can use the CSF to express the property of achromatic noise appearance, it is not so easy to be applied to colors. While the CSF can be explained by a primary color vision, a color appearance is governed by higher order mechanisms. In this paper, we show the quantitative difference between achromatic and chromatic noise appearance. To reveal the perception of color noise quantitatively, we have conducted subjective experiments with modeled achromatic and chromatic noises using the 2AFC method. According to the results, it is shown that the ratio of luminance noise sensitivity to color noise sensitivity is 10~0-10~2 depends on their spatial frequencies and background colors.
Road condition data are important in transportation management systems. Over the last decades, significant progress has been made and new approaches have been proposed for efficient collection of pavement condition da...
详细信息
Road condition data are important in transportation management systems. Over the last decades, significant progress has been made and new approaches have been proposed for efficient collection of pavement condition data. However, the assessment of unpaved road conditions has been rarely addressed in transportation research. Unpaved roads constitute approximately 40% of the U.S. road network, and are the lifeline in rural areas. Thus, it is important for timely identification and rectification of deformation on such roads. This article introduces an innovative Unmanned Aerial Vehicle (UAV)-based digital imaging system focusing on efficient collection of surface condition data over rural roads. In contrast to other approaches, aerial assessment is proposed by exploring aerial imagery acquired from an unpiloted platform to derive a three-dimensional (3D) surface model over a road distress area for distress measurement. The system consists of a low-cost model helicopter equipped with a digital camera, a Global Positioning System (GPS) receiver and an Inertial Navigation System (INS), and a geomagnetic sensor. A set of image processing algorithms has been developed for precise orientation of the acquired images, and generation of 3D road surface models and orthoimages, which allows for accurate measurement of the size and the dimension of the road surface distresses. The developed system has been tested over several test sites with roads of various surface distresses. The experiments show that the system is capable for providing 3D information of surface distresses for road condition assessment. Experiment results demonstrate that the system is very promising and provides high accuracy and reliable results. Evaluation of the system using 2D and 3D models with known dimensions shows that subcentimeter measurement accuracy is readily achieved. The comparison of the derived 3D information with the onsite manual measurements of the road distresses reveals differences of 0.50
This paper presents the design of a multichannel imaging system where the different optical channels have a different angular resolution and field-of-view. Such an imaging system is able to resolve fine details in a s...
详细信息
This paper presents the design of a multichannel imaging system where the different optical channels have a different angular resolution and field-of-view. Such an imaging system is able to resolve fine details in a small region of interest through the channel that has the highest angular resolution (0.0096 degrees) while controlling the surrounding region through the channel that has the widest field-of-view (2 x 40 degrees). An interesting feature of such a multichannel, multiresolution imaging system is that various image processing algorithms can be applied at different segments of the image sensor. We have designed a three channel imaging system where each optical channel consists of four aspheric lens surfaces. These three imaging channels share a single image sensor with a resolution of 1440 x 960 and a 10 mu m pixel size. All imaging channels have diffraction-limited performance ensuring good overall image quality. (C) 2012 Optical Society of America
暂无评论