image segmentation provides a powerful semantic description of videoimagery essential in image understanding and efficient manipulation of image data. In particular, segmentation based on image motion defines regions...
详细信息
ISBN:
(纸本)0819414778
image segmentation provides a powerful semantic description of videoimagery essential in image understanding and efficient manipulation of image data. In particular, segmentation based on image motion defines regions undergoing similar motion allowing an image coding system to more efficiently represent video sequences. This paper describes a general iterative framework for segmentation of video data. The objective of our spatiotemporal segmentation is to produce a layered image representation of the video for image coding applications whereby video data is simply described as a set of moving layers.
Integral imaging is an attractive auto-stereoscopic three dimensional (3D) technique for next generation 3DTV. To improve its video quality, new techniques are required to effectively compress the huge volume of integ...
详细信息
ISBN:
(纸本)9781457713033
Integral imaging is an attractive auto-stereoscopic three dimensional (3D) technique for next generation 3DTV. To improve its video quality, new techniques are required to effectively compress the huge volume of integral image (ii) data. In this paper, a new compression method implemented by multi-view video coding (MVC) is provided and used for sub-images (SI). SI is an alternative form of 2D image transformed from original ii. Each SI represents the 3D scene from parallel viewing directions and contains superior compression capabilities than original captured elemental images (EI). For this reason, we consider arranging the group of SIs as the format of multi-view video (MVV) and then encode the generated MVV by MVC standard. Experimental results show that our proposed compression approach improves the compression efficiency when compared to the traditional MPEG-4/AVC compression method for ii.
In this paper, we report recent effort in expanding image and video compression to computer graphics applications, in particular those based on image-based rendering. We then introduce our recent progress in developin...
详细信息
ISBN:
(纸本)0780362985
In this paper, we report recent effort in expanding image and video compression to computer graphics applications, in particular those based on image-based rendering. We then introduce our recent progress in developing a networked immersive environment that integrates image analysis, face animation, and streaming of 3D objects, to provide a truly immersive environment with the goal of replacing existing video conferencing platforms.
Recently, the law enforcement community with professional interests in applications of image/videoprocessing technology, has been exposed to scientifically flawed salesmanship assertions regarding the advantages and ...
详细信息
ISBN:
(纸本)0819444596
Recently, the law enforcement community with professional interests in applications of image/videoprocessing technology, has been exposed to scientifically flawed salesmanship assertions regarding the advantages and disadvantages of various hardware image acquisition devices (video digitizing cards). These assertions state a necessity of using SMPTE CCIR-601 standard when digitizing NTSC composite video signals from surveillance videotapes. In particular, it would imply that the pixel-sampling rate of 720*486 is absolutely required to capture all the available video information encoded in the composite video signal. Fortunately, these erroneous statements can be directly analyzed within the strict mathematical context of Shannon's Sampling Theory. Here we apply the classical Shannon-Nyquist results to the process of digitizing composite analog video from videotapes to dispel the theoretically unfounded, wrong assertions.
The analysis of video observation tapes can be tedious and tiring work. An analysis system can relieve this burden and create a compilation tape autonomously. Working unattended AVACS creates a tape and a Compact Disk...
详细信息
ISBN:
(纸本)0819444596
The analysis of video observation tapes can be tedious and tiring work. An analysis system can relieve this burden and create a compilation tape autonomously. Working unattended AVACS creates a tape and a Compact Disk (CD) with only the images-of-interest.
Here we discuss space-time processing of tactical FLIR video. Using a FLIR data set which does not have the perspicuity of traditional video segments, we apply a succession of simple signal processing operations to im...
详细信息
ISBN:
(纸本)0780362985
Here we discuss space-time processing of tactical FLIR video. Using a FLIR data set which does not have the perspicuity of traditional video segments, we apply a succession of simple signal processing operations to improve compression. These include transforms in time and space as well as nonlinear processing. In doing this we separate and process the still and motion parts of the video differently. Using this non-standard method of videoprocessing, we obtained compression up to 5x MPEG with comparable image quality. All processing is amenable to parallel implementation;therefore, realtime processing is possible. The current advantage in compression allows the possibility of real-time transmission over a 64 kbps link.
This paper discusses a proposed processing technique for combining videoimagery with auxiliary sensor information. The latter greatly simplifies imageprocessing by reducing complexity of the transformation model. Th...
详细信息
ISBN:
(纸本)0780367251
This paper discusses a proposed processing technique for combining videoimagery with auxiliary sensor information. The latter greatly simplifies imageprocessing by reducing complexity of the transformation model. The mosaics produced by this technique are adequate for many applications, in particular habitat mapping. The algorithm is demonstrated through simulations and hardware configuration is described.
The affective content of a video is defined as the expected amount and type of emotion that are contained in a video. Utilizing this affective content will extend the current scope of application possibilities. The di...
详细信息
ISBN:
(纸本)9781424404810
The affective content of a video is defined as the expected amount and type of emotion that are contained in a video. Utilizing this affective content will extend the current scope of application possibilities. The dimensional approach to representing emotion can play an important role in the development of an affective video content analyzer. The three basic affect dimensions are defined as valence, arousal and control [5]. This paper presents a novel FPGA-based system for modeling the arousal content of a video based on user saliency and film grammar. The design is implemented on a Xilinx Virtex-ii xc2v6000 on board a RC300 board.
Parsing video content is an important first step in the video indexing process. This paper presents algorithms to automate the video parsing task, including video partitioning and video clip classification according t...
详细信息
ISBN:
(纸本)0819414778
Parsing video content is an important first step in the video indexing process. This paper presents algorithms to automate the video parsing task, including video partitioning and video clip classification according to camera operations using compressed video data. We have studied and implemented two algorithms for partitioning video data compressed according to the MPEG standard. The first one is based on discrete cosine transform coefficients of video frames, and the other based on correlation of motion vectors. Algorithms to detect camera operations using motion vectors are presented.
A 372.3 mW coarse-grained reconfigurable image stream processor, CRISP-ii, for image-processing and intelligent operations is implemented in TSMC 90 nm low-power technology with a core size of 15.21 mm 2 . With the pr...
A 372.3 mW coarse-grained reconfigurable image stream processor, CRISP-ii, for image-processing and intelligent operations is implemented in TSMC 90 nm low-power technology with a core size of 15.21 mm 2 . With the proposed multi-stream mode unified protocol, hierarchical ring architecture, and adaptive computing engine, CRISP-ii is able to solve the emerging scalability and flexibility problems. It could execute several advanced operations efficiently, like high-dynamic-imaging and face detection. Compared with other state-of-the-art processors, CRISP-ii achieves 8.42 times power efficiency than the highly parallel SIMD processor, and meets the real-time requirement of video cameras with QFHD (3840 × 2160) resolution.
暂无评论