In the real world, we commonly receive information simultaneously through two or more senses, with the brain fusing this data to produce a single coherent message. Lip-reading is one example of this phenomenon. Labora...
详细信息
In the real world, we commonly receive information simultaneously through two or more senses, with the brain fusing this data to produce a single coherent message. Lip-reading is one example of this phenomenon. Laboratory studies, on the other hand, often measure the response to a stimulus by a single sense and extrapolate these results to predict real-world behaviour. In this paper, we show that semantics have a significant impact on viewers' sensitivity to the quality of a video sequence for spatially separated parts of the sequence and, more importantly, that this difference in sensitivity can be changed by the presence of an audio signal. This result is important for any testing of subjects' responses to visual material. One example is the subjective assessment of the quality of video in an audio-visualcommunications system (such as television or video conferencing).
We describe a multi-chip CMOS VLSI visual motion processing system which combines analog circuitry with an asynchronous digital interchip communications protocol to allow more complex motion processing than is possibl...
详细信息
ISBN:
(纸本)0769500560
We describe a multi-chip CMOS VLSI visual motion processing system which combines analog circuitry with an asynchronous digital interchip communications protocol to allow more complex motion processing than is possible with all the circuitry in the focal plane. The two basic VLSI building blocks are a sender chip which incorporates a 2D imager array and transmits the position of moving spatial edges, and a receiver chip which computes a 2D optical flow vector field from the edge information. The elementary two-chip motion processing system consisting of a single sender and receiver is first characterized Subsequently, two three-chip motion processing systems are described. The first such system uses two sender chips to compute the presence of motion only at a particular stereoscopic disparity. The second such system uses two receivers to simultaneously compute a linear and polar topographic mapping of the image plane, resulting in information about image translation, rotation, and expansion. These three-chip systems demonstrate the modularity and flexibility of the multi-chip neuromorphic approach.
People acquire most of their information through their visual system. Moreover, people often quote that "a picture is worth a thousand words" and it is well known that color attracts attention and helps comm...
详细信息
People acquire most of their information through their visual system. Moreover, people often quote that "a picture is worth a thousand words" and it is well known that color attracts attention and helps communicate. Nevertheless, most of our documents are primarily black and white text! This talk reviews the history of visual communication, proposing the thesis that technology, both its advantages and its limitations, has distorted our document design and preparation. Technology is now reaching the point where color and illustrations can be included in many documents, and professionally prepared magazines, newspapers, etc regularly do so. The technology still needed for the future is that which will make it as easy to produce a color illustration as it is now to type a paragraph. This development will require both improved man-machine interfaces and teaching the average person new concepts of document design.
This study aims at finding an efficient compensation method of transmission errors in low bit-rate mobile communications using the H. 263 video codec. For this aim, we suggest a novel error compensation method in the ...
详细信息
This study aims at finding an efficient compensation method of transmission errors in low bit-rate mobile communications using the H. 263 video codec. For this aim, we suggest a novel error compensation method in the encoder (or transmitter), that can remove visual quality degradation due to spatio-temporal error propagation by utilizing a feedback channel. In the proposed method, a corrupted group of blocks (GOB) is concealed to avoid annoying artifacts due to the bit-error, and the GOB and its corresponding frame number are reported to the encoder via the feedback channel. Then, the encoder evaluates the negative acknowledgments and reconstructs the spatial and temporal error propagation by using a novel fast redecoding algorithm. The proposed error compensation method is compared with the two existing ones: the reference picture selection method and the error tracking method. Experimental results show that the proposed method is superior to the existing ones in many aspects.
Although H.320 is one of the most popular ITU-T standard for video conference systems, H.323 is receiving wide acceptance in the internet society. In this paper, we study the problem of transporting video conference t...
详细信息
Although H.320 is one of the most popular ITU-T standard for video conference systems, H.323 is receiving wide acceptance in the internet society. In this paper, we study the problem of transporting video conference traffic to and from the internet. Some characteristics of the problem are as follows. For example, H.323 video stream is VBR while H.320 video stream is CBR; H.323 is byte-oriented while H.320 is bit-oriented; audio and video packets are transmitted independently in H.323 while they are multiplexed together in H.320; the probability of packet loss in a H.323 network is much higher than in a H.320 ISDN circuit switching network. In this paper, we present our designs and some preliminary experimental results in dealing with these issues.
The recent advances in VLSI technology, high-speed processor designs, Internet/Intranet implementations, broadband networks (ATM and ISDN) and compression standards (JPEG, MPEG, H.261, H.263 and G.273) are leading to ...
详细信息
This paper reports about an implementation of a search engine for visual information content, which has been developed in the context of the forthcoming MPEG-7 standard. The system supports similarity-based retrieval ...
详细信息
Detecting interesting regions from pictures has become important in order to reduce the computational complexity associated with such time-consuming processes as object recognition. In this paper we assume that figure...
详细信息
The great potential of "foveated imaging" lies in the entropy reduction relative to the original image while minimizing the loss of visual information. Utilizing human foveation combined with video compressi...
详细信息
The great potential of "foveated imaging" lies in the entropy reduction relative to the original image while minimizing the loss of visual information. Utilizing human foveation combined with video compression, as well as communication and human-machine interface techniques, more efficient multimedia services are expected to be provided in the near future. In this paper, we introduce a prototype for foveated visualcommunications as one of future human interactive multimedia applications, and demonstrate the benefit of the foveation over fading statistics in the downtown area of Austin, Texas. In order to compare the performance with regular video, we use spatial/temporal resolution and source transmission delay as the evaluation criteria.
A non-iterative wavelet-based algorithm was proposed to reduce the ringing artifacts associated with a lossy compressed image. The proposed algorithm is based on the fact that increases in magnitude of the quantized w...
详细信息
ISBN:
(纸本)0780355830
A non-iterative wavelet-based algorithm was proposed to reduce the ringing artifacts associated with a lossy compressed image. The proposed algorithm is based on the fact that increases in magnitude of the quantized wavelet coefficients lead to decreases in visual smoothness. Thus a shrinkage algorithm is applied to maintain visual smoothness. The proposed algorithm is, however, adaptive in nature, in which the amount of shrinkage depends on edge strength, region activity and compression ratio. Experimental results have confirmed that the adaptive algorithm could suppress the ringing artifacts and improve visual smoothness, especially around edges where ringing is severe.
暂无评论