Computer imaging technology is a kind of use of digital photography, using a computer as amedium to realize interactive communication and interaction between humans and machines through the collection and processing o...
详细信息
ISBN:
(纸本)9783031243660;9783031243677
Computer imaging technology is a kind of use of digital photography, using a computer as amedium to realize interactive communication and interaction between humans and machines through the collection and processing of images and the editing and storage of graphic information. The purpose of this paper to study the design of the 3D imagevisual communication system based on computer image technology is to improve the mastery of 3D image technology and design the visual communication system. This article mainly uses experimental and comparative methods to analyze the feature extraction situation of the 3D imagevisual communication system, and finds that the error of the improved RANSAC algorithm in image feature extraction is about 54%, while the unimproved algorithm and other algorithms The error is greater. This shows that the improved algorithm proposed in this paper is incomparable in the 3D imagevisual communication system.
In this paper we propose an image coding scheme based on the polynomial transform and multiresolution analysis. The polynomial transform is an image representation model that mimics some properties of the human visual...
详细信息
ISBN:
(纸本)0819424358
In this paper we propose an image coding scheme based on the polynomial transform and multiresolution analysis. The polynomial transform is an image representation model that mimics some properties of the human visual system, and which we use in order to model edges in terms of their characteristic parameters. Based on the polynomial transform, we build a pyramidal hierarchical predictive scheme for image coding. The feature parameters that we encode are: local average, edge orientation, edge position and edge magnitude.
We utilize speech information to improve the quality of audio-visualcommunications such as video telephony and videoconferencing. We show that the marriage of speech analysis and imageprocessing can solve problems r...
详细信息
We utilize speech information to improve the quality of audio-visualcommunications such as video telephony and videoconferencing. We show that the marriage of speech analysis and imageprocessing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and apply it to coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.
image quality assessment (IQA) is useful in many visualprocessing systems but challenging to perform in line with the human perception. A great deal of recent research effort has been directed towards IQA. In order t...
详细信息
ISBN:
(纸本)9780819482341
image quality assessment (IQA) is useful in many visualprocessing systems but challenging to perform in line with the human perception. A great deal of recent research effort has been directed towards IQA. In order to overcome the difficulty and infeasibility of subjective tests in many situations, the aim of such effort is to assess visual quality objectively towards better alignment with the perception of the Human visual system (HVS). In this work, we review and analyze the recent progress in the areas related to IQA, as well as giving our views whenever possible. Following the recent trends, we discuss the engineering approach in more details, explore the related aspects for feature pooling, and present a case study with machine learning.
We study the complementary behaviors of external and internal examples in image restoration, and are motivated to formulate a composite dictionary design framework. The composite dictionary consists of the global part...
详细信息
ISBN:
(纸本)9781467373142
We study the complementary behaviors of external and internal examples in image restoration, and are motivated to formulate a composite dictionary design framework. The composite dictionary consists of the global part learned from external examples, and the sample-specific part learned from internal examples. The dictionary atoms in both parts are further adaptively weighted to emphasize their model statistics. Experiments demonstrate that the joint utilization of external and internal examples leads to substantial improvements, with successful applications in image denoising and super resolution.
Place recognition, also called visual localization, facilitates the autonomous navigation capabilities of the future of driverless cars. This paper proposes a new place recognition algorithm that considers the appeara...
详细信息
ISBN:
(纸本)9781728119045
Place recognition, also called visual localization, facilitates the autonomous navigation capabilities of the future of driverless cars. This paper proposes a new place recognition algorithm that considers the appearancebased methodology to localize the vehicle by utilizing visual route map, i.e. a sequence of images, or sets of features extracted from these images, that were recorded over different times and dates for the route environments. These reference sequences are accurately labeled and annotated using GPS tags or manually using odometry information. The dynamic time warping (DTW) algorithm is used to achieve image sequence alignment and find the best match for each frame from the test sequence. The proposed algorithm considered hand-crafted features like SIFT, HOG, and LDB. Experiments, using common challenging and benchmark datasets, i.e. "UQ St Lucia" and "Nordland", have been conducted, and it has been observed that the proposed technique has significantly improved the performance of well-known appearance-based descriptors SIFT, HOG, and LDB as compared to its individual performance and to some of the state-of-the-art localization and mapping methods such as ABLE (Binary-appearance Loop-closure).
High Dynamic Range (HDR) images capture the full range of luminance present in real world scenes, and unlike Low Dynamic Range (LDR) images, can simultaneously contain detailed information in the deepest of shadows an...
详细信息
ISBN:
(纸本)9780819482341
High Dynamic Range (HDR) images capture the full range of luminance present in real world scenes, and unlike Low Dynamic Range (LDR) images, can simultaneously contain detailed information in the deepest of shadows and the brightest of light sources. In order to render HDR image on LDR displayers, it is often necessary to create LDR depictions of HDR images at the cost of contrast information loss. To reduce the loss, this paper enables to render HDRI (High Dynamic Range image) with multiple low-bit images periodically. From the viewpoint of a human, the pixel value is fractural. It does not adjust the tones but can reconstruct HDR images.
Pre-processing algorithms improve on the performance of a video compression system by removing spurious noise and insignificant features from the original images. This increases compression efficiency and attenuates c...
详细信息
ISBN:
(纸本)0819439886
Pre-processing algorithms improve on the performance of a video compression system by removing spurious noise and insignificant features from the original images. This increases compression efficiency and attenuates coding artifacts. Unfortunately, determining the appropriate amount of pre-filtering is a difficult problem, as it depends on both the content of an image as well as the target bit-rate of compression algorithm. In this paper, we explore a pre-processing technique that is loosely coupled to the quantization decisions of a rate control mechanism. This technique results in a pre-processing system that operates directly on the Displaced Frame Difference (DFD) and is applicable to any standard-compatible compression system. Results explore the effect of several standard filters on the DFD. An adaptive technique is then considered.
Despite the rapid progress of deep learning research in recent years, interpreting deep network is still quite challenging. Interpreting deep networks is essential to both end-users and developers since it gives confi...
详细信息
ISBN:
(纸本)9781538662496
Despite the rapid progress of deep learning research in recent years, interpreting deep network is still quite challenging. Interpreting deep networks is essential to both end-users and developers since it gives confidence in the usage of the deep network. This paper deals with a method for interpreting deep networks, especially visual interpretation. In order to get visual interpretation from a target deep network, we propose a ProbeNet that provides a decomposed visual interpretation of the target deep network. The ProbeNet decomposes the feature representations of the point of the target deep network into human interpretable units. Furthermore, the ProbeNet provides kernel-level analysis about the target deep network. In experiments, visual interpretation of two different target deep networks showed the usefulness of the ProbeNet to interpret target deep networks.
With desktop imaging devices becoming ubiquitous, effectively managing the images in large collections has become a challenge. The requirements for a modem imaging system now demand not only efficient storage (low bit...
详细信息
ISBN:
(纸本)0819439886
With desktop imaging devices becoming ubiquitous, effectively managing the images in large collections has become a challenge. The requirements for a modem imaging system now demand not only efficient storage (low bit rate coding), but also easy manipulation, indexing and retrieval of images. In this paper, we introduce a new method for colour image coding based on a visual appearance model of local colour image patterns. The visual appearance of small image patterns is characterised by their spatial pattern, colour direction and local energy strength. To encode the local visual appearance, an approach based on vector quantisation (VQ) is introduced. A separate VQ is designed for the spatial pattern and colour direction respectively. It is shown that the method not only achieves good image coding results in terms of rate distortion criterion, it also enables content-based retrieval to be performed in the compressed domain easily and conveniently.
暂无评论