This paper proposes the proportional image enlargement using hybrid methods. Hybrid method is combinations of scaling and carving methods. This method consists of two steps. The first step enlarges the source image to...
详细信息
This paper proposes the proportional image enlargement using hybrid methods. Hybrid method is combinations of scaling and carving methods. This method consists of two steps. The first step enlarges the source image to the same size with minimum size for height or width from the target image. In this step, we use a kernel scaling method which is resulted in proportional content image size. The second step is the full size image enlargement in the width direction. The important content in the image is maintained. The image energy is used to detect the significant part in the image. We slice the image by following the minimum energy from top to bottom. The interpolation pixel is implemented among slices of the image. We use rank-ordered mean filters to reduce jagged image, especially in the interpolation pixels. The experiments show the proportional image enlargement, and the aspect ratio of an image is changed.
Query by Singing/Humming(QBSH) is to retrieve songs in the music database using user's singing or humming. Open-End Dynamic Time Warping (OEDTW) is one of the methods which are commonly used in QBSH studies. This ...
详细信息
ISBN:
(纸本)9781479932122
Query by Singing/Humming(QBSH) is to retrieve songs in the music database using user's singing or humming. Open-End Dynamic Time Warping (OEDTW) is one of the methods which are commonly used in QBSH studies. This paper proposes a method for improving OEDTW performance using optimal scaling factor taken from Linear scaling phase. The proposed method finds an optimal global scale of a query for each target song by linear scaling and key shifting of the query and target song over the obtained scale can complement DTW or OEDTW. The experimental results demonstrate that the proposed method can improve Top-1 hit rate by 32% compared with the original DTW.
A method is proposed for estimating two illuminant spectral power distributions from the highlights of overlapping illuminants on an object. It was assumed in most previous studies that specular highlights on object s...
详细信息
A method is proposed for estimating two illuminant spectral power distributions from the highlights of overlapping illuminants on an object. It was assumed in most previous studies that specular highlights on object surfaces are caused by a single light source, or by separate multiple light sources. In this paper, we assume that two highlights from different light sources are overlapped on object surfaces, and estimate the spectral power distributions of both. A multiband camera system is used for capturing spectral images of dielectric objects in a scene. First, we detect specular highlight areas from the spectral image. Then, the illuminant spectra of two light sources are simultaneously estimated based on cluster classification of particular pixel distribution in the highlight area. The feasibility of the proposed method is examined in experiments on real-world scenes.
In this paper, we studied the impact of the mismatch existing between training and testing data due to the presence of an additive noise on the performance of speaker verification system. Using a GMM-UBM system with M...
详细信息
In this paper, we studied the impact of the mismatch existing between training and testing data due to the presence of an additive noise on the performance of speaker verification system. Using a GMM-UBM system with MAP adaptation as a baseline system, front-end diversity is achieved by using MFCCs and different asymmetric MFCCs stand-alone as features or followed by PCA and LDA as dimensionality reduction techniques applied before the GMM-UBM back-end classifier. A score level fusion framework based on logistic regression is proposed to improve performance and to mitigate noise degradation. The obtained results on both clean and corrupted TIMIT database confirm the superiority of fused system in clean and noisy environment against each system alone, and the drastic degradation of the performances of PCA and LDA basedsystems in the presence of environmental noise.
Our research group has been archiving a Buddhist ceremonial procession called Nerikuyo. Nerikuyo has special features in its walking movements and actions. It is difficult to display these features using traditional p...
详细信息
Our research group has been archiving a Buddhist ceremonial procession called Nerikuyo. Nerikuyo has special features in its walking movements and actions. It is difficult to display these features using traditional panels in a museum. Our purpose is to create videos and interactive content that vividly describe this ceremony. We have archived the ceremony with super-high-detail videos and then created video contents for a special exhibition on Nerikuyo. We have also proposed a virtual fitting system that recognizes poses or gestures of users and then displays the corresponding images and sounds over the captured images of the users. All of the gestures are related to the poses or motions of Nerikuyo, and they are assigned to masks and tools. The created videos were shown at a special exhibition of the Ryukoku Museum, and the proposed system was demonstrated for three days as one of the related events of the special exhibition.
The paper presents a novel bone-conduction based brain-computer interface paradigm. Four sub-threshold acoustic frequency stimulus patterns are presented to the subjects in an oddball paradigm allowing for "aha-r...
详细信息
ISBN:
(纸本)9781479932122
The paper presents a novel bone-conduction based brain-computer interface paradigm. Four sub-threshold acoustic frequency stimulus patterns are presented to the subjects in an oddball paradigm allowing for "aha-responses" generation to the attended targets. This allows for successful implementation of the bone-conduction based brain-computer interface (BCI) paradigm. The concept is confirmed with seven subjects in online bone-conducted auditory Morse-code patterns spelling BCI paradigm. We report also brain electrophysiological signal processing and classification steps taken to achieve the successful BCI paradigm. We also present a finding of the response latency variability in a function of stimulus difficulty.
Atomic force microscopy (AFM) is one of the most advanced tools for high-resolution imaging and manipulation of nanoscale matter. Unfortunately, standard AFM imaging requires a timescale on the order of seconds to min...
详细信息
Atomic force microscopy (AFM) is one of the most advanced tools for high-resolution imaging and manipulation of nanoscale matter. Unfortunately, standard AFM imaging requires a timescale on the order of seconds to minutes to acquire an image which makes it complicated to observe dynamic processes. Moreover, it is often required to take several images before a relevant observation region is identified. In this paper we show how to significantly reduce the image acquisition time by under sampling. The reconstruction of an under sampled AFM image can be viewed as an in painting, interpolating problem, or a special case of compressed sensing. We argue that the preferred approach depends upon the type of image. Of the methods proposed for AFM, images containing high frequencies should be reconstructed using basis pursuit from data collected in a spiral pattern. images without too much high frequency content should be reconstructed using interpolation.
The filter simulates "Shinhanga," multi-colored woodblock print, which was once popular in Japan around one hundred years ago. We expect that the filter sheds light again on this fascinating art style. The f...
详细信息
ISBN:
(纸本)9781479932122
The filter simulates "Shinhanga," multi-colored woodblock print, which was once popular in Japan around one hundred years ago. We expect that the filter sheds light again on this fascinating art style. The filter follows three steps to simulate contour lines and gradation, which characterize "Shinhanga"; The filter extracts contour lines based on hue of an original image, then determines the direction of color gradient in each region either vertically or horizontally, based on the amount and the direction of variation of hue in each region. We improved the contour lines and gradation process from the previous filter to broaden its applicability and conducted an experiment to assess it. The result suggests that the filter has less dependency to the types of images comparing to our previously proposed method.
This work presents a conceptual framework for learning an ontological structure of domain knowledge, which combines Jaccard similarity coefficient with the Infinite Relational Model (IRM) by (Kemp et al. 2006) and its...
详细信息
This work presents a conceptual framework for learning an ontological structure of domain knowledge, which combines Jaccard similarity coefficient with the Infinite Relational Model (IRM) by (Kemp et al. 2006) and its extended model, i.e. the normal-Infinite Relational Model (n-IRM) by (Herlau et al. 2012). The proposed approach is applied to a dataset where legal concepts related to the Japanese educational system are defined by the Japanese authorities according to the international Standard Classification of Education (ISCED). Results indicate that the proposed approach effectively structures features for defining groups of concepts in several levels (i.e., concept, category, abstract category levels) from which an ontological structure is systematically visualized as a lattice graph based on the Formal Concept Analysis (FCA) by (Ganter and Wille 1997).
Reconstruction of an image from its LBP codes can aid understanding of the information contained within the codes by comparing the reconstructed image to the original. We are the first to show that the LBP process can...
详细信息
Reconstruction of an image from its LBP codes can aid understanding of the information contained within the codes by comparing the reconstructed image to the original. We are the first to show that the LBP process can be inverted and present a novel algorithm to perform the reconstruction, resulting in an approximation of the original image that is both visually appealing and completely matches the LBP codes of the original. The algorithm calculates the minimum contrast between two pixels, reconstructing some of the contrast information thought lost in the LBP process. Tests on the algorithm have been conducted on images from the Brodatz database and Berkeley Segmentation Dataset which show an image visually similar to the original with perfect texture reconstruction. The reconstructed images also remove the effects of illumination from the images, suggesting future investigation into the possibility of image brightness normalisation. Additionally, since the reconstructed image provides the same LBP codes as the original, the susceptibility to spoofing of systems using LBP feature vectors has been identified.
暂无评论