ISBN (digital): 9781728152448
ISBN (print): 9781728152455
With the rapid development of binocular stereo vision technology, stereoscopic 3D (S3D) images and videos have come into wide use. However, little research has examined how image content with a given disparity affects the detailed perception of visual comfort and stereoscopic depth. Because texture is an important kind of visual information in an image, we designed computer-generated S3D images for subjective evaluation, containing two kinds of parallax and built from three types of texture distribution and six regular textures. The influence of texture features on visual comfort, depth perception, and the region of interest was studied through analysis of the resulting data. For comparison with the disparity content, 2D images with the same textures were also designed and rated. The experimental results show that texture is an important factor affecting comfort perception, but that it is an important factor affecting depth perception only when both the subject and the background are textured. Differences in texture distribution between the main subject and the background change the region of interest and the evaluation results. Overall, the larger the textured area, the lower the comfort rating and the higher the perceived depth. These findings help establish how S3D images with particular features are perceived and offer guidance for multidimensional content production and display, which has practical value for the development of S3D image and video technology.
ISBN (digital): 9781728181387
ISBN (print): 9781728181394
Composition harmony refers to the perception of an image's components as a harmonious, unified whole, and it plays a key role in assessing image aesthetic quality. This paper proposes a composition harmony assessment method based on the rule of thirds. An eye-tracking dataset was used to extract the salient regions of each image, and the method scores composition harmony by computing how close the centroid of the largest salient region lies to the positions given by the rule of thirds. The effectiveness of the method was then analyzed against the results of a subjective evaluation experiment on composition harmony. The experimental results show that the proposed method is effective.
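The scoring idea in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a binary saliency mask as input, 4-connected regions, the four rule-of-thirds intersection points as targets, and a score normalized by the image diagonal. The function names `largest_region_centroid` and `thirds_score` are hypothetical.

```python
from collections import deque
import math


def largest_region_centroid(mask):
    """Return the (row, col) centroid of the largest 4-connected region
    of truthy cells in a 2D list, or None if the mask is empty."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    best = []
    for r in range(h):
        for c in range(w):
            if not mask[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill to collect one connected region.
            region, queue = [], deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                region.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if len(region) > len(best):
                best = region
    if not best:
        return None
    n = len(best)
    return (sum(y for y, _ in best) / n, sum(x for _, x in best) / n)


def thirds_score(mask):
    """Score in [0, 1]: 1.0 when the centroid sits on a rule-of-thirds
    intersection, lower as it moves away (assumed normalization)."""
    h, w = len(mask), len(mask[0])
    centroid = largest_region_centroid(mask)
    if centroid is None:
        return 0.0
    cy, cx = centroid
    # The four intersections of the third lines, in pixel-index coordinates.
    points = [((h - 1) * a / 3, (w - 1) * b / 3) for a in (1, 2) for b in (1, 2)]
    nearest = min(math.hypot(cy - py, cx - px) for py, px in points)
    diagonal = math.hypot(h - 1, w - 1)
    return 1.0 - nearest / diagonal if diagonal > 0 else 1.0
```

In practice the mask would come from thresholding a fixation-density map derived from the eye-tracking data. The threshold and the choice of intersection points rather than third lines are design decisions the abstract does not specify.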
ISBN (digital): 9781728181387
ISBN (print): 9781728181394
Color harmony is an important factor in the aesthetic appeal of images and plays an important role in visual perception. Image color harmony can be evaluated by building a computational model of human aesthetic judgment. In this paper, a color harmony evaluation method based on color complexity is implemented. Because the classical color harmony model suits simple color combinations but not complex images, and because grid sampling produces redundant local features, adaptive sampling based on a color complexity measure (CCM) is used. K-means is then applied to extract local color harmony features. Finally, a support vector machine (SVM) model relates the local color harmony features to the overall color harmony. The results show that the method represents the color harmony of complex images accurately and effectively.
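As a rough illustration of the K-means step only, the Python sketch below clusters the pixel colors of one sampled patch into a few representative colors. The abstract does not say what is clustered or how the CCM is defined, so the input (RGB tuples from a patch), the deterministic initialization, and the function name `kmeans_colors` are all assumptions.

```python
def kmeans_colors(pixels, k, max_iters=20):
    """Cluster color tuples into k centers with plain Lloyd iterations.

    Assumption: centers are initialized from the first k pixels so the
    result is deterministic; a real pipeline would likely use a better
    initialization.
    """
    if k <= 0 or len(pixels) < k:
        raise ValueError("need at least k pixels and k > 0")
    centers = [tuple(float(v) for v in p) for p in pixels[:k]]
    for _ in range(max_iters):
        # Assignment step: attach each pixel to its nearest center.
        groups = [[] for _ in range(k)]
        for p in pixels:
            nearest = min(
                range(k),
                key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centers[i])),
            )
            groups[nearest].append(p)
        # Update step: move each center to the mean of its group.
        new_centers = [
            tuple(sum(channel) / len(group) for channel in zip(*group))
            if group else centers[i]
            for i, group in enumerate(groups)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers
```

The resulting centers could then be turned into a feature vector per patch and passed to a regression or classification model, which is the role the abstract assigns to the SVM.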
ISBN (digital): 9798350380347
ISBN (print): 9798350380354
With the acceleration of modernization and the rapid development of popular music, the rich culture of the Guqin, a traditional Chinese musical instrument which has lasted for nearly 1,000 years, is facing a serious inheritance crisis. How to make full use of modern technology to digitally store and manage these valuable traditional music resources is therefore the current ***. In response to the aforementioned challenges, this study focuses on the Guqin, an emblematic traditional Chinese musical instrument. Using advanced audio feature extraction techniques, it systematically analyzes the diverse playing techniques of the Guqin and establishes a comprehensive acoustic dataset. This work facilitates the digital preservation and management of the Guqin's musical heritage and also proposes a standardized framework for creating acoustic datasets. The framework can serve as a benchmark for developing similar datasets, promoting the conservation and standardization of resources associated with traditional Chinese musical instruments. It also provides a scientific basis for the inheritance, standardized management, and construction of further acoustic datasets of ethnic musical instruments.
ISBN (digital): 9798350380347
ISBN (print): 9798350380354
The MossFormer model excels at speech separation but has not yet been applied effectively to music source separation. Music sources have complex characteristics and higher sampling rates, which makes separation more challenging. We address the rarely explored task of separating piano concerto recordings into individual piano and orchestral tracks. The close coordination between piano and orchestra produces audio signals that are highly complex in both the time and frequency domains. Our main contributions are: (1) adapting the speech separation model to the novel task of piano concerto source separation, including constructing and processing a specialized dataset; and (2) introducing channel attention in the separation module to dynamically adjust feature focus according to instrument characteristics, enhancing key features. Experiments on the Piano Concerto Dataset (PCD) show improved separation performance, with an average Signal-to-Distortion Ratio (SDR) gain of 0.22 dB over the baseline model.
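To make the reported metric concrete, the Python sketch below computes a basic SDR in decibels and the average gain of one model over another. It assumes the plain energy-ratio definition, 10 log10(||s||^2 / ||s - s_hat||^2). The abstract does not state which SDR variant the authors used, and the function names `sdr_db` and `mean_sdr_gain` are hypothetical.

```python
import math


def sdr_db(reference, estimate, eps=1e-12):
    """Signal-to-Distortion Ratio in dB between a reference track and an
    estimated track, using the plain energy-ratio definition (assumed)."""
    if len(reference) != len(estimate):
        raise ValueError("reference and estimate must have the same length")
    signal_energy = sum(s * s for s in reference)
    error_energy = sum((s - e) ** 2 for s, e in zip(reference, estimate))
    # eps guards against division by zero and log of zero.
    return 10.0 * math.log10((signal_energy + eps) / (error_energy + eps))


def mean_sdr_gain(references, baseline_estimates, model_estimates):
    """Average per-track SDR improvement of a model over a baseline."""
    gains = [
        sdr_db(ref, model) - sdr_db(ref, base)
        for ref, base, model in zip(references, baseline_estimates, model_estimates)
    ]
    return sum(gains) / len(gains)
```

For example, an estimate whose residual carries 1% of the reference energy scores about 20 dB. A gain such as the 0.22 dB reported above would be this per-track difference averaged over the test set, under the stated assumption about the metric.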
The oral storytelling culture of ethnic folklore is an important component of traditional Chinese culture, and methods for evaluating the timbre of this type of audio after digital capture are therefore important to study. This paper uses subjective timbre evaluation experiments to explore the perceptual characteristics of oral story audio signals, and extracts objective acoustic parameters of timbre based on the principles of human vocal production, auditory perceptual characteristics, and time-domain speech features. It then investigates objective timbre evaluation methods based on support vector regression (SVR), random forest regression (RFR), convolutional neural networks (CNN), and long short-term memory networks (LSTM). The results show that feature extraction combined with non-linear algorithms performs well in evaluating the timbre of oral stories.
ISBN (print): 9781510812055
Visual comfort is one of the important indexes for evaluating image quality and viewing experience when watching stereoscopic video and images. This paper investigates the effect of the main part size and the disparity distribution type of stereoscopic images on visual comfort through two subjective evaluation experiments. The experimental results confirm that the main part size and the spatial distribution type of disparity are two important factors affecting visual comfort, and they show how the influence of these factors on visual comfort trends under different disparity conditions. These experiments offer guidance for establishing and improving objective evaluation models and for the acquisition and display technology of stereoscopic images.
To protect the acoustic environment of auditoriums in performance venues, this paper focuses on a method for measuring the noise emitted by stage machinery. For equipment that is noisy and frequently used during performances, the measurement equipment, environment and conditions, measurement positions, evaluation parameters, test objects, and operating requirements are studied and discussed. In particular, suitable test points and load requirements are analyzed through simulation in the EASE software and noise measurements in a laboratory environment. The research aims to lay the groundwork for a draft standard on stage machinery noise, which has application value for improving audio-visual effects and controlling noise pollution in modern theatres.