Panoramic mosaics methods based on 8-paramter planar homography matrix have to overcome the accumulated errors, when a sequence of images loops back on itself. Usual methods are computationally intensive, and can not ...
详细信息
ISBN:
(纸本)9628576623
Panoramic mosaics methods based on 8-paramter planar homography matrix have to overcome the accumulated errors, when a sequence of images loops back on itself. Usual methods are computationally intensive, and can not ensure complete consistency of homographies. This paper presents a simple method, which do not require the consistency of homographies. The method mainly exploits un-calibrated image perspective interpolation technique [1]. So it is of simple calculation and easy realization.
Panoramic mosaics methods based on an 8-parameter planar homography matrix have to overcome the accumulated errors when a sequence of images loops back on itself. Usual methods are computationally intensive, and canno...
详细信息
ISBN:
(纸本)9628576623
Panoramic mosaics methods based on an 8-parameter planar homography matrix have to overcome the accumulated errors when a sequence of images loops back on itself. Usual methods are computationally intensive, and cannot ensure complete consistency of homographies. The paper presents a simple method which does not require the consistency of homographies. The method mainly exploits an un-calibrated image perspective interpolation technique (B. Yuan et al., 1998). It is therefore simple to calculate and easy to implement.
Food image generation holds promising application prospects in food design, advertising, and food education. However, the existing methods rely on information such as recipes, ingredients, or food names, which leads t...
详细信息
Food image generation holds promising application prospects in food design, advertising, and food education. However, the existing methods rely on information such as recipes, ingredients, or food names, which leads to generated food images with less intra-class diversity. When recipes, ingredients and food names are identical for the same food, the real-world images may vary significantly in appearance. The question of how to simultaneously ensure the quality and diversity of the generated images is a key issue. To this end, we employ pre-trained diffusion model and Transformer to propose a method for generating diverse and high-quality images of both Chinese and Western food, named CW-Food. Different from previous works that utilize an overall food feature to generate new images, CW-Food first decouples the food images to obtain common intra-class features and private instance features. Additionally, we design a Transformer-based feature fusion module to integrate the common and private features, in order to avoid the shortcomings of conventional methods. Moreover, we also utilize a pre-trained diffusion model as our backbone, which is fine-tuned using LoRA with the fused multi-variate features. Extensive experiments on four datasets demonstrate the advantages of our proposed method, producing diverse and high-quality food images encompassing both Chinese and Western cuisines. To the best of our knowledge, our work is the first attempt to generate Chinese food images using only food names.
Knowledge graphs have proven highly effective for learning representations of entities and relations, with hyper-relational knowledge graphs (HKGs) gaining increased attention due to their enhanced representation capa...
详细信息
Knowledge graphs have proven highly effective for learning representations of entities and relations, with hyper-relational knowledge graphs (HKGs) gaining increased attention due to their enhanced representation capabilities. Each fact in an HKG consists of a main triple supplemented by attribute-value qualifiers that provide additional contextual information. Due to the complexity of hyper-relations, HKGs typically contain complex geometric structures, such as hierarchical, ring, and chain structures, often mixed together. However, previous work mainly embeds HKGs into Euclidean space, limiting their ability to capture these complex geometric structures simultaneously. To address this challenge, we propose a novel model called Geometry Aware Hyper-relational Embedding (GAHE). Specifically, GAHE adopts a multi-curvature geometry-aware approach by modeling HKGs in Euclidean space (zero curvature), hyperbolic space (negative curvature), and hyperspherical space (positive curvature) in a unified framework. In this way, it can integrate space-invariant and space-specific features to accurately capture the diverse structures in HKGs. In addition, GAHE introduces a module termed hyper-relational subspace learning, which allocates multiple sub-relations for each hyper-relation. It enables the exploitation of abundant latent semantic interactions and facilitates the exploration of fine-grained semantics between attribute-value pairs and hyper-relations across multiple subspaces. Furthermore, we provide theoretical guarantees that GAHE is fully expressive and capable of modeling a wide range of semantic patterns for hyper-relations. Empirical evaluations demonstrate that GAHE achieves state-of-the-art results on both hyper-relational and binary-relational benchmarks.
In this paper, we present a novel indirect convergent Jacobi spectral collocation method for fractional optimal control problems governed by a dynamical system including both classical derivative and Caputo fractional...
The 2010 Pacific-Rim Conference on Multimedia (PCM 2010) was held in Shanghai at Fudan University, during September 21–24, 2010. Since its inauguration in 2000, PCM has been held in various places around the Pacific ...
详细信息
ISBN:
(数字)9783642157028
ISBN:
(纸本)9783642157011
The 2010 Pacific-Rim Conference on Multimedia (PCM 2010) was held in Shanghai at Fudan University, during September 21–24, 2010. Since its inauguration in 2000, PCM has been held in various places around the Pacific Rim, namely Sydney (PCM 2000), Beijing (PCM 2001), Hsinchu (PCM 2002), Singapore (PCM 2003), Tokyo (PCM 2004), Jeju (PCM 2005), Zhejiang (PCM 2006), Hong Kong (PCM 2007), Tainan (PCM 2008), and Bangkok (PCM 2009). PCM is a major annual international conference organized as a forum for the dissemination of state-of-the-art technological advances and research results in the fields of theoretical, experimental, and applied multimedia analysis and processing. PCM 2010 featured a comprehensive technical program which included 75 oral and 56 poster presentations selected from 261 submissions from Australia, Canada, China, France, Germany, Hong Kong, India, Iran, Italy, Japan, Korea, Myanmar, Norway, Singapore, Taiwan, Thailand, the UK, and the USA. Three distinguished researchers, Prof. Zhi-Hua Zhou from Nanjing University, Dr. Yong Rui from Microsoft, and Dr. Tie-Yan Liu from Microsoft Research Asia delivered three keynote talks to the conference. We are very grateful to the many people who helped to make this conference a s- cess. We would like to especially thank Hong Lu for local organization, Qi Zhang for handling the publication of the proceedings, and Cheng Jin for looking after the c- ference website and publicity. We thank Fei Wu for organizing the special session on large-scale multimedia search in the social network settings.
Personality primarily refers to the unique and stable way of a person’s thinking and behavior. A few studies have recently been conducted on personality recognition using physiological signals, most of which have use...
详细信息
Personality primarily refers to the unique and stable way of a person’s thinking and behavior. A few studies have recently been conducted on personality recognition using physiological signals, most of which have used two-dimensional (2D) emotional stimulus materials. Virtual reality (VR) has been utilized in many fields, and its superiority over 2D in emotion recognition has been proven. However, relevant research on VR scenes is lacking in the field of personality recognition. In this study, based on the psychological principle that emotional arousal can expose an individual’s personality, we attempt to explore the feasibility and effect of using electrocardiogram (ECG) signals in response to VR emotional stimuli for personality identification. For this purpose, a VR-2D emotion-induction experiment was conducted in which ECG signals were collected, and physiological datasets of emotional personalities were constructed through preprocessing and feature extraction. Statistical analysis of the emotion scale scores and ECG features of the participants showed that the VR group had a higher number of significantly correlated features. Meanwhile, VR- and 2D-based personality recognition models were constructed using machine learning algorithms. The results showed that the VR-based personality recognition model achieved better results for the four personality dimensions, with a maximum accuracy of 79.76%. These findings indicate that VR not only enhances the physiological correlation between emotion and personality but also improves the classification accuracy of personality recognition.
In this paper, we develop spectral collocation method for a class of fractional diffusion differential equations. Since the solutions of these fractional differential equations usually exhibit singularities at the end...
详细信息
The 2010 Pacific-Rim Conference on Multimedia (PCM 2010) was held in Shanghai at Fudan University, during September 21–24, 2010. Since its inauguration in 2000, PCM has been held in various places around the Pacific ...
详细信息
ISBN:
(数字)9783642156960
ISBN:
(纸本)9783642156953
The 2010 Pacific-Rim Conference on Multimedia (PCM 2010) was held in Shanghai at Fudan University, during September 21–24, 2010. Since its inauguration in 2000, PCM has been held in various places around the Pacific Rim, namely Sydney (PCM 2000), Beijing (PCM 2001), Hsinchu (PCM 2002), Singapore (PCM 2003), Tokyo (PCM 2004), Jeju (PCM 2005), Zhejiang (PCM 2006), Hong Kong (PCM 2007), Tainan (PCM 2008), and Bangkok (PCM 2009). PCM is a major annual international conference organized as a forum for the dissemination of state-of-the-art technological advances and research results in the fields of theoretical, experimental, and applied multimedia analysis and processing. PCM 2010 featured a comprehensive technical program which included 75 oral and 56 poster presentations selected from 261 submissions from Australia, Canada, China, France, Germany, Hong Kong, India, Iran, Italy, Japan, Korea, Myanmar, Norway, Singapore, Taiwan, Thailand, the UK, and the USA. Three distinguished researchers, Prof. Zhi-Hua Zhou from Nanjing University, Dr. Yong Rui from Microsoft, and Dr. Tie-Yan Liu from Microsoft Research Asia delivered three keynote talks to the conference. We are very grateful to the many people who helped to make this conference a s- cess. We would like to especially thank Hong Lu for local organization, Qi Zhang for handling the publication of the proceedings, and Cheng Jin for looking after the c- ference website and publicity. We thank Fei Wu for organizing the special session on large-scale multimedia search in the social network settings.
暂无评论