The paper examines the integration of AI in visualcommunication design, particularly focusing on the enhanced impact of images when combined with text. It explores the evolution of visualcommunication design in the ...
详细信息
The dynamic range of modern image sensors can exceed the range of a video monitor by an order of magnitude or more. This means that fast and intelligent range compression and image enhancement must be interposed betwe...
详细信息
ISBN:
(纸本)081941543X
The dynamic range of modern image sensors can exceed the range of a video monitor by an order of magnitude or more. This means that fast and intelligent range compression and image enhancement must be interposed between the sensor and display for effective visualcommunication. Digital hardware can enhance an image in real time, but the common method for range compression on digital hardware, linear filtering, can severely distort the image. Nonlinear methods must be used to prevent this distortion. In this paper, dynamic range compression of video images, at video rates, is demonstrated. An analog ASIC performs all of the necessary nonlinear image filtering at low power.
visualcommunication design in tourism products based on computer vision and complex feature mining algorithm is studied in this paper. In essential aspect, the visualcommunication should be considered. A core typica...
详细信息
Product images in market could be depend on customer opinion towards the product. The customer opinion describes the customers' needs, therefore before the product is launched to public, it could be important to h...
详细信息
ISBN:
(纸本)9781509000968
Product images in market could be depend on customer opinion towards the product. The customer opinion describes the customers' needs, therefore before the product is launched to public, it could be important to have customer observations in order to know what kind of product that customers' needs. A method to find the customers' needs is learning the costumer behaviours through their visual respons such as facial expression from the input of product's visualization. This research implements digital image processing to predict customers (as users) opinion, whether they like or dislike the product based on user's facial expression and user focus. The images of face features and skin are sparatcd using brightness segmentation and determining the keypoints for each face features. The scqueence of keypoints movement predicts user facial expression and user focus. The combination of these results gives an output for user opinion prediction.
The flutist robot WF-4RIV at Waseda University is able to play the flute at the level of an intermediate human player. So far the robot has been able to play in a statically sequenced duet with another musician, indiv...
详细信息
ISBN:
(纸本)9781424466757
The flutist robot WF-4RIV at Waseda University is able to play the flute at the level of an intermediate human player. So far the robot has been able to play in a statically sequenced duet with another musician, individually communicating only by keeping eye-contact. To extend the interactive capabilities of the flutist robot, we have in previous publications described the implementation of a Music-based Interaction System (MbIS). The purpose of this system is to combine information from the robot's visual and aural sensor input signal processing systems to enable musical communication with a partner musician. In this paper we focus on that part of the MbIS that is responsible for mapping the information from the sensor processing system to generate meaningful modulation of the musical output of the robot. We propose a two skill level approach to enable musicians of different ability levels to interact with the robot. When interacting with the flutist robot the device's physical capabilities / limitations need to be taken into account. In the beginner level interaction system the user's input to the robot is filtered in order to adjust it to the state of the robot's breathing system. The advanced level stage uses both the aural and visual sensor processinginformation. In a teaching phase the musician teaches the robot a tone sequence (by actually performing the sequence) that he relates to a certain instrument movement. In a performance phase, the musician can trigger these taught sequences by performing the according movements. Experiments to validate the functionality of the MbIS approach have been performed and the results are presented in this paper.
Noise is the primary visibility limit in the process of non-linear image enhancement, and is no longer a statistically stable additive noise in the post-enhancement image. Therefore novel approaches are needed to both...
详细信息
ISBN:
(纸本)0819453617
Noise is the primary visibility limit in the process of non-linear image enhancement, and is no longer a statistically stable additive noise in the post-enhancement image. Therefore novel approaches are needed to both assess and reduce spatially variable noise at this stage in overall image processing. Here we will examine the use of edge pattern analysis both for automatic assessment of spatially variable noise and as a foundation for new noise reduction methods.
Recently, several approaches have been designed to hide data in portable document format (PDF) files. These approaches have demonstrated their advantages in different application scenarios, including copyright verific...
详细信息
ISBN:
(纸本)9798350367331;9798350367348
Recently, several approaches have been designed to hide data in portable document format (PDF) files. These approaches have demonstrated their advantages in different application scenarios, including copyright verification, covert communication/steganography, and content forensics. However, they often suffer from visual distortion or lack universal applicability. In this work, we propose a reversible and transparent method that exploits the coding properties of text objects (i.e., substrings) in a PDF-compliant document to embed data. In particular, the position information of the substrings is adjusted to hide data, where each unique permutation of the substrings encodes a bit sequence. Subsequently, the distance of each substring from the left margin is corrected so that the processed PDF has the exact layout or appearance of the original PDF, hence completely preserving the quality of the original PDF file. In the best-case scenario, to hide one bit of data, 5.88 bits of the PDF file are required, i.e., 1 : 5.88. In addition, this method can be deployed in tandem with conventional data hiding methods to hide more data and to hide data in different ways.
In visualcommunication design (VCD), multi-source data integration plays an important role in innovation and growth. This paper studies and discusses the means of information dissemination in the new media era by ana...
详细信息
Generally, resource-awareness plays a key role in wireless sensor networks- due the limited capabilities in processing, storage and communication. In this paper we present a resource-aware cooperative state estimation...
详细信息
ISBN:
(纸本)9789897580864
Generally, resource-awareness plays a key role in wireless sensor networks- due the limited capabilities in processing, storage and communication. In this paper we present a resource-aware cooperative state estimation facilitated by a dynamic cluster-based protocol in a visual sensor network (VSN). The VSN consists of smart cameras. which process and analyze the captured data locally. We apply a state estimation algorithm to improve the tracking results of the cameras. To design a lightweight protocol, the final aggregation of the observations and state estimation are only performed by the cluster head. Our protocol is based on a market-based approach in which the cluster head is elected based-on-the available resources and a visibility parameter of the object gained by the cluster members. We show in simulations that our approach reduces the costs for state estimation and communication as compared to a fully distributed approach. As resource-awareness is the focus of the cluster-based protocol we can accept a slight degradation of the accuracy on the object's state estimation by a standard deviation of about 1.48 length units to the available ground truth.
This paper describes our design policy anti prototype data collection of RWC (Real World Computing Program) multimodal database. The database is intended for research and development on the integration of spoken langu...
详细信息
ISBN:
(纸本)0780335554
This paper describes our design policy anti prototype data collection of RWC (Real World Computing Program) multimodal database. The database is intended for research and development on the integration of spoken language and visualinformation far human computer interactions. The interactions are supposed to use image recognition, image synthesis, speech recognition, and speech synthesis. visualinformation also includes non-verbal communication such as interactions using hand gestures and facial expressions between human and a human-like CG (Computer Graphics) agent with a face and hands. Based on the experiments of interactions with these modes, specifications of the database are discussed from the viewpoint of controlling the variability and cost for the collection.
暂无评论