ISBN: 9783642183355 (print)
To acquire farmland information with precise positioning, this paper studies DGPS-based techniques for collecting and processing farmland information. Precision agriculture requires that information on agricultural conditions be acquired quickly, precisely, continuously, and with position data attached. To meet this demand, the system combines a DGPS receiver with a touch-screen computer, stores data in a Microsoft Access database, and provides an application program written in Visual Basic 6.0 for acquiring and processing farmland information. To meet the needs of farmland contracting, a method was also established that uses the DGPS receiver to survey and map the side lengths and area of a field; it was applied on a farm with good results. The results show that the technology is highly usable.
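The abstract does not say how side lengths and area are derived from the DGPS fixes. A minimal sketch of one common approach, assuming the field corners are recorded in order as (latitude, longitude) pairs in degrees and the field is small enough for a flat local projection; the function names and the Earth-radius constant are illustrative and not taken from the paper:

```python
import math

EARTH_RADIUS_M = 6371008.8  # mean Earth radius in metres (assumed spherical model)


def to_local_xy(points):
    """Project (lat, lon) in degrees to metres on a local plane.

    Uses an equirectangular approximation centred on the mean of the
    corner coordinates, which is adequate for field-sized polygons.
    """
    lat0 = sum(p[0] for p in points) / len(points)
    lon0 = sum(p[1] for p in points) / len(points)
    cos_lat0 = math.cos(math.radians(lat0))
    return [
        (EARTH_RADIUS_M * math.radians(lon - lon0) * cos_lat0,
         EARTH_RADIUS_M * math.radians(lat - lat0))
        for lat, lon in points
    ]


def side_lengths(points):
    """Length in metres of each side, walking the corners in order."""
    xy = to_local_xy(points)
    n = len(xy)
    return [math.dist(xy[i], xy[(i + 1) % n]) for i in range(n)]


def field_area(points):
    """Polygon area in square metres via the shoelace formula."""
    xy = to_local_xy(points)
    n = len(xy)
    twice_area = sum(
        xy[i][0] * xy[(i + 1) % n][1] - xy[(i + 1) % n][0] * xy[i][1]
        for i in range(n)
    )
    return abs(twice_area) / 2.0


# Hypothetical corner fixes: a square about 100 m on a side near the equator.
d = math.degrees(100.0 / EARTH_RADIUS_M)
corners = [(0.0, 0.0), (0.0, d), (d, d), (d, 0.0)]
print(side_lengths(corners), field_area(corners))
```

The flat projection keeps the arithmetic simple; for large parcels or survey-grade results, a proper map projection would be a safer choice.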
ISBN: 0780372727 (print)
This paper explores a human-robot mutual communication system that human users can either communicate with or use as an information terminal. In particular, we propose a deformation-based facial expression system and a robotic vision system that changes its visual attention according to the environment. First, such a system must have advanced abilities to express its intentions through facial expressions, gestures, or speech, and above all through facial expressions. We therefore reconsider the Facial Action Coding System and its action units from the standpoint of which robot expressions humans recognize easily, and propose the deformation-based expression system. Second, to realize fluent communication between humans and robots, we propose a robot vision system that changes its gaze communication according to the environment and situation, based on visual recognition. We developed an original character robot (CR) and evaluated the proposed methods. The results show that human-robot mutual communication is achievable.
Infographics, a form of presenting information that combines visuals with text, allow readers to easily understand sequential or complex information. This paper begins with a brief explanation of how the brain processes information and of the history of infographics, which have become part of the heritage of visual communication design. It then discusses how infographics in both print and digital media convey comprehensive information in a way that relates to the human information-processing system. The article concludes with a clearer picture of how infographics, through their visual approach, affect readers' understanding of complex information.
ISBN: 9781665490078 (print)
The introduction of Transformer neural networks has changed the landscape of Natural Language Processing (NLP) in recent years. These models are very complex and therefore hard to debug and explain. In this context, visual explanation has become an attractive approach. Visualizing the path that leads to certain outputs of a model is at the core of visual explanation, as it illuminates the features or parts of the model that may need to be changed to achieve the desired results. In particular, one goal of an NLP visual explanation is to highlight the parts of the text that have the greatest impact on the model output. Several visual explanation methods for NLP models have recently been proposed. A major challenge is how to compare the performance of such methods, since the usual classification accuracy measures cannot simply be used to evaluate the quality of visualizations. Good metrics and rigorous criteria are needed to measure how useful the extracted knowledge is for explaining the models. In addition, we want to visualize the differences between the knowledge extracted by different models, in order to be able to rank them. In this paper, we investigate how to evaluate explanations and visualizations resulting from machine learning models for text classification. The goal is not to improve the accuracy of a particular NLP classifier, but to assess the quality of the visualizations that explain its decisions. We describe several methods for evaluating the quality of NLP visualizations, including both automated techniques based on quantifiable measures and subjective techniques based on human judgements.
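The abstract does not name the automated measures it covers. As one illustration of a quantifiable measure of this kind, the sketch below scores an explanation by how much the model's output drops when the tokens it ranks highest are deleted (a deletion-based faithfulness check). The classifier `toy_model`, its keyword weights, and the attribution scores are hypothetical stand-ins, not taken from the paper:

```python
def toy_model(tokens):
    """Hypothetical classifier: a probability-like score for the positive class."""
    weights = {"great": 0.4, "good": 0.2, "bad": -0.4}  # assumed keyword weights
    score = 0.5 + sum(weights.get(t, 0.0) for t in tokens)
    return min(1.0, max(0.0, score))


def deletion_drop(model, tokens, attributions, k):
    """Drop in model score after deleting the k highest-attributed tokens.

    A larger drop suggests the explanation highlighted tokens the model
    actually relies on.
    """
    ranked = sorted(range(len(tokens)), key=lambda i: attributions[i], reverse=True)
    top = set(ranked[:k])
    kept = [t for i, t in enumerate(tokens) if i not in top]
    return model(tokens) - model(kept)


tokens = ["the", "movie", "was", "great"]
focused_expl = [0.0, 0.1, 0.0, 0.9]    # highlights "great"
unfocused_expl = [0.9, 0.1, 0.0, 0.0]  # highlights "the"

print(deletion_drop(toy_model, tokens, focused_expl, k=1))
print(deletion_drop(toy_model, tokens, unfocused_expl, k=1))
```

Under this measure the explanation that highlights "great" scores higher than the one that highlights "the", which gives a way to rank explanation methods without relying on classification accuracy.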
ISBN: 9781538672020 (print)
In this research we analyze which contextual variables moderate the effectiveness of knowledge visualization. Through a field experiment we compare the same content expressed in a textual format and in a multimodal format (here meaning text and icons together, organized spatially in a meaningful way). The application context is Polycystic Ovary Syndrome, which is the most common cause of infertility. We conducted the field experiment in India and found that all subjects are more engaged by the content when they read it in a visual format, but that the intention to change behavior is moderated by educational level.
ISBN: 0769521770 (print)
It is speculated that the processing of different types of information (e.g., quantitative, ordinal, or nominal data) will be affected by the type of visual display used to present that information (e.g., line graphs, shapes with varying levels of gray saturation, or shapes of different colors). People are expected to process and answer questions about visual displays more efficiently and accurately if the type of display (i.e., the representing dimension) provides neither too much nor too little information and matches the type of information (i.e., the represented dimension) being processed. In the present study we found that, in general, task performance was best when the represented and representing dimensions matched. An exception to this is discussed.
Techniques for communication over flat multi-input, multi-output (MIMO) channels are well established when either perfect channel state information or no channel state information is available at the transmitter. However, communication over channels where the transmitter has access to partial or imperfect information has received less attention. If exploited, such information could improve system performance. In this paper, we propose a simple system design scheme that approximately maximizes the data rates of MIMO communication systems where imperfect channel estimates are available at the transmitter. The algorithm is computationally attractive, and by taking the uncertainty of the channel estimates into account in the design, gains can be demonstrated over systems that do not exploit this information.
ISBN: 9789869000604 (print)
We propose an effective method to measure the capture-to-display delay (CDD) of a visual communication application. The method does not require modifications to the existing system, nor does it require the encoder and decoder clocks to be synchronized. Furthermore, we propose a solution to the problem of multiple overlapped timestamps caused by the response time of the display and the exposure time of the camera. We implemented the method in software to measure the capture-to-display delay of a cellphone video chat application over various types of networks. Experiments confirmed the effectiveness of the proposed methods.
ISBN: 9781424417513 (print)
It is well known that the speech production and perception process is inherently bimodal, consisting of audio and visual components. Recently there has been increased interest in using the visual modality in combination with the acoustic modality for improved speech processing. This field of study has come to be known as audio-visual speech processing. Lip movement recognition, also known as lip reading, is a communication skill that involves interpreting lip movements in order to estimate important parameters of the lips, including, but not limited to, size, shape, and orientation. In this paper, we present a hybrid framework for lip reading based on both audio and visual speech parameters extracted from a video stream of isolated spoken words. The proposed algorithm is self-tuned in the sense that it starts with an estimate of the speech parameters based on visual lip features, and the coefficients of the algorithm are then fine-tuned based on the extracted audio parameters. In the audio speech processing part, the extracted audio features are used to generate a vector containing information about the speech phonemes. This information is used later to enhance the recognition and matching process. For lip feature extraction, we use a modified version of the method used by F. Huang and T. Chen for tracking multiple faces. This method is based on statistical color modeling and a deformable template. Experiments based on the proposed framework showed interesting results in the recognition of isolated words.
ISBN: 9780769544762 (print)
As a hub of information controlled by the patient, personal health records (PHR) collect information from the patient's medical history, drawing on a wide variety of data sources such as the patient's observations, lab results, clinical findings, and, in the future, perhaps even personal genetic data and automatic recordings from monitoring devices. On the one hand, this development will make health care more personalized and user controlled; on the other hand, it will overload consumers with a huge amount of data. To address this issue we developed a framework for adaptive visual symbols (AVS). An AVS can adapt its appearance and level of detail during the communication process. Finally, we demonstrate the AVS principle for the visualization of personal health records.