The registration rate and fusion efficiency of traditional automated visualcommunicationvisual fusion system are low in the application process, which seriously affects the application effect of the system, so this ...
详细信息
Selective source coding is an essential part of very low bit rate (VLBR) image/video compression where a significant irrelevancy reduction has to be performed. In this paper, this reduction is described in the context...
详细信息
Recently Computer vision and Natural language processing paradigm contains enormous research progress in their respective areas. Despite the progress in both areas, still it remains as a challenging task for machines ...
详细信息
ISBN:
(纸本)9781538678084
Recently Computer vision and Natural language processing paradigm contains enormous research progress in their respective areas. Despite the progress in both areas, still it remains as a challenging task for machines to extract image semantics and then communicate this extracted information with the desired users. These problems will be solved by visual Question Answering (VQA) system by connecting both computer vision and natural language processing paradigms. In VQA, system is presented with an image and textual question related to that image. The system will generate the answer by processing on both image and textual features. Answer generated by VQA is in one word, phrase or in sentence. Various datasets are available for training and evaluating VQA system which contains real or abstract images and question-answer pairs related to the semantics available in the image. VQA is being used in many areas such as for blind and visually impaired users, robotics, art gallery and many more areas. This paper discusses VQA techniques, VQA datasets and highlights the parametric evaluation of these techniques along with generic issues in VQA system.
This paper describes the development and study of temporal RVTI-grammar, which allows to take into account time parameters in the process of document flow analysis. Methods of neutralization of the revealed errors, tr...
详细信息
ISBN:
(纸本)9781538664681
This paper describes the development and study of temporal RVTI-grammar, which allows to take into account time parameters in the process of document flow analysis. Methods of neutralization of the revealed errors, translation of conceptual schematic models of the automated systems presented in widely used visual languages into the diagram models based on formal languages are considered.
In the rapidly developing digital information age, the progress of science and technology, and the rapid development of the Internet, when people's eyes turn from the paper to the screen, the original passive, lin...
详细信息
This paper is about the idea proposal and design of the smart glass function as an auxiliary engineering device for low vision. Low visionary people try to see objects or surrounding situations in the field of view as...
详细信息
ISBN:
(纸本)9781665409346
This paper is about the idea proposal and design of the smart glass function as an auxiliary engineering device for low vision. Low visionary people try to see objects or surrounding situations in the field of view as much as possible by utilizing residual vision. To this end, analog assistive devices are used, but it is an inconvenient environment in which various devices have to be changed and used according to the object or situation to be viewed. The proposed auxiliary engineering technology is a smart glass system equipped with a camera module and an ultra-small display, which uses it to provide various visual assistance functions such as visual enlargement, image contrast change, and color weakness assistance to low vision. The proposed system is still in its early stages of research, and it is sought to conduct continuous research to assist the insufficient vision of many low visioners by attempting to miniaturize and lower the unit cost of equipment configuration for smooth portability.
Traditional traffic visualcommunication design is mainly based on static images, but with the complexity of the traffic system, especially the types of vehicles, road network composition and other continuous diversif...
详细信息
Gesture recognition is one of challenging image processing. In this paper, a method of gesture segmentation is proposed, which is based on fusion of multi-information from multiple neural networks inspired by the huma...
详细信息
ISBN:
(纸本)9781479917976
Gesture recognition is one of challenging image processing. In this paper, a method of gesture segmentation is proposed, which is based on fusion of multi-information from multiple neural networks inspired by the human visual system. In this method, the gesture region is segmented from the video image sequence, innovatively using integration of the outputs from two kinds of spiking neural networks. The structures and the properties of the two networks are detailed in this paper. Based on the integrated outputs, the features of distance distribution histograms and outline moments are extracted and fused to form the mixed features. Finally, gestures are classified by the multi-class Support Vector Machine. Experimental results show that the proposed algorithm works efficiently and can perform gesture segmentation and gesture recognition with the satisfying accuracy for dynamic visual image sequence under complex background. It is promising to apply this approach to video processing domain and robotic visual systems.
Aiming at the problem that the adaptive convergence of noise iteration performance is not fast enough in the traditional spatial scene visualinformationcommunication method, a method for accurately transmitting spat...
详细信息
Real time image processing systems are very complex real time systems that challenge the limitations of hardware resources (processing, memory, bandwidth). Typically, real time image processing systems are implemented...
详细信息
ISBN:
(纸本)0819431907
Real time image processing systems are very complex real time systems that challenge the limitations of hardware resources (processing, memory, bandwidth). Typically, real time image processing systems are implemented on pipeline or multi processor architectures. For multi processor architectures we distinguish shared memory and/or inter processor communication links for the communication. In the following paper we present a system design for real time image processing systems on multi processor architectures using inter processor communication links for the communication. The paper focuses on the specific design issues important for real time image processing systems. A detailed overview over the complete design is given by presenting the following topics: hardware topology, programming model, inter processor communication, image processing infrastructure and image processing. The system design is illustrated using the ground to ground Automatic Target Detection and Tracking (ATDT) system developed by Computing Devices Canada for its Fire-Control and Surveillance products.
暂无评论