Clothing parsing has been actively studied in the vision community in recent years. Inspired by the color coherence for clothing and the self-attention mechanism, this paper proposes a Triple Attention Network (TANet)...
详细信息
ISBN:
(纸本)9783030638290;9783030638306
Clothing parsing has been actively studied in the vision community in recent years. Inspired by the color coherence for clothing and the self-attention mechanism, this paper proposes a Triple Attention Network (TANet) equipped with a color attention module, a position attention module and a channel attention module, to facilitate fine-grained segmentation of clothing images. Concretely, the color attention module is introduced for harvesting color coherence, which selectively aggregates the color feature of clothing. The position attention module and the channel attention module are designed to emphasize the semantic interdependencies in spatial and channel dimensions respectively. The outputs of the three attention modules are incorporated to further improve feature representation which contributes to more precise clothing parsing results. The proposed TANet has achieved 69.54% mIoU - a promising clothing parsing performance on ModaNet, the latest large-scale clothing parsing dataset. Especially, the color attention module is also demonstrated to bring semantic consistency and precision improvement obviously. The source code is made available in the public domain.
is the first text dedicated to the fusion techniques for such a huge volume of data consisting of a very large number of images. This monograph brings out recent advances in the research in the area of visualization o...
详细信息
ISBN:
(数字)9781461474708
ISBN:
(纸本)9781461474692;9781489993755
is the first text dedicated to the fusion techniques for such a huge volume of data consisting of a very large number of images. This monograph brings out recent advances in the research in the area of visualization of hyperspectral data. It provides a set of pixel-based fusion techniques, each of which is based on a different framework and has its own advantages and disadvantages. The techniques are presented with complete details so that practitioners can easily implement them.
It is also demonstrated how one can select only a few specific bands to speed up the process of fusion by exploiting spatial correlation within successive bands of the hyperspectral data. While the techniques for fusion of hyperspectral images are being developed, it is also important to establish a framework for objective assessment of such techniques. This monograph has a dedicated chapter describing various fusion performance measures that are applicable to hyperspectral image fusion. This monograph also presents a notion of consistency of a fusion technique which can be used to verify the suitability and applicability of a technique for fusion of a very large number of images.
This book will be a highly useful resource to the students, researchers, academicians and practitioners in the specific area of hyperspectral image fusion, as well as generic image fusion.
CMOS based cameras are present nowadays for many imaging applications, ranging from any consumer, industrial or artistic use, they will also be introduced into the simulation engines applied to any raytracing technolo...
详细信息
Selecting cosmetics requires visual information and often benefits from the assessments of a cosmetics expert. In this paper we present a unique mobile imaging application that enables women to use their cell phones t...
详细信息
ISBN:
(纸本)9780819479358
Selecting cosmetics requires visual information and often benefits from the assessments of a cosmetics expert. In this paper we present a unique mobile imaging application that enables women to use their cell phones to get immediate expert advice when selecting personal cosmetic products. We derive the visual information from analysis of camera phone images, and provide the judgment of the cosmetics specialist through use of an expert system. The result is a new paradigm for mobile interactions-image-based information services exploiting the ubiquity of camera phones. The application is designed to work with any handset over any cellular carrier using commonly available MMS and SMS features. Targeted at the unsophisticated consumer, it must be quick and easy to use, not requiring download capabilities or preplanning. Thus, all application processing occurs in the back-end system and not on the handset itself. We present the imaging pipeline technology and a comparison of the services' accuracy with respect to human experts.
This book gives a start-to-finish overview of the whole Fish4Knowledge project, in 18 short chapters, each describing one aspect of the project. The Fish4Knowledge project explored the possibilities of big video data,...
详细信息
ISBN:
(数字)9783319302089
ISBN:
(纸本)9783319302065;9783319807508
This book gives a start-to-finish overview of the whole Fish4Knowledge project, in 18 short chapters, each describing one aspect of the project. The Fish4Knowledge project explored the possibilities of big video data, in this case from undersea video. Recording and analyzing 90 thousand hours of video from ten camera locations, the project gives a 3 year view of fish abundance in several tropical coral reefs off the coast of Taiwan. The research system built a remote recording network, over 100 Tb of storage, supercomputerprocessing, video target detection and tracking, fish species recognition and analysis, a large SQL database to record the results and an efficient retrieval mechanism. Novel user interface mechanisms were developed to provide easy access for marine ecologists, who wanted to explore the dataset. The book is a useful resource for system builders, as it gives an overview of the many new methods that were created to build the Fish4Knowledge system in a manner that also allows readers to see how all the components fit together.
Solar energy is the most readily available and the cheapest form of energy available. In a data-driven world of today, the data analysis tools combined with machine learning algorithms and sensors can be used to analy...
详细信息
ISBN:
(纸本)9781450363532
Solar energy is the most readily available and the cheapest form of energy available. In a data-driven world of today, the data analysis tools combined with machine learning algorithms and sensors can be used to analyze and crunch data to solve various problems plaguing the solar industry. Designing a system that recognizes the accurate positions for installing the PV modules on a rooftop, predict the power production capacity of the solar power plant and also manage the maintenance cycles, will not only help in minimizing the losses due to installation at improper places but also will increase the efficiency of the plant and reduce the cost of electricity production as a whole. This paper aims at using machine learning algorithms and image analysis tools to find the proper areas wherein the PV modules can be installed so as to maximize the production and at the same time predicting and visualizing the maintenance cycles of the power plant. The paper also aims to provide relevant data points for prediction of weather and power, which will be tabulated in a user-friendly way for further analytics.
Human activity understanding is an important research problem due to its relevance to a wide range of applications. Recently, 3D skeleton-based activity analysis becomes popular due to its succinctness, robustness, an...
详细信息
Human activity understanding is an important research problem due to its relevance to a wide range of applications. Recently, 3D skeleton-based activity analysis becomes popular due to its succinctness, robustness, and view-invariant representation. In this thesis, we focus on human activity understanding in 3D skeleton sequences. Recent works attempted to utilize recurrent neural networks (RNNs) and long short-term memory (LSTM) networks to model the temporal dependencies between the 3D positional configurations of human body joints for better analysis of human activities in the 3D skeletal data. As the first work of this thesis, we apply recurrent analysis to spatial domain as well as temporal domain to better analyze the hidden sources of action-related information within the human skeleton sequences in both of these domains simultaneously. Based on the pictorial structure of Kinect's skeletal data, an effective tree-structure based traversal framework is also proposed. In order to deal with the noise in the skeletal data, a new gating mechanism within LSTM module is introduced, with which the network can learn the reliability of the sequential data and accordingly adjust the effect of the input data on the updating procedure of the long-term context representation stored in the unit's memory cell. The comprehensive experimental results on seven challenging benchmark datasets for human action recognition demonstrate the effectiveness of the proposed method. In skeleton-based action recognition, not all skeletal joints are informative for activity analysis, and the irrelevant joints often bring noise which can degrade the performance. Therefore, we need to pay more attention to the informative ones. However, the original LSTM network does not have explicit attention ability. In our second piece of work, we propose a new class of LSTM network, global context-aware attention LSTM, for skeleton-based action recognition, which is capable of selectively focusing on the
A HRI study with 31 expert robot operators established that an external viewpoint from an assisting robot could increase teleoperation performance by 14% to 58% while reducing human error by 87% to 100% This video ill...
详细信息
ISBN:
(纸本)9781450382908
A HRI study with 31 expert robot operators established that an external viewpoint from an assisting robot could increase teleoperation performance by 14% to 58% while reducing human error by 87% to 100% This video illustrates those findings with a side-by-side comparison of the best and worst viewpoints for the passability and traversability affordances. The passability scenario uses a small unmanned aerial system as a visual assistant that can reach any viewpoint on the idealized hemisphere surrounding the task action. The traversability scenario uses a small ground robot that is restricted to a subset of viewpoints that are reachable.
ThesearetheproceedingsoftheRoboCup2004Symposium,heldattheInstituto Superior T´ ecnico, in Lisbon, Portugal in conjunction with the RoboCup c- petition. The papers presented here document the many innovations in r...
详细信息
ISBN:
(数字)9783540322566
ISBN:
(纸本)9783540250463
ThesearetheproceedingsoftheRoboCup2004Symposium,heldattheInstituto Superior T´ ecnico, in Lisbon, Portugal in conjunction with the RoboCup c- petition. The papers presented here document the many innovations in robotics that result from RoboCup. A problem in any branch of science or engineering is how to devise tests that can provide objective comparisons between alt- native methods. In recent years, competitive engineering challenges have been established to motivate researchers to tackle di?cult problems while providing a framework for the comparison of results. RoboCup was one of the ?rst such competitions and has been a model for the organization of challenges foll- ing sound scienti?c principles. In addition to the competition, the associated symposium provides a forum for researchers to present refereed papers. But, for RoboCup, the symposium has the greater goal of encouraging the exchange of ideas between teams so that the competition, as a whole, progresses from year to year and strengthens its contribution to robotics. One hundred and eighteen papers were submitted to the Symposium. Each paper was reviewed by at least two international referees; 30 papers were - cepted for presentation at the Symposium as full papers and a further 38 were accepted for poster presentation. The quality of the Symposium could not be maintained without the support of the authors and the generous assistance of the referees.
暂无评论