Multi-view learning is a very useful classification technique when multiple, conditionally independent feature sets are available in a dataset. In this paper multi-view learning is used to classify sequences of protei...
详细信息
ISBN:
(纸本)9783319419206;9783319419190
Multi-view learning is a very useful classification technique when multiple, conditionally independent feature sets are available in a dataset. In this paper multi-view learning is used to classify sequences of protein crystallization images that were obtained over a period of time, varying between a few hours to a few months. We introduce the use of the difference image features, along with the original image features, as a second feature set in classifying x-ray crystallography images, after arranging the images according to the timeline of an experiment. Usage of multi-view learning is proposed after carrying out experiments to determine the features that should be used in each view to increase classification accuracy. Random forests are used as the classifier in each view, as preliminary experiments have suggested that it provides higher classification accuracy in crystallography datasets. Accuracy of 97.2% was obtained using multi-view learning based on original and difference features, which is the highest obtained so far in the classification of protein crystallography images.
The purpose of this study, by using an artificial intelligent approaches, is to compare a correlation between geophysical and geotechnical parameters. The input variables for this system are the electrical resistivity...
详细信息
ISBN:
(纸本)9783319419206;9783319419190
The purpose of this study, by using an artificial intelligent approaches, is to compare a correlation between geophysical and geotechnical parameters. The input variables for this system are the electrical resistivity reading, the water content laboratory measurements. The output variable is water content of soils. In this study, our data sets are clustered into 120 training sets and 28 testing sets for constructing the fuzzy system and validating the ability of system prediction, respectively. Relationships between soil water content and electrical parameters were obtained by curvilinear models. The ranges of our samples are changed between 1 - 50 ohm. m (for resistivity) and 20 - 60 (%, for water content). An artificial intelligent system (artificial neural networks, Fuzzy logic applications, Mamdani and Sugeno approaches) are based on some comparisons about correlation between electrical resistivity and soil-water content, for Istanbul and Golcuk Soils in Turkey.
The proceedings contain 11 papers. The topics discussed include: representing and linking music performance data with score information;MORTY: a toolbox for mode recognition and tonic identification;in collaboration w...
ISBN:
(纸本)9781450347518
The proceedings contain 11 papers. The topics discussed include: representing and linking music performance data with score information;MORTY: a toolbox for mode recognition and tonic identification;in collaboration with in concert: reflecting a digital library as linked data for performance ephemera;a standard format proposal for hierarchical analyses and representations;approaches to handwritten conductor annotation extraction in musical scores;document analysis for music scores via machinelearning;exploring JDISC: some preliminary analyses;digitizing musical scores: challenges and opportunities for libraries;and data generation and multimodal analysis for archival recorded operatic performance.
The goal of this study is to develop a method that is capable of inferring geo-locations for non-representative data. In order to protect privacy of surveyed individuals, most data collectors release coarse geo-inform...
详细信息
ISBN:
(数字)9783319419206
ISBN:
(纸本)9783319419206;9783319419190
The goal of this study is to develop a method that is capable of inferring geo-locations for non-representative data. In order to protect privacy of surveyed individuals, most data collectors release coarse geo-information (e.g., tract), rather than detailed geo-information (e.g., street, apt number) when sharing surveyed data. Without the exact locations, many point-based analyses cannot be performed. While several scholars have developed new methods to address this issue, little attention has been paid to how to correct this issue when data are not representative. To fill this knowledge gap, we propose a bias correction method that adjusts for the bias using a bias factor approach. Applying our method to an empirical data set with a known bias associated with gender, we found that our method could generate reliable results despite the non-representativeness of the sample.
We present a solution named LiveDoc, which augments natural language text documents with relevant contextual background information. This background information helps readers to understand the context of the discourse...
详细信息
ISBN:
(纸本)9783319419206;9783319419190
We present a solution named LiveDoc, which augments natural language text documents with relevant contextual background information. This background information helps readers to understand the context of the discourse better by fetching relevant information from other sources such as Wikipedia. Often the readers do not possess all background and supplementary information required for comprehending the purport of a narrative such as a news op-ed article. At the same time, it is not possible for authors to provide all contextual information while addressing a particular topic. LiveDoc processes the information in a document;uses extracted entities to fetch relevant background information in the context of the document from various sources (as defined by user) using semantic matching and topic modeling techniques like Latent Dirichlet Allocation and Hierarchical Dirichlet Process;and presents the background information to the user by augmenting the original document with the fetched information. Reader is then equipped better to understand the document with this additional background information. We present the effectiveness of our solution through extensive experimentation and associated results.
In the era of mobile Internet, a vast amount of geo-spatial data allows us to gain further insights into human activities, which is critical for Internet Services Providers (ISP) to provide better personalized service...
详细信息
ISBN:
(纸本)9781450343091
In the era of mobile Internet, a vast amount of geo-spatial data allows us to gain further insights into human activities, which is critical for Internet Services Providers (ISP) to provide better personalized services. With the pervasiveness of mobile Internet, much evidence show that human mobility has heavy impact on app usage behavior. In this paper, we propose a method based on machinelearning to predict users' app usage behavior using several features of human mobility extracted from geo-spatial data in mobile Internet traces. The core idea of our method is selecting a set of mobility attributes (e.g. location, travel pattern, and mobility indicators) that have large impact on app usage behavior and inputting them into a classification model. We evaluate our method using real-world network traffic collected by our self-developed high-speed Traffic Monitoring System (TMS). Our prediction method achieves 90.3% accuracy in our experiment, which verifies the strong correlation between human mobility and app usage behavior. Our experimental results uncover a big potential of geo-spatial data extracted from mobile Internet.
Exploratory analysis of ubiquitous data and social media includes resources created by humans as well as those generated by sensor devices. This paper reviews recent advances concerning according approaches and method...
详细信息
This paper proposes a novel methodology for discovering interestingness hotspots in spatial datasets using a graph-based algorithm. We define interestingness hotspots as contiguous regions in space which are interesti...
详细信息
ISBN:
(纸本)9783319419206;9783319419190
This paper proposes a novel methodology for discovering interestingness hotspots in spatial datasets using a graph-based algorithm. We define interestingness hotspots as contiguous regions in space which are interesting based on a domain expert's notion of interestingness captured by an interestingness function. In our recent work, we proposed a computational framework which discovers interestingness hotspots in gridded datasets using a 3-step approach which consists of seeding, hotspot growing and post-processing steps. In this work, we extend our framework to discover hotspots in any given spatial dataset. We propose a methodology which firstly creates a neighborhood graph for the given dataset and then identifies seed regions in the graph using the interestingness measure. Next, we grow interestingness hotspots from seed regions by adding neighboring nodes, maximizing the given interestingness function. Finally after all interestingness hotspots are identified, we create a polygon model for each hotspot using an approach that uses Voronoi tessellations and the convex hull of the objects belonging to the hotspot. The proposed methodology is evaluated in a case study for a 2-dimensional earthquake dataset in which we find interestingness hotspots based on variance and correlation interestingness functions.
The proceedings contain 812 papers. The topics discussed include: power consumption analysis across heterogeneous data center using cloudsim;underlying text independent speaker recognition;automatic ration distributio...
ISBN:
(纸本)9789380544199
The proceedings contain 812 papers. The topics discussed include: power consumption analysis across heterogeneous data center using cloudsim;underlying text independent speaker recognition;automatic ration distribution system-a review;evaluation for POS tagger, chunk and resolving issues in word sense disambiguate in machine translation for Hindi to English languages;virtual calling number for ESME;traffic signal preemption (TSP) system by ordinary vehicles in case of emergency based on Internet of Things ecosystem;ICT based communication systems as enabler for technology transfer;adaptability in constant modulus algorithm and optimization for smart antenna systems;design of gain enhanced stacked rectangular dielectric resonator antenna for C-band applications;design of rectangular dielectric resonator antenna using offset micro-strip feed for satellite application;identification of parameters of digital IIR filters using teaching-learning optimization algorithm and statistical inference comparison with particle swarm optimization algorithms;proximal privacy preserving linear classification of horizontally and checkerboard partitioned data;beyond m-score for detection of misusability by malicious insiders;effect of population and bit size on optimization of function by genetic algorithm;and design and advances of cylindrical dielectric resonator antenna - a review.
Beta-Liouville mixture models have achieved measurable success in many computer vision and patternrecognition applications. In this paper, we develop a novel algorithm to learn this particular kind of models that hav...
详细信息
ISBN:
(纸本)9781509021888
Beta-Liouville mixture models have achieved measurable success in many computer vision and patternrecognition applications. In this paper, we develop a novel algorithm to learn this particular kind of models that have been shown to be very efficient for the clustering of proportional data. Our algorithm is based on an accelerated version of the variational Bayes approach. Experiments show that the developed algorithm work very well for the categorization of 3D shapes.
暂无评论