Bio-medical entity recognition extracts significant entities, for instance cells, proteins and genes, which is an arduous task in an automatic system that mine knowledge in bioinformatics texts. In this thesis, we uti...
详细信息
The typical representative of the hard clustering algorithm, K-means, is one of the fastest processing algorithms with good scalability. However, it cannot deal with categorical attributes, which is one of the importa...
The typical representative of the hard clustering algorithm, K-means, is one of the fastest processing algorithms with good scalability. However, it cannot deal with categorical attributes, which is one of the important indicators to measure the pros and cons. Due to the lack of processing capabilities on categorical attributes, k-means has a large limit on data processing capabilities. This paper proposes a clustering algorithm extends from K-means. This algorithm introduces the concept of a Pseudo-mean distance calculation formula and a counting-table so that categorical attributes can be processed while reducing the time cost as much as possible. Experimental results illustrate the proposed Pseudo-means can extend the processing range of k-means to category-type data, and the counting table also effectively reduces the time cost.
It is extremely important to explore the heavy metal content and spatial distribution in different cultivated soils. In our study, cokriging (COK) method was used to investigate the relationship between heavy metals a...
详细信息
Previous works on personalized recommendation mostly emphasize modeling peoples' diversity in potential fa- vorites into a uniform recommender. However, these recommenders always ignore the heterogeneity of users at ...
详细信息
Previous works on personalized recommendation mostly emphasize modeling peoples' diversity in potential fa- vorites into a uniform recommender. However, these recommenders always ignore the heterogeneity of users at an individual level. In this study, we propose an individualized recommender that can satisfy every user with a customized parameter. Experimental results on four benchmark datasets demonstrate that the individualized recommender can significantly improve the accuracy of recommendation. The work highlights the importance of the user heterogeneity in recommender design.
Hashing has become a promising technique to be applied to the large-scale visual retrieval tasks. Multi-view data has multiple views, providing more comprehensive information. Existing hashing methods measure the simi...
详细信息
Over the recent years, ontologies are widely used in the biomedical domains. However, biomedical ontology heterogeneity problem hamper the cooperation between intelligent applications based on biomedical ontologies. I...
详细信息
Crowdsensing enables a wide range of data collection, where the data are usually tagged with private locations. Protecting users' location privacy has been a central issue. The study of various location perturbati...
详细信息
ISBN:
(数字)9781728109626
ISBN:
(纸本)9781728109633
Crowdsensing enables a wide range of data collection, where the data are usually tagged with private locations. Protecting users' location privacy has been a central issue. The study of various location perturbation techniques for protecting users' location privacy has received widespread attention. Despite the huge promise and considerable attention, the location perturbation operation causes inevitable location errors, which can diminish the location quality of the crowdsensing results. Provable good algorithms that consider location quality in privacy preserving crowdsensing from optimization perspectives are still lacking in the literature. In this paper, we investigate the problem of location quality optimization in privacy preserving crowdsensing, which is to minimize the location quality desegregation, while protecting all users' location privacy. We present an optimal algorithm OLQDM for this problem. Extensive simulations demonstrate that OLQDM significantly outperforms an existing algorithm in terms of the location quality and SSE.
Recently,many researchers have concentrated on using neural networks to learn features for Distant Supervised Relation Extraction(DSRE).These approaches generally use a softmax classifier with cross-entropy loss,which...
详细信息
Recently,many researchers have concentrated on using neural networks to learn features for Distant Supervised Relation Extraction(DSRE).These approaches generally use a softmax classifier with cross-entropy loss,which inevitably brings the noise of artificial class NA into classification *** address the shortcoming,the classifier with ranking loss is employed to *** randomly selecting a relation or heuristically selecting the highest score among all incorrect relations are two common methods for generating a negative class in the ranking loss ***,the majority of the generated negative class can be easily discriminated from positive class and will contribute little towards the *** by Generative Adversarial Networks(GANs),we use a neural network as the negative class generator to assist the training of our desired model,which acts as the discriminator in *** the alternating optimization of generator and discriminator,the generator is learning to produce more and more discriminable negative classes and the discriminator has to become better as *** framework is independent of the concrete form of generator and *** this paper,we use a two layers fully-connected neural network as the generator and the Piecewise Convolutional Neural Networks(PCNNs)as the *** results show that our proposed GAN-based method is effective and performs better than state-of-the-art methods.
暂无评论