Topic models, such as Latent Dirichlet Allocation (LDA), are successful in learning hidden topics and has been widely applied in text mining. There are many recently developed augmented topic modeling methods to utili...
详细信息
ISBN:
(数字)9781728181561
ISBN:
(纸本)9781728181578
Topic models, such as Latent Dirichlet Allocation (LDA), are successful in learning hidden topics and has been widely applied in text mining. There are many recently developed augmented topic modeling methods to utilize metadata information. However, the effect of topic models is still not comparable to humans. We think one key point is that humans have background knowledge, which is essential for topic understanding. Inspired by this, we propose a knowledge base enhanced topic model in this paper. We take knowledge bases as good presentations of human knowledge, with huge collections of entities and their relations. We assume that documents with related entities tend to have similar topic distributions. Based on this assumption, we compute document similarity information via the linked entities and then use it as a constraint for LDA. More specifically, we embed entities in a low-dimensional space via DeepWalk and use Entity Movers Distance to efficiently and effectively measure the similarities between documents. The results of experiments over two real-world datasets show that our method boosts the LDA model on the document classification while no supervision information is needed.
The neural network method is one of the most important methods in the field of speech recognition. In this paper, we propose a new speech recognition method, probabilistic neural network (PNN) ensembles, where the Bag...
详细信息
ISBN:
(纸本)9781467376808
The neural network method is one of the most important methods in the field of speech recognition. In this paper, we propose a new speech recognition method, probabilistic neural network (PNN) ensembles, where the Bagging ensembles method is used to form a speech recognition model with probabilistic neural networks integrated, to implement a speaker-independent English speech recognition system. This paper also demonstrates that before speech recognition, applying segment clustering algorithm to the extracted speech data, i.e., the process of time warping, can ensure the validity of dataset and the performance of PNN. Through experiments, the experimental results show that the PNN ensembles method has faster modeling speed and higher recognition rate than the single BP (Back Propagation) and the BP ensembles method, and has higher recognition rate than the traditional PNN method.
Messages spreading inside vehicular ad hoc networks (VANETs) generally need to achieve the property of verifiability and content integrity, while preserving user privacy. Otherwise, VANETs will either fall into chaos,...
详细信息
ISBN:
(纸本)9781479965144
Messages spreading inside vehicular ad hoc networks (VANETs) generally need to achieve the property of verifiability and content integrity, while preserving user privacy. Otherwise, VANETs will either fall into chaos, or prevent users from embracing it. To achieve this goal, we propose a protocol, which contains a priori and posteriori countermeasures, to guarantee these features. The a priori process firstly verifies that each message is sent by a vehicle only once. Then it collects and checks whether the count of the message exceeds the threshold value to improve the trustworthiness of the message. The posteriori process verifies the integrity of the message, ensuring it is unchanged during transmission between the vehicle and the road side unit. The privacy is preserved by applying group signature. In case of disruptive events, the proposed solution can trace back to the source vehicle which generates the message.
In this paper, we introduce SLQS, a new entropy-based measure for the unsupervised identification of hypernymy and its directionality in Distributional Semantic Models (DSMs). SLQS is assessed through two tasks: (i.) ...
详细信息
Aspect-based sentiment analysis (ABSA) aims to predict fine-grained sentiments of comments with respect to given aspect terms or categories. In previous ABSA methods, the importance of aspect has been realized and ver...
详细信息
We present a neural framework for learning associations between interrelated groups of words such as the ones found in Subject-Verb-Object (SVO) structures. Our model induces a joint function-specific word vector spac...
详细信息
Infectious disease outbreaks continue to pose a significant threat to human health and well-being. To improve disease surveillance and understanding of disease spread, several surveillance systems have been developed ...
详细信息
To study and implement a computer evaluation system for spoken English pronunciation is important for learners to improve their spoken English. This paper introduces an undergraduate-oriented evaluation model of spoke...
详细信息
ISBN:
(纸本)9781509040940
To study and implement a computer evaluation system for spoken English pronunciation is important for learners to improve their spoken English. This paper introduces an undergraduate-oriented evaluation model of spoken English pronunciation and its related system, with four evaluation parameter of accuracy, speed, rhythm and intonation. This paper illustrates the necessity of each evaluation index, its computer realization method, and its weight in the overall evaluation model. Verified by experiments, the evaluation index and the model method adopted in this paper are reasonable and reliable.
It has been long known that sparsity is an effective inductive bias for learning efficient representation of data in vectors with fixed dimensionality, and it has been explored in many areas of representation learning...
详细信息
Knowledge probing is crucial for understanding the knowledge transfer mechanism behind the pre-trained language models (PLMs). Despite the growing progress of probing knowledge for PLMs in the general domain, speciali...
详细信息
暂无评论