This work introduces a symmetric multiprocessing (SMP) version of the continuous iterative guided spectral class rejection (CIGSCR), a semi-automated classification algorithm for remote sensing (multispectral) images....
详细信息
ISBN:
(纸本)9781617828409
This work introduces a symmetric multiprocessing (SMP) version of the continuous iterative guided spectral class rejection (CIGSCR), a semi-automated classification algorithm for remote sensing (multispectral) images. The algorithm uses soft data clusters to produce a soft classification containing inherently more information than a comparable hard classification at an increased computational cost. Previous work suggests that similar algorithms achieve good parallel scalability, motivating the parallel algorithm development work here. Experimental results of applying parallel CIGSCR to an image with approximately 10~8 pixels and 6 bands demonstrate superlinear speedup. A soft two class classification is generated in just over four minutes using 32 processors.
Naive users using a browser have no idea about the back-end of the page. The users might be tricked into giving away their credentials or downloading malicious data. Our aim is to create an extension for Chrome which ...
详细信息
ISBN:
(纸本)9781509037056
Naive users using a browser have no idea about the back-end of the page. The users might be tricked into giving away their credentials or downloading malicious data. Our aim is to create an extension for Chrome which will act as middleware between the users and the malicious websites, and mitigate the risk of users succumbing to such websites. Further, all harmful content cannot be exhaustively collected as even that is bound to continuous development. To counter this we are using machine learning- to train the tool and categorize the new content it sees every time into the particular categories so that corresponding action can be taken.
When facing various and massive data resources, how to effectively utilize the resources according to the division field is one of the core problem of the institutional repository research. In this paper, we improved ...
详细信息
When facing various and massive data resources, how to effectively utilize the resources according to the division field is one of the core problem of the institutional repository research. In this paper, we improved Bayesian classification algorithm, then proposed a text classification algorithm based on domain knowledge. Furthermore, some key technologies such as text classification, feature selection, weight improvement and domain knowledge algorithm improvement are designed and implemented. We use widely applied IkAnalyzer method to classify Chinese words. For feature selection and weight improvement part, we focus on the processing of special vocabulary in the document. We introduce the field expand vocabulary assist the Bayesian formula in the field application part to obtain the final result. The experiment result shows that the improved algorithm enhanced the accuracy of the classification efficiently, and the system calculating time is acceptable.
Brain-Computer interface (BCI) which aims at enabling users to perform tasks through their brain waves has been a feasible and worth developing solution for growing demand of healthcare. Current proposed BCI systems a...
详细信息
ISBN:
(纸本)9781509061839
Brain-Computer interface (BCI) which aims at enabling users to perform tasks through their brain waves has been a feasible and worth developing solution for growing demand of healthcare. Current proposed BCI systems are often with lower applicability and do not provide much help for reducing burdens of users because of the time-consuming preparation required by adopted wet sensors and the shortage of provided interactive functions. Here, by integrating a state visually evoked potential (SSVEP)-based BCI system and a robotic eating assistive system, we propose a non-invasive wireless steady state visually evoked potential (SSVEP)-based BCI eating assistive system that enables users with physical disabilities to have meals independently. The analysis compared different methods of classification and indicated the best method. The applicability of the integrated eating assistive system was tested by an Amyotrophic Lateral Sclerosis (ALS) patient, and a questionnaire reply and some suggestion are provided. Fifteen healthy subjects engaged the experiment, and an average accuracy of 91.35%, and information transfer rate (ITR) of 20.69 bit per min are achieved. For online performance evaluation, the ALS patient gave basic affirmation and provided suggestions for further improvement. In summary, we proposed a usable SSVEP-based BCI system enabling users to have meals independently. With additional adjustment of movement design of the robotic arm and classification algorithm, the system may offer users with physical disabilities a new way to take care of themselves.
We develop a classification algorithm for estimating posterior distributions from positive-unlabeled data, that is robust to noise in the positive labels and effective for high-dimensional data. In recent years, sever...
详细信息
ISBN:
(纸本)9781510838819
We develop a classification algorithm for estimating posterior distributions from positive-unlabeled data, that is robust to noise in the positive labels and effective for high-dimensional data. In recent years, several algorithms have been proposed to learn from positive-unlabeled data; however, many of these contributions remain theoretical, performing poorly on real high-dimensional data that is typically contaminated with noise. We build on this previous work to develop two practical classification algorithms that explicitly model the noise in the positive labels and utilize univariate transforms built on discriminative classifiers. We prove that these univariate transforms preserve the class prior, enabling estimation in the univariate space and avoiding kernel density estimation for high-dimensional data. The theoretical development and parametric and nonparametric algorithms proposed here constitute an important step towards wide-spread use of robust classification algorithms for positive-unlabeled data.
Social network has become a very popular way for internet users to communicate and interact online. Users spend a great deal of time on famous social networks (e.g. Facebook, Twitter, Sina Weibo, etc.), reading news, ...
详细信息
ISBN:
(纸本)9781479940929
Social network has become a very popular way for internet users to communicate and interact online. Users spend a great deal of time on famous social networks (e.g. Facebook, Twitter, Sina Weibo, etc.), reading news, discussing events and posting their messages. Unfortunately, this popularity also attracts a significant amount of spammers who continuously expose malicious behaviors (e.g. post messages containing commercial topics or URLs, following a larger amount of users, etc.), leading to great inconvenience on normal users' social activities. In this paper, a supervised machine learning based spammer filtering method is proposed. We first collected a dataset from Sina Weibo that includes 30,116 users and more than 16 million messages;then, construct a labeled dataset of users and manually classify users into spammers and non-spammers;after that, abstract a set of novel features from message content and users' social behavior, and apply into SVM based spammer classifier. Our experiments show that true positive rate of spammers and non-spammers could reach 99.1% and 99.9%.
Focusing on the current impulse waveforms generated when the fixed or variable frequency air conditioners, vacuum cleaner, microwave oven and other electrical appliances are turned on, the characteristic parameters su...
详细信息
ISBN:
(数字)9781728162836
ISBN:
(纸本)9781728162843
Focusing on the current impulse waveforms generated when the fixed or variable frequency air conditioners, vacuum cleaner, microwave oven and other electrical appliances are turned on, the characteristic parameters such as impulse amplitude, rising time, dropping amplitude, and falling time are defined, and a multi-feature Bayesian classification model is established. Considering that the classification algorithm works on the smart meter hardware, the computing and storage resources are limited, as a result that the model scale needs to be reduced as much as possible, a meant parameters Bayesian classification method is proposed. The current impulse waveform samples of different appliances are divided into several groups, and the mean value of the characteristic values of the samples is used as the parameters of the classification model. Under the same model capacity, the generalization error of the model is reduced. The simulation results show that the accuracy of Bayesian classifier constructed with meant value parameters is better than that of ordinary Bayesian classifier. In the laboratory, the impulse classification test is carried out on a single-phase smart meter, and the test results verify the feasibility of the proposed method.
The usage of the Android Operating System (OS) has surpassed all other operating systems and as a result, it has become the primary target of attackers. Many attacks can be geared towards Android phones mainly using a...
详细信息
ISBN:
(数字)9781728151601
ISBN:
(纸本)9781728151618
The usage of the Android Operating System (OS) has surpassed all other operating systems and as a result, it has become the primary target of attackers. Many attacks can be geared towards Android phones mainly using application installation. These third-party applications first seek permission from the user before installation. Some of the permissions can be elusive evading the users' attention. With the type of harm that can be done which include illegal extraction and transfer of the users' data, spying on the users and so on there is a need to have a heuristic approach in the detection of malware. In this research work, some classification algorithms were tested to determine the best performing algorithm when it comes to the detection of android malware detection. An android application dataset was obtained from figshare and used in the Waikato Environment for Knowledge Analysis (WEKA) for training and testing, it was measured under accuracy, false-positive rate, precision, recall, f-measure, Receiver Operating Curve (ROC) and Root Mean Square Error (RMSE). It was discovered that multi-layer perceptron performs best with an accuracy of 99.4%.
In reading Computed Tomography (CT) scans with potentially malignant lung nodules, radiologists make use of high level information (semantic characteristics) in their analysis. Computer-Aided Diagnostic Characterizati...
详细信息
ISBN:
(纸本)9781424441211
In reading Computed Tomography (CT) scans with potentially malignant lung nodules, radiologists make use of high level information (semantic characteristics) in their analysis. Computer-Aided Diagnostic Characterization (CADc) systems can assist radiologists by offering a "second opinion" predicting these semantic characteristics for lung nodules. In this work, we propose a way of predicting the distribution of radiologists' opinions using a multiple-label classification algorithm based on belief decision trees using the National Cancer Institute (NCI) Lung Image Database Consortium (LIDC) dataset, which includes semantic annotations by up to four human radiologists for each one of the 914 nodules. Furthermore, we evaluate our multiple-label results using a novel distance-threshold curve technique - and, measuring the area under this curve, obtain 69% performance on the validation subset. We conclude that multiple-label classification algorithms are an appropriate method of representing the diagnoses of multiple radiologists on lung CT scans when ground truth is unavailable.
A robust music genre classification framework is proposed that combines the rich, psycho-physiologically grounded properties of slow temporal modulations of music recordings and the power of sparse representation-base...
详细信息
ISBN:
(纸本)9781617388767
A robust music genre classification framework is proposed that combines the rich, psycho-physiologically grounded properties of slow temporal modulations of music recordings and the power of sparse representation-based classifiers. Linear subspace dimensionality reduction techniques are shown to play a crucial role within the framework under study. The proposed method yields a music genre classification accuracy of 91% and 93.56% on the GTZAN and the ISMIR2004 Genre dataset, respectively. Both accuracies outperform any reported accuracy ever obtained by state of the art music genre classification algorithms in the aforementioned datasets.
暂无评论