To generate reports for an enterprise whose existing website systems offer no API support and for which database access has not been granted, this paper proposes a daily business report generator based on web scraping with a k-nearest-neighbor (kNN) classification algorithm. It covers the web crawler technology used to access the existing website system and extract business data. The kNN algorithm is applied to identify the verification code on the login page, and the brief daily report is generated in a spreadsheet style. Compared with some OCR engines for image recognition, the system, implemented in Python, can automatically generate brief daily business reports using the kNN algorithm, which performs better at validating the verification code than some libraries with a default training set.
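As a rough illustration of the kNN step described above, the following minimal sketch classifies a flattened glyph by majority vote among its nearest training samples. The function name `knn_predict`, the 2x2 binary "glyphs", and the labels are hypothetical and are not taken from the paper; a real verification-code recognizer would first segment the image into characters and use far larger feature vectors.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Return the majority label among the k training samples closest to query.

    train: list of (feature_vector, label) pairs (hypothetical format).
    query: feature vector of the same length as the training vectors.
    """
    # Rank training samples by Euclidean distance to the query and keep k.
    neighbors = sorted(train, key=lambda s: math.dist(s[0], query))[:k]
    # Majority vote over the labels of the k nearest neighbors.
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


# Toy training set: 2x2 binary glyphs flattened to 4 pixels (assumed data).
TRAIN = [
    ((1, 0, 1, 0), "1"),
    ((1, 0, 1, 1), "1"),
    ((0, 0, 1, 0), "1"),
    ((1, 1, 0, 1), "7"),
    ((1, 1, 0, 0), "7"),
    ((0, 1, 0, 1), "7"),
]

if __name__ == "__main__":
    print(knn_predict(TRAIN, (1, 0, 1, 0)))
    print(knn_predict(TRAIN, (1, 1, 0, 1)))
```

Training on samples collected from the target site itself, rather than on a generic default set, is one plausible reason such a classifier could outperform an off-the-shelf OCR library on that site's verification codes.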
Conventional feature selection algorithms tend to run inefficiently on large-scale datasets with interacting features. This paper proposes a novel rough feature selection algorithm whose innovation centers on a layered co-evolutionary strategy with a neighborhood radius hierarchy. This hierarchy adapts the rough feature scales among different layers and produces reasonable decompositions by exploiting the correlation and interdependency among feature subsets. Both neighborhood interaction within a layer and neighborhood cascade between layers are adopted to interactively optimize the neighborhood radius matrix, so that both the optimal rough feature selection subsets and their global optimal set are obtained efficiently. The experimental results show that the proposed algorithm achieves better effectiveness, accuracy and applicability than some traditional feature selection algorithms.
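To make the role of the neighborhood radius concrete, the sketch below computes a commonly used neighborhood rough set quantity: the fraction of samples whose radius-`delta` neighborhood, measured on a chosen feature subset, contains only one class label. This is an assumed, simplified illustration and not the layered co-evolutionary algorithm of the paper; the function name `neighborhood_dependency` and the toy data are hypothetical.

```python
import math


def neighborhood_dependency(X, y, features, delta):
    """Fraction of samples whose delta-neighborhood on `features` is label-pure.

    X: list of numeric feature tuples, y: list of labels,
    features: indices of the feature subset, delta: neighborhood radius.
    """
    def dist(a, b):
        # Euclidean distance restricted to the selected features.
        return math.sqrt(sum((a[f] - b[f]) ** 2 for f in features))

    consistent = 0
    for xi in X:
        # Labels of every sample within radius delta of xi (xi included).
        labels = {y[j] for j, xj in enumerate(X) if dist(xi, xj) <= delta}
        if len(labels) == 1:
            consistent += 1
    return consistent / len(X)


# Assumed toy data: feature 0 separates the classes, feature 1 does not.
X = [(0.1, 0.5), (0.2, 0.9), (0.8, 0.5), (0.9, 0.1)]
y = ["a", "a", "b", "b"]

if __name__ == "__main__":
    print(neighborhood_dependency(X, y, [0], 0.3))
    print(neighborhood_dependency(X, y, [1], 0.3))
```

A feature subset with a higher value separates the classes more cleanly at the given radius, which is why the choice of radius matters for which subsets look good.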
Model counting is the problem of computing the number of satisfying assignments of a given propositional formula. Although exact model counters can be naturally furnished by most of the knowledge compilation (KC) meth...
A genetic fuzzy system is applied to the weapon control module of unmanned tanks. In this paper, a simulation system is built to train the fuzzy rule base of the genetic fuzzy system on training tasks in order to obtain the optimal fuzzy rule base. Testing tasks are then used to measure the success rate of different missions under the optimal rule base. The fuzzy inference system obtained by the algorithm achieves a high success rate in various tasks.
The traditional fuzzy C-means clustering algorithm has poor noise immunity and yields poor clustering results in image segmentation. To overcome this problem, a novel image clustering algorithm based on SLIC superpixel and transfer l...
Obtaining interesting and topic-relevant information is a very important task in Web mining. Text classification using a small proportion of labeled data and a large proportion of unlabeled data, also called semi-supe...
For each microarray data set, only a small number of genes are beneficial. Due to the high-dimensional problem, gene selection research work remains a challenge. In order to solve the high-dimensional problem, we prop...
With the rapid growth of the IT industry, more and more companies and universities are concentrating on the scientific evaluation of science-and-engineering students. Existing evaluation strategies typically rely on the grades or scores of the courses taken by students, which has obvious drawbacks and cannot lead to a proper improvement of education management. This paper proposes an overall student evaluation system architecture with three levels: a data collection infrastructure, which comprises student data collection and student data transmission; a data center, which is composed of four sub-levels; and a unified portal, which defines unified applications and classes of visiting terminals. In the proposed architecture, the main contribution lies in the uppermost sub-level of the data center, i.e. evaluation services. Four categories of student evaluation services are defined: moral trait, civic literacy, knowledge level and comprehensive ability. Furthermore, to provide satisfactory feedback in the practical teaching process, the most important category for science-and-engineering students, comprehensive ability, is subdivided into four sub-categories and visualized from several different aspects.