In adversarial environment such as intrusion detection and spam filtering, the adversary-intruder or spam advertiser may attempt to produce contaminate training instance and manipulate the learning of classifier. In o...
详细信息
In adversarial environment such as intrusion detection and spam filtering, the adversary-intruder or spam advertiser may attempt to produce contaminate training instance and manipulate the learning of classifier. In order to keep good classification performance, many robuster learning methods have been proposed to deal with the adversarial attack. Support Vector Machines(SVMs) is a kind of successful approach in the adversarial classification tasks and the investigation of robust SVMs is very popular. However, in many real application, the data including stain instance is coming dynamically. Batch learning which needs retraining when encountering new samples, will consume more computing resources. In this paper, we propose a robust Lagrangian support vector machine (RLSVM) with modified kernel matrix and explore the online learning algorithm on it. The experimental results show the robustness of RLSVM against label noise produced by adversaries under the online adversarial environment.
The multilingual focused crawler system combines web content extraction with path configuration to make use of their advantages and achieve automatic collection of network information in multiple languages. Firstly, s...
The multilingual focused crawler system combines web content extraction with path configuration to make use of their advantages and achieve automatic collection of network information in multiple languages. Firstly, system selects foreign language keywords according to crawling webpage language and Chinese keywords, and uses initial link to obtain webpage information. Then, it uses path configuration information or web content extraction algorithm based on the distribution line block to get webpage content, and adopts rules or configuration information to acquire new links, published time and title. Next, keywords are used to filter irrelevant information. Finally, results are presented as a list. When users use focused crawler system, the webpage path information can be configured or not according to requirements, and the collected network resources can also be searched or filtered.
Finding semantically rich and computerunderstandable representations for textual dialogues, utterances and words is crucial for dialogue systems (or conversational agents), as their performance mostly depends on under...
详细信息
Performance appraisal has always been an important research topic in human resource management. A reasonable performance appraisal plan lays a solid foundation for the development of an enterprise. Especially as globa...
详细信息
Performance appraisal has always been an important research topic in human resource management. A reasonable performance appraisal plan lays a solid foundation for the development of an enterprise. Especially as globalization and technology advance, in order to meet the fast-changing strategic goals and increasing cross-functional tasks, enterprises face new challenges in performance appraisal. How to improve employees’ ability to accept new knowledge efficiently and constantly has been an urgent problem for enterprises. In this paper, we propose an automatic method which generation multiple-choice questions by utilizing the relations between different terminology. Graphical model is used to extract core concept from different corpus while word embedding technology is used to indicate the relevant relations. Experimental results demonstrate that the proposed question generation method outperforms the traditional manual method in both efficiency and confusion.
Consider a problem where 4k given vectors need to be partitioned into k clusters of four vectors each. A cluster of four vectors is called a quad, and the cost of a quad is the sum of the component-wise maxima of the ...
详细信息
Recently, more and more authors have been encouraged for collaboration because it often produces good results. However, the author collaboration network contains experts in various research directions within various f...
详细信息
Set similarity join is an essential operation in big data analytics, e.g., data integration and data cleaning, that finds similar pairs from two collections of sets. To cope with the increasing scale of the data, dist...
详细信息
Mobile crowd sensing has the potential to acquire massive data from places and address large-scale societal problems. However, most currently existing crowd sensing systems suffer from insufficient participants. There...
详细信息
Mobile crowd sensing has the potential to acquire massive data from places and address large-scale societal problems. However, most currently existing crowd sensing systems suffer from insufficient participants. Therefore, incentive design for crowd sensing is essential and urgent. In this paper, different from the auction-based and server-dominant incentives, we design a personalized incentive, PIE, with partiality for neither the server nor the participants with budget constraint. The total payment for all the participants accords to their collective participation level, and the individual reward for each participant depends on individual contribution. We measure the individual contribution and participation level based on Voronoi diagram and Shannon entropy. Both offline and online incentives are proposed with budget constraint. Experimental study shows that our incentives are participation-aware and contribution-dependent, which encourages participants' active join, balanced distribution and flexible reward.
Massive Open Online Courses (MOOCs) are attracting the attention of people all over the world. Regardless the platform, numbers of registrants for online courses are impressive but in the same time, completion rates a...
详细信息
Intelligent Manufacturing has attracted global and continuous attention recent years, with more and more intelligent devices and systems applied in production. In this paper, we take China’s manufacturing listed firm...
详细信息
Intelligent Manufacturing has attracted global and continuous attention recent years, with more and more intelligent devices and systems applied in production. In this paper, we take China’s manufacturing listed firms to investigate the productivity difference between intelligent and general manufacturing firms. By the Cobb-Douglas production function, we built a Coefficient-varying Model and used Seemingly Unrelated Regression (SUR) to estimate the time-varying trend of productivity from 2011 to 2017. The empirical results show that “Intelligent Manufacturing” has obviously promoted the Labor factor utilization efficiency and the Total Factor Productivity (TFP) through the advances in technology. But it doesn’t have universally enhancing effect of all industries. The impact of “Intelligent Manufacturing” still remains to be observed overtime.
暂无评论