To reduce the difficulty and computational cost of feature extraction for web spam detection, a method is proposed that extracts semantic features using only the HTML source of the current page. First, the domain name is segmented by a memoized search algorithm that combines depth-first search and dynamic programming. Second, Latent Dirichlet Allocation (LDA) is used to extract the topic words of the web page. Finally, three single-page semantic similarity features are calculated based on Word2Vec and Word Mover's Distance. The single-page semantic similarity features are combined with single-page statistical features, and classification algorithms such as random forest are used to build classification models for web spam detection. Experimental results show that classification using single-page semantic and statistical features reaches an AUC of 83.1%, about 7% higher than that of the control method.
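The domain-segmentation step can be illustrated with a minimal sketch of memoized depth-first search. The abstract does not give the authors' vocabulary or scoring rule, so the toy word list `VOCAB`, the function name `segment`, and the "fewest words wins" criterion are all assumptions made for illustration only.

```python
from functools import lru_cache

# Hypothetical toy vocabulary; a real system would load a large word list.
VOCAB = frozenset({"cheap", "pills", "pill", "s", "online", "on", "line"})


def segment(domain, vocab=VOCAB):
    """Split `domain` into vocabulary words, preferring the fewest words.

    Returns a list of words, or None if no full segmentation exists.
    """

    @lru_cache(maxsize=None)  # memoization: each start offset is solved once
    def best(start):
        if start == len(domain):
            return ()  # empty suffix: nothing left to segment
        candidates = []
        # Depth-first: try every vocabulary word that begins at `start`.
        for end in range(start + 1, len(domain) + 1):
            word = domain[start:end]
            if word in vocab:
                rest = best(end)
                if rest is not None:
                    candidates.append((word,) + rest)
        # Assumed criterion: keep the segmentation with the fewest words.
        return min(candidates, key=len) if candidates else None

    result = best(0)
    return list(result) if result is not None else None


print(segment("cheappillsonline"))
```

Because each suffix is cached, the search visits every start offset at most once, which is the dynamic-programming half of the combination described in the abstract.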
Sparse and unstructured computations are common in scientific and engineering applications. In such computations, data arrays are indexed indirectly, through the values of other arrays or through non-affine subscripts, so the data access pattern is not known until runtime. So far, all parallel computing strategies for this kind of irregular problem have been designed for a single network topology and cannot fully exploit modern hierarchical computing architectures such as grids. We propose a hybrid parallel computing strategy, RP (short for "Replicated and Partially-shared"), to improve the performance of irregular applications in a COC (Cluster of Clusters) environment. A detailed comparison is made between our strategy and traditional models, along with experimental results demonstrating its effectiveness. A class of practical irregular applications that employ the RP strategy can obtain much shorter execution times and better scalability in heterogeneous network-based computing environments.
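As a small illustration of the indirect indexing described above (not of the RP strategy itself, whose details the abstract does not give), the sketch below accumulates values into an output array through an index array. The function name `scatter_add` and the sample data are hypothetical.

```python
def scatter_add(values, index, size):
    """Accumulate values[i] into out[index[i]].

    Which element of `out` each iteration touches depends on the
    contents of `index`, so the access pattern is only known at runtime.
    """
    out = [0.0] * size
    for v, i in zip(values, index):
        out[i] += v  # indirect subscript: out[index[k]]
    return out


print(scatter_add([1.0, 2.0, 3.0], [2, 0, 2], 3))
```

Two iterations may write to the same element (index 2 here), which is why such loops cannot be parallelized by a static partition of the iteration space alone.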
ISBN (digital): 9781728184975
ISBN (print): 9781728184982
This paper proposes a velocity planning strategy for autonomous-rail rapid transit (ART) based on the pseudospectral (PS) method. The PS method is used as a replanning algorithm because of its real-time performance. A multi-particle model is adopted as the dynamics model of the ART. The energy consumption of the ART is chosen as the optimization goal, and the arrival time and arrival velocity are taken as constraints to ensure the economic efficiency and punctuality of the ART. When the ART encounters obstacles such as pedestrians and lower-speed vehicles, the PS-based velocity planning strategy is applied to re-plan its velocity. The performance of the proposed strategy is evaluated by comparing it with a strategy that tracks the original velocity profile planned offline by a dynamic programming (DP) algorithm. Simulation results in the MATLAB/Simulink-TruckSim environment show that the PS-based method has better real-time performance than the DP-based method. The proposed planning strategy also lets the ART arrive at the next station punctually and reduces energy consumption by 36.21% compared with the DP-based method. The jerk results of the ART under the PS strategy also indicate better passenger comfort.
Based on research into the Chinese engineering valuation system and the application of bidding strategy, a computer-aided valuation system is developed to improve work efficiency. The planning, modeling, design, and development process of the system is discussed, covering the development background, system programming, database design, and module design, and the results of the system's implementation are then described.
This work introduces a novel approach called the Multi-Objective Integrated Immune Moth Flame Evolutionary Programming (MO-IIMFEP) algorithm. The algorithm aims to determine the optimal sizes and positions of Type III distributed generators (DGs), which generate both active and reactive power. The objectives are to reduce overall losses in the distribution system while adhering to voltage restrictions and taking into account the cost limitations associated with DG installation. MO-IIMFEP overcomes the limitations of traditional Evolutionary Programming (EP) and Moth Flame Optimization (MFO), particularly in handling local optima. Fuzzy logic is employed in MO-IIMFEP to select, from the non-dominated Pareto solutions, the best compromise among conflicting goals. The efficacy of MO-IIMFEP in identifying optimal solutions for multi-objective problems is demonstrated through comprehensive assessments on the 118-Bus Radial Distribution System (RDS), comparing it against MO-EP and MO-MFO. The results underscore the strategic benefits of DG installation in sustaining voltage levels, reducing power losses, and minimizing total operating costs for power suppliers.
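The selection of a compromise from non-dominated Pareto solutions can be sketched as follows. The abstract does not state which fuzzy rule MO-IIMFEP uses, so this example assumes a commonly used linear membership function per objective (1 at the best value, 0 at the worst) and picks the solution with the largest summed membership. The function names and sample objective values are hypothetical, and all objectives are assumed to be minimized.

```python
def non_dominated(points):
    """Return the points not dominated by any other point (minimization)."""

    def dominates(a, b):
        # a dominates b: no worse in every objective, strictly better in one.
        return all(x <= y for x, y in zip(a, b)) and any(
            x < y for x, y in zip(a, b)
        )

    return [p for p in points if not any(dominates(q, p) for q in points)]


def fuzzy_compromise(front):
    """Pick the point with the highest summed linear fuzzy membership."""
    n_obj = len(front[0])
    lo = [min(p[k] for p in front) for k in range(n_obj)]
    hi = [max(p[k] for p in front) for k in range(n_obj)]

    def membership(p):
        total = 0.0
        for k in range(n_obj):
            if hi[k] == lo[k]:
                total += 1.0  # objective is constant across the front
            else:
                total += (hi[k] - p[k]) / (hi[k] - lo[k])
        return total

    return max(front, key=membership)


# Hypothetical (loss, cost) pairs, both to be minimized.
candidates = [(1.0, 9.0), (3.0, 4.0), (8.0, 1.0), (5.0, 5.0), (9.0, 9.0)]
front = non_dominated(candidates)
print(front, fuzzy_compromise(front))
```

The linear membership treats each objective on a normalized 0 to 1 scale, so objectives with different units (for example power loss and installation cost) can be combined without hand-tuned weights.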
ISBN (print): 9781510819085
Recently, there has been a great need in society for MCU learning, especially in engineering and science. However, the development of learning environments has not kept pace with the increasing demand for MCU teaching. On the one hand, the cost of learning at one's own expense is high. On the other hand, learning conditions remain limited, and MCU laboratories still face shortages of devices and space. Under these circumstances, this paper designs a remote MCU learning platform based on virtual reality, where learners can practice MCU programming over the network instead of on a physical MCU. The low-cost platform presented in this paper is not only convenient and user-friendly, but also reliable.
The basic course of modeling is a required foundation course covering "design sketch" and "three components". The research field of the digital media application technology major is software interface design, which requires students to have a certain foundation in fine arts. Students of this major in higher vocational colleges are non-art students who have no fine arts foundation. Based on practical teaching experience, this paper analyzes how teachers can guide students to overcome anxiety and develop an interest, and puts forward some practical methods for teaching the basic course of modeling in the digital media application technology major.
As one of the key technologies for deployment of future wireless networks, the state-of-the-art reconfigurable intelligent surfaces (RISs) have rapidly gained a massive interest among researchers. In a specific case s...
Various state-of-the-art automated reasoning (AR) tools are widely used as backend tools in research of knowledge representation and reasoning as well as in industrial applications. In testing and verification, those ...
In contrast with the increasing popularity of heterogeneous systems, programming on these systems remains complex and time-consuming. Developers have to access heterogeneous processors through explicitly and error-pro...