检索结果-内蒙古大学图书馆

Representation learning on textual network with personalized Page Rank

Science China(Information Sciences) 2021年第11期64卷 95-104页

作者： Teng LI Yong DOU National Laboratory for Parallel and Distributed Processing National University of Defense Technology

Representation learning on textual network or textual network embedding, which leverages rich textual information associated with the network structure to learn low-dimensional embedding of vertices, has been useful in a variety of tasks. However, most approaches learn textual network embedding by using direct neighbors. In this paper, we employ a powerful and spatially localized operation: personalized Page Rank(PPR) to eliminate the restriction of using only the direct connection relationship. Also, we analyze the relationship between PPR and spectral-domain theory, which provides insight into the empirical performance boost. From the experiment, we discovered that the proposed method provides a great improvement in linkprediction tasks, when compared to existing methods, achieving a new state-of-the-art on several real-world benchmark datasets.

关键词： representation learning network embedding PageRank textual network personalized PageRank

来源：评论

学校读者我要写书评

暂无评论

Prophet: Fine-grained Load Balancing for parallel Training of Large-scale MoE Models

Prophet: Fine-grained Load Balancing for Parallel Training o...

引用

IEEE International Conference on Cluster Computing

作者： Wei Wang Zhiquan Lai Shengwei Li Weijie Liu Keshi Ge Yujie Liu Ao Shen Dongsheng Li National Laboratory for Parallel and Distributed Processing(PDL) College Of Computer National University Of Defense Technology Changsha China

Mixture of Expert (MoE) has received increasing attention for scaling DNN models to extra-large size with negligible increases in computation. The MoE model has achieved the highest accuracy in several domains. However, a significant load imbalance occurs in the device during the training of a MoE model, resulting in significantly reduced throughput. Previous works on load balancing either harm model convergence or suffer from high execution overhead. To address these issues, we present Prophet: a fine-grained load balancing method for parallel training of large-scale MoE models, which consists of a planner and a scheduler. Prophet planner first employs a fine-grained resource allocation method to determine the possible scenarios for the expert placement in a fine-grained manner, and then efficiently searches for a well-balanced expert placement to balance the load without introducing additional overhead. Prophet scheduler exploits the locality of the token distribution to schedule the resource allocation operations using a layer-wise fine-grained schedule strategy to hide their overhead. We conduct extensive experiments in four clusters and five representative models. The results indicate that Prophet gains up to 2.3x speedup compared to the state-of-the-art MoE frameworks including Deepspeed-MoE and FasterMoE. Additionally, Prophet achieves a load balancing enhancement of up to 12.06x when compared to FasterMoE.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Prediction of the Cyanobacteria Coverage in Time-series Images based on Convolutional Neural Network 21

Prediction of the Cyanobacteria Coverage in Time-series Imag...

引用

4th International Conference on Control and Computer Vision, ICCCV 2021

作者： Ye, Xiangyu Lai, Zhiquan Li, Dongsheng National Key Laboratory of Parallel and Distributed Processing Computer College National University of Defense Technology China

ISBN: (纸本)9781450390477

In recent years, the problem of lake eutrophication has become increasingly severe. The monitoring and control of cyanobacteria in lakes are of great significance. The information obtained by existing monitoring methods is relatively lagging, and it is impossible to monitor the sudden outbreak of cyanobacteria in time. Getting cyanobacteria information directly through camera images is a breakthrough. In this paper, after analyzing the characteristics of time series cyanobacteria images, we propose a block prediction scheme based on the CNN model. Experiments show that this method can quickly calculate the coverage of cyanobacteria in the monitoring image in a short time. It can also effectively distinguish cyanobacteria-rich water areas, which significantly facilitates water quality monitoring and cyanobacteria management. We can draw a chart of the changes in the coverage of cyanobacteria by analyzing multi-day time-series images. The chart helps us conduct a short-term water quality analysis to better deal with the outbreak of cyanobacteria. © 2021 ACM.

关键词： Lakes

来源：评论

学校读者我要写书评

暂无评论

Effective Anomaly Detection Based on Reinforcement Learning in Network Traffic Data 27

Effective Anomaly Detection Based on Reinforcement Learning ...

引用

27th IEEE International Conference on parallel and distributed Systems, ICPADS 2021

作者： Wang, Zhongyang Wang, Yijie Xu, Hongzuo Wang, Yongjun National University of Defense Technology Science and Technology on Parallel Distributed Processing Laboratory College of Computer China

ISBN: (纸本)9781665408783

Mixed-type data with both categorical and numerical features are ubiquitous in network security, but the existing methods are minimal to deal with them. Existing methods usually process mixed-type data through feature conversion, whereas their performance is downgraded by information loss and noise caused by the transformation. Meanwhile, existing methods usually superimpose domain knowledge and machine learning in which fixed thresholds are used. It cannot dynamically adjust the anomaly threshold to the actual scenario, resulting in inaccurate anomalies obtained, which results in poor performance. To address these issues, this paper proposes a novel Anomaly Detection method based on Reinforcement Learning, termed ADRL, which uses reinforcement learning to dynamically search for thresholds and accurately obtain anomaly candidate sets, fusing domain knowledge and machine learning fully and promoting each other. Specifically, ADRL uses prior domain knowledge to label known anomalies and uses entropy and deep autoencoder in the categorical and numerical feature spaces, respectively, to obtain anomaly scores combining with known anomaly information, which are integrated to get the overall anomaly scores via a dynamic integration strategy. To obtain accurate anomaly candidate sets, ADRL uses reinforcement learning to search for the best threshold. Detailedly, it initializes the anomaly threshold to get the initial anomaly candidate set and carries on the frequent rule mining to the anomaly candidate set to form the new knowledge. Then, ADRL uses the obtained knowledge to adjust the anomaly score and get the score modification rate. According to the modification rate, different threshold modification strategies are executed, and the best threshold, that is, the threshold under the maximum modification rate, is finally obtained, and the modified anomaly scores are obtained. The scores are used to re-carry out machine learning to improve the algorithm's accuracy for anomalo

关键词： Anomaly detection

来源：评论

学校读者我要写书评

暂无评论

Improving the Performance of Lattice Boltzmann Method with Pipelined Algorithm on A Heterogeneous Multi-zone Processor 23rd

Improving the Performance of Lattice Boltzmann Method with...

引用

23rd International Conference on parallel and distributed Computing, Applications, and Technologies, PDCAT 2022

作者： Zhang, Qingyang Xu, Lei Chen, Rongliang Chen, Lin Chen, Xinhai Wang, Qinglin Liu, Jie Yang, Bo Science and Technology on Parallel and Distributed Processing Laboratory National University of Defense Technology Changsha410000 China Laboratory of Software Engineering for Complex Systems National University of Defense Technology Changsha410000 China Shenzhen Institutes of Advanced Technology Chinese Academy of Sciences Shenzhen518055 China

ISBN: (纸本)9783031299261

Lattice Boltzmann method (LBM) has become a powerful method in computational fluid dynamics and has drawn more and more attention in high-performance computing due to its particulate nature and local dynamics, especially on recent multi-core or many-core platforms. This paper develops a parallel software framework for 3D LBM simulation on a heterogeneous multi-zone processor, MT-3000. An improved pipelined algorithm named Pencil-H is proposed, which can not only fully exploit the advantages of each component of MT-3000 but also overlap the time of calculation and communication. Moreover, an architecture-aware multi-level parallelization algorithm is developed to fully utilize the computational performance of MT-3000. A benchmark test is performed to verify the reliability and test the performance of the LBM code. Experimental results show that the optimized code achieves a 32.02 × speedup compared with using 16 CPU cores and achieves a performance of 286.03MLUPS which reaches 72.3% of the theoretical peak performance. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： Computational fluid dynamics

来源：评论

学校读者我要写书评

暂无评论

RiverMapper: Step-wisely Mapping the Surface Rivers on Optical Remote Sensing Images

RiverMapper: Step-wisely Mapping the Surface Rivers on Optic...

引用

2022 International Conference on Neural Networks, Information, and Communication Engineering, NNICE 2022

作者： Zhang, Peng Pan, Hengyue Yang, Ke Dou, Yong Niu, Xin National Laboratory for Parallel and Distributed Processing National University of Defense Technology 410073 China Artificial Intelligence Research Center National Innovation Institute of Defense Technology BeiJing100850 China

ISBN: (纸本)9781510655171

Accurately mapping the surface rivers is important in ecological environment monitoring and disaster prevention. The development of remote sensing technology and computer vision greatly improves the efficiency of this task. However, there are few methods that map the rivers from an image directly. The existing automatic river mapping methods usually had two successive stages: waterbody extraction and flow-path extraction, where the latter methods were very dependent on the waterbody masks generated by the former methods. Errors in waterbody masks caused breaks and redundancies in the extracted graphs. This paper proposed RiverMapper, which mapped the rivers step-wisely without dividing into two stages. Following the directions and actions predicted by the convolution neural network, RiverMapper walked along the rivers step by step and cropped the fixed-size image patches at each step for segmentation. Final river graphs were constructed by the waterbody mask patches and those tracks generated by RiverMapper. We applied RiverMapper on optical remote sensing images containing the Changjiang River and the Huanghe River. Without the degradation of the performance on waterbody extraction, RiverMapper outperformed other methods in terms of the local topological and geometrical similarity between the predicted and the ground-truth river graphs. © 2022 SPIE.

关键词： Mapping

来源：评论

学校读者我要写书评

暂无评论

Deep reinforcement learning:a survey

引用

Frontiers of Information Technology & Electronic Engineering 2020年第12期21卷 1726-1744页

作者： Hao-nan WANG Ning LIU Yi-yun ZHANG Da-wei FENG Feng HUANG Dong-sheng LI Yi-ming ZHANG Science and Technology on Parallel and Distributed Processing Laboratory National University of Defense TechnologyChangsha 41OOOOChina

Deep reinforcement learning(RL)has become one of the most popular topics in artificial intelligence *** has been widely used in various fields,such as end-to-end control,robotic control,recommendation systems,and natural language dialogue *** this survey,we systematically categorize the deep RL algorithms and applications,and provide a detailed review over existing deep RL algorithms by dividing them into modelbased methods,model-free methods,and advanced RL *** thoroughly analyze the advances including exploration,inverse RL,and transfer ***,we outline the current representative applications,and analyze four open problems for future research.

关键词： Reinforcement learning Deep reinforcement learning Reinforcement learning applications

来源：评论

学校读者我要写书评

暂无评论

Evaluating matrix multiplication-based convolution algorithm on multi-core digital signal processors

引用

Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology 2023年第1期45卷 86-94页

作者： Wang, Qinglin Pei, Xiangdong Liao, Linyu Wang, Haoxu Li, Rongchun Mei, Songzhu Li, Dongsheng College of Computer Science and Technology National University of Defense Technology Changsha410073 China Science and Technology on Parallel and Distributed Processing Laboratory National University of Defense Technology Changsha410073 China

The matrix multiplication-based convolutional algorithm, which can efficiently implement convolutions with different parameters, is the first choice of convolution performance optimization for a given chip. Based on the architecture of Phytium heterogeneous multi-core DSPs(digital signal processors) developed by National University of Defense Technology and the characteristic of the matrix multiplication-based convolutional algorithm, a parallel implementation of the matrix multiplication-based convolutional algorithm (called ftmEConv) for different convolutions on multi-core DSPs was proposed. The ftmEConv consists of four parallelized parts(input feature maps transformation, filter transformation, matrix multiplication, and output feature maps transformation), all of which were optimized for multi-core DSPs, and the performance of each part was improved by effectively exploiting the potential of all functional units in DSP cores. The experimental results demonstrate that ftmEConv achieves computational efficiency of up to 42.90%. Compared with other implementations of the matrix multiplication-based convolutional algorithm on heterogeneous chips, ftmEConv gets a speedup of up to 7.79 times. © 2023 National University of Defense Technology. All rights reserved.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

Word Embedding-based Context-sensitive Network Flow Payload Anomaly Detection 3

Word Embedding-based Context-sensitive Network Flow Payload ...

引用

3rd International Conference on Applied Machine Learning, ICAML 2021

作者： Li, Yizhou Wang, Yijie Cheng, Li Xu, Hongzuo Science and Technology on Parallel and Distributed Processing Laboratory College of Computer National University of Defense Technology Changsha China

ISBN: (纸本)9781665421256

Payload anomaly detection can discover malicious beliaviors tiidden in network packets. It is liard to liandle payload due to its various possible characters and complex semantic context, and tlius identifying abnormal payload is also a non-trivial task. Prior art only uses the n-gram language model to extract features, which directly leads to ultra-high-dimensional feature space and also fails to capture the context semantics fully. Accordingly, this paper proposes a word embedding-based context-sensitive network flow payload anomaly detection method (termed WECAD). First, WECAD obtains the initial feature representation of the payload through the word embedding-based method. Then, we propose a corpus pruning algorithm, which appUes the cosine similarity clustering and frequency distribution to prune inconsequential characters. We only keep the essential characters to reduce the calculation space. Subsequently, we propose a context learning algorithm. It employs the co-occurrence matrix transformation technology and introduces the backward step size to consider the order relationship of essential characters. Comprehensive experiments on real-world intrusion detection datasets validate the effectiveness of our method. © 2021 IEEE

关键词： Embeddings

来源：评论

学校读者我要写书评

暂无评论

A Novel Deep Neural Network Model for Credit Risk Prediction of Chinese Farmers 6

A Novel Deep Neural Network Model for Credit Risk Prediction...

引用

6th IEEE International Conference on Data Science in Cyberspace, DSC 2021

作者： Xie, Yalong Li, Aiping Liu, Ziniu Chen, Kai Tu, Hongkui College Of Computer National University Of Defense Technology Science And Technology On Parallel And Distributed Processing Laboratory Changsha China

ISBN: (纸本)9781665418157

China is a big agricultural county with more than 500 million rural population. In China, farmers usually loan from rural commercial banks or rural credit cooperatives. It is crucial for the national economic development and the improvement of people's standard of living that how to reasonably use funds to subsidize the agricultural population and reduce the risk of rural loans. At present, credit risk prediction of farmers mainly depends on the experience of experts in the business field, and there is little published research on using artificial intelligence methods to solve this problem. This paper presents a complete set of methods, including data collection, feature selection, etc. We propose a novel deep neural network model named DNN-CRP for credit risk prediction of Chinese framers. Experiments on an actual credit loan dataset of Chinese farmers are presented, and experimental results show that the comprehensive performance of the DNN-CRP model is better than current state-of-the-art models. It is believed that the DNN-CRP model proposed in this paper can help banks improve the efficiency of the credit loan business of farmers and reduce credit risks. © 2021 IEEE.

关键词： Neural network models

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：