检索结果-内蒙古大学图书馆

International Conference on Computer Supported Cooperative Work in Design

作者： Yu Zhang Jianqiang Zhang Gongpeng Song Qin Lu Key Laboratory of Computing Power Network and Information Security Ministry of Education Shandong Computer Science Center Qilu University of Technology (Shandong Academy of Sciences) Jinan China Shandong Engineering Research Center of Big Data Applied Technology Faculty of Computer Science and Technology Qilu University of Technology (Shandong Academy of Sciences) Jinan China Shandong Provincial Key Laboratory of Computer Networks Shandong Fundamental Research Center for Computer Science Jinan China Shandong Branch of China Mobile Communication Group Design Institute Co Jinan China

ISBN: (数字)9798350349184

ISBN: (纸本)9798350349191

With the rapid development of Natural Language Processing (NLP), text matching has become the basis of many downstream tasks in NLP, and the study of text matching is of great research significance for solving tasks such as question and answer and information retrieval in NLP. Most of the current text matching methods tend to have problems such as mismatch of grammatical structures and insufficient interaction information in sentences. In order to solve the problems of insufficient interaction information and lack of features and ability to capture keyword and sequence information in text matching, this paper proposes a text matching method based on multi-layer coding and soft attention mechanism. The method first embeds the text and sends it to a gating module for processing, then sends the processed result to a module containing a combination of multilayer coding and soft attention mechanism for further operations such as multiple alignment, and finally sends it to a classifier containing a three-layer fully-connected network for predicting whether the input text pairs match or not. The two modules proposed in this paper are practically feasible, and comparison and ablation experiments have been conducted on the publicly available datasets LCQMC dataset and BQ dataset, and the experimental results show that the two modules improve the accuracy of text matching by 2.44% and 1.02%, respectively.

关键词： Attention mechanisms Accuracy Computational modeling Impedance matching Nonhomogeneous media Information retrieval Encoding

来源：评论

学校读者我要写书评

暂无评论

Mixture Gaussian Distribution-Based Collaborative Reinforcement Learning for 3D UAV Localization Optimization Against Jamming Attacks

Mixture Gaussian Distribution-Based Collaborative Reinforcem...

引用

IEEE Conference on Global Communications (GLOBECOM)

作者： Yujiao Zhu Mingzhe Chen Sihua Wang Yuchen Liu Gaolei Li Changchuan Yin Tony Q.S. Quek Beijing Laboratory of Advanced Information Network Beijing University of Posts and Telecommunications Beijing China Information Systems Technology and Design Pillar Singapore University of Technology and Design Singapore Department of Electrical and Computer Engineering and Institute for Data Science and Computing University of Miami Coral Gables FL USA Department of Computer Science North Carolina State University Raleigh NC USA School of Electronic Information and Electrical Engineering Shanghai Jiao Tong University China

ISBN: (数字)9798350351255

ISBN: (纸本)9798350351262

In this paper, the optimization of unmanned aerial vehicle (UAV) localization under jamming attacks is studied. In the considered network, a base station (BS) collaborates with an active UAV to localize a target UAV. During this positioning process, a jamming UAV transmits discontinuous signals to passive UAVs to interfere the distance information measurement. To localize the target UAV under jamming attacks, the BS jointly use two localization methods: 1) generative adversarial network (GAN)-based positioning method and 2) time difference of arrival (TDOA)-based positioning method. Since GAN-based positioning method cannot defense in a strong jamming signal while TDOA-based positioning method may consume more energy and sacrifice localization accuracy, the BS must select an appropriate positioning method (GAN-based or TDOA-based methods) and four distance measurement information of passive UAVs to estimate the position of the target UAV. This problem is formulated as an optimization problem whose goal is to minimize the positioning error between the estimated and the ground truth positions of the target UAV while considering jamming attacks and the trajectory of passive UAVs. To solve this problem, we propose a mixture Gaussian distribution model-based collaborative reinforcement learning (RL) method which enables the active UAV to determine its transmit power and trajectory, and enables the BS to select the most appropriate subsets of distance measurement information and the optimal positioning method according to the movement of passive UAVs and the unknown jamming attack pattern of the jamming UAV. Simulation results show the proposed method can reduce the positioning error of the target UAV by up to 36.5% compared to the method that does not consider the GAN-based positioning method.

关键词： Location awareness Simulation Collaboration Reinforcement learning Gaussian distribution Autonomous aerial vehicles Distance measurement Trajectory Jamming Optimization

来源：评论

学校读者我要写书评

暂无评论

Dual attention mechanism object tracking algorithm based on Fully-convolutional Siamese network

Dual attention mechanism object tracking algorithm based on ...

引用

2021 International Conference on networking and network Applications, NaNA 2021

作者： Ma, Sugang Zhang, Zixian Zhang, Lei Chen, Yanping Yang, Xiaobao Pu, Lei Hou, Zhiqiang Xi'an University of Posts and Telecommunications School of Computer Science and Technology Shaanxi Xi'an710121 China Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing Xi'an Key Laboratory of Big Data and Intelligent Computing Xi'an University of Posts and Telecommunications Shaanxi Xi'an710121 China School of Information and Navigation Air Force Engineering University Shaanxi Xi'an710077 China School of Information Engineering Chang'an University Shaanxi Xi'an710064 China

ISBN: (纸本)9781665441582

In an effort to the problem of insufficient tracking performance of the Fully-convolutional Siamese network (SiamFC) in complex scenarios, a dual attention mechanism object tracking algorithm based on the Fully-convolutional Siamese network is proposed to improve the generalization capability of the tracker by ameliorating the robustness of the template characteristics. Firstly, a global context attention module is appended after the backbone network of SiamFC to ameliorate the power of original feature extraction from two dimensions of spatial and channel. Then, a coordinate attention module is introduced to augment the capability of feature extraction in the channel dimension. Finally, the model of the proposed algorithm is trained on the Got-10k dataset. Five related algorithms are tested on the OTB2015 dataset, the results of experiments manifest that our algorithm outperforms the baseline trackers, the success and precision rate of the proposed algorithm are improved by 3.3% and 6.3%. The average tracking speed is 145FPS, which can demand the requirement of real-time tracking. © 2021 IEEE.

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Research on long and short-term social recommendation based on convolutional and gated recurrent units

Research on long and short-term social recommendation based ...

引用

International Conference on Parallel and Distributed Systems (ICPADS)

作者： Zihe Jia Peng Xue Zhiqiang Dai Qian Gao Xiaomeng Zhang Key Laboratory of Computing Power Network and Information Security Ministry of Education Shandong Computer Science Center Qilu University of Technology (Shandong Academy of Sciences) Jinan China Shandong Engineering Research Center of Big Data Applied Technology Faculty of Computer Science and Technology Qilu University of Technology (Shandong Academy of Sciences) Jinan China Shandong Provincial Key Laboratory of Computer Networks Shandong Fundamental Research Center for Computer Science Jinan China

The development of the Internet has made people more closely related and has put forward higher requirements for recommendation models. Most recommendation models are studied only for the long-term interests of users. In this paper, the interaction time between the user and the item is introduced as auxiliary information in the model construction. Interaction time is used to determine users’ long-term preferences and short-term preferences. In this paper, temporal features are extracted by building a convolutional gated recurrent unit with attention neural network (CNN-GRU-Attention). Firstly, for the problem of accurate feature extraction, CNN are constructed to extract higher-level and more abstract features of themselves and transform high-dimensional data into low-dimensional data; secondly, for the problem of social temporality, GRU are used to not only extract temporal information, but also effectively reduce gradient dispersion, making model convergence and training easier; finally, Graph Attention networks are used to aggregate the social relationship information of users and items respectively, which constitute the final feature representation of users and items respectively. In particular, a modified cosine similarity is used to reduce the error caused by data insensitivity when constructing the social information of the item. In this study, simulation experiments are conducted on two publicly available datasets (Epinions and Ciao), and the experimental results show that the proposed recommended model performs better than other social recommendation models, improving the evaluation metrics of MAE and RMSE by 1.06%-1.33% and 1.19%-1.37%, respectively. The effectiveness of the model innovation is proved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

MFFLEN: Multi-Label Text Classification Based on Multi-Feature Fusion and Label Embedding

MFFLEN: Multi-Label Text Classification Based on Multi-Featu...

引用

IEEE International Conference on Systems, Man and Cybernetics

作者： Qiliang Gu Shuo Zhao Jianqiang Zhang Gongpeng Song Qin Lu Key Laboratory of Computing Power Network and Information Security Ministry of Education Shandong Computer Science Center Qilu University of Technology (Shandong Academy of Sciences) Jinan China Faculty of Computer Science and Technology Qilu University of Technology (Shandong Academy of Sciences) Shandong Engineering Research Center of Big Data Applied Technology Jinan China Shandong Provincial Key Laboratory of Computer Networks Shandong Fundamental Research Center for Computer Science Jinan China Shandong Branch of China Mobile Communication Group Design Institute Co. Jinan China

ISBN: (数字)9781665410205

ISBN: (纸本)9781665410212

To address the challenges associated with insufficiently extracting and utilizing features at different levels, overlooking the connection between label meanings and text, and facing problems of over-compression or information loss when extracting global information using recurrent neural networks in the field of multi-label text categorization, this paper introduces an innovative model known as MFFLEN (MultiFeature Fusion and Label Embedding Neural network). First, a back-translated enhanced label set is constructed by back-translated splicing enhancement of the original label set. This set, together with the text, is then input into the embedding layer, which consists of the pre-trained model of bert-baseChinese, thus establishing the initial connection between the text and the labels within the same vector space. Then, to comprehensively extract multi-level semantic features, the model uses a convolutional layer to extract local features and an embedding layer to extract sentence-level features. A bidirectional attention embedded GRU (BAE-GRU) layer is used to extract hybrid finegrained features, which are then fed into the attention layer to further extract hybrid labeled features based on labeling information. Finally, these three different types of features are fused and multi-label text classification results are obtained using a classifier. The experiments proved that the MFFLEN model achieved 73.82% and 88.44% macro-F1 and 88.00% and 88.86% micro-F1 on the two datasets CAIL 2018 Small and CAIL 2018 Split, respectively, which is better than other baseline models.

关键词： Measurement Recurrent neural networks Splicing Text categorization Semantics Multi label classification Logic gates Feature extraction Vectors data mining

来源：评论

学校读者我要写书评

暂无评论

Energy-Efficient Wireless Federated Learning via Doubly Adaptive Quantization

arXiv

引用

arXiv 2024年

作者： Han, Xuefeng Chen, Wen Li, Jun Ding, Ming Wu, Qingqing Wei, Kang Deng, Xiumei Mei, Zhen The Broadband Access Network Laboratory Shanghai Jiao Tong University Minhang200240 China School of Electrical and Optical Engineering Nanjing University of Science and Technology Ministry of Education Nanjing210094 China Data61 CSIRO SydneyNSW2015 Australia The Department of Computing Hong Kong Polytechnic University Hong Kong999077 Hong Kong

Federated learning (FL) has been recognized as a viable distributed learning paradigm for training a machine learning model across distributed clients without uploading raw data. However, FL in wireless networks still faces two major challenges, i.e., large communication overhead and high energy consumption, which are exacerbated by client heterogeneity in dataset sizes and wireless channels. While model quantization is effective for energy reduction, existing works ignore adapting quantization to heterogeneous clients and FL convergence. To address these challenges, this paper develops an energy optimization problem of jointly designing quantization levels, scheduling clients, allocating channels, and controlling computation frequencies (QCCF) in wireless FL. Specifically, we derive an upper bound identifying the influence of client scheduling and quantization errors on FL convergence. Under the long-term convergence constraints and wireless constraints, the problem is established and transformed into an instantaneous problem with Lyapunov optimization. Solving Karush-Kuhn-Tucker conditions, our closed-form solution indicates that the doubly adaptive quantization level rises with the training process and correlates negatively with dataset sizes. Experiment results validate our theoretical results, showing that QCCF consumes less energy with faster convergence compared with state-of-the-art baselines. Copyright © 2024, The Authors. All rights reserved.

关键词： Energy utilization

来源：评论

学校读者我要写书评

暂无评论

A Parallelizable Counterfactual Generation Method Based on Gradient Optimization

A Parallelizable Counterfactual Generation Method Based on G...

引用

International Conference on Parallel and Distributed Systems (ICPADS)

作者： Haorun Ding Xuesong Jiang Key Laboratory of Computing Power Network and Information Security Ministry of Education Shandong Computer Science Center Qilu University of Technology (Shandong Academy of Sciences) Jinan China Shandong Engineering Research Center of Big Data Applied Technology Faculty of Computer Science and Technology Qilu University of Technology (Shandong Academy of Sciences) Jinan China Shandong Provincial Key Laboratory of Computer Networks Shandong Fundamental Research Center for Computer Science Jinan China Quan Cheng Laboratory Jinan China State Key Laboratory of High-end Server & Storage Technology Jinan China

Post-hoc explanations are important for people to understand the predictions of explanation models. One class of methods in post-hoc explanation is the generation of counterfactuals, where a hypothetical example is obtained by perturbing the inputs to show how one could obtain a different prediction from the decision model. Counter-factual explanations should satisfy several properties: One is that counterfactuals generated under specific scenarios and constraints should be feasible for users, i.e., they should accommodate different causal constraints. The other is that it is more desirable for users to have a wider variety of viable examples, i.e., counterfactual diversity. To this end, we propose a parallelizable method based on gradient optimization. We partition the input feasible domains, perform counterfactual generation independently for each feasible domain, and then parallelize the counterfactual generation process for each feasible domain. Experimental results show that our approach effectively improves the diversity, sparsity, and proximity of the generated counterfactual instances on the public datasets Adult-Income, Lending-Club, German-Credit, and COMPAS compared to other models.

关键词：

来源：评论

学校读者我要写书评

暂无评论

A hybrid Chinese named entity recognition method for Internet of Things

A hybrid Chinese named entity recognition method for Interne...

引用

2022 International Conference on Algorithms, Microchips and network Applications

作者： Wang, Ying Wang, Zehao Li, Hong Liu, Peipei Li, Yachao Zuo, Fang Institute of Intelligence Networks System Henan University Henan475001 China Henan Experimental Teaching Demonstration Centre of Modern Network Technology Henan University Henan475001 China Intelligent Data Processing Engineering Research Center of Henan Province Henan University Henan475001 China Institute of Information Engineering Chinese Academy of Sciences Beijing China Henan International Joint Laboratory of Theories and Key Technologies on Intelligence Networks Henan University Henan475001 China

ISBN: (纸本)9781510653290

In order to facilitate government departments to assess security risks and prevent infiltration, it's necessary to recognize the IoT device from open data by the method of Named entity recognition (NER). In this study, a hybrid method of chinese NER which contains neural networks and templates is proposed to extract IoT device information from open web. The model first initiates characters of input through BERT, which can create character embedding while reduce dependency with external datasets. Then, the initiated vectors are fed into a multi-layers BiLSTM model to encode the contextual representation of each character, and a linear CRF layer after BiLSTM is used to assign the scores to every character for entity annotation. In addition, a dictionary and rule base is constructed based on the characteristics of IoT devices to correct the annotation results. We experimented the methods on the dataset of IoT, the results prove the proposed model achieves better recognition effect than other models, and the F-scores value is 88.9%. © COPYRIGHT SPIE. Downloading of the abstract is permitted for personal use only.

关键词： Internet of things

来源：评论

学校读者我要写书评

暂无评论

Deep Reinforcement Learning Based Scheduling Strategy in Blockchain Payment Channel networks

引用

IEEE/ACM Transactions on networking 2024年

作者： Ren, Zhe Wang, Zihao Li, Xinghua Miao, Yinbin Li, Zhuowen Liu, Ximeng Han, Lei Deng, Robert H. Xidian University State Key Laboratory of Integrated Services Networks The School of Cyber Engineering Xi'an710071 China Ministry of Education Engineering Research Center of Big Data Security Xi'an710071 China Fuzhou University College of Mathematics and Computer Science Fuzhou350108 China Fuzhou University Fujian Provincial Key Laboratory of Information Security of Network Systems Fuzhou350116 China Beijing Institute of Computer Technology and Application Beijing100000 China Singapore Management University School of Information Systems Singapore178902 Singapore

With the popularity of blockchains, low transaction throughput has become a significant bottleneck in applications such as cryptocurrencies. Payment channel networks (PCNs) have received attention as a way to improve throughput. However, due to the difficulty of predicting future transactions for nodes, the transactions are prone to failure when the channel balances do not meet required conditions. It has been shown that increasing buffers (queues) in PCNs can increase the success rate of transactions and throughput. Nevertheless, there is no effective transaction scheduling strategy in buffers when transaction values are flexible and variable. To solve this problem, we first formulate the Scheduling Problem in PCNs (named PSP), and then prove it is NP-hard. We design a neural network solver based on the Sequence to Sequence (Seq2Seq) architecture and train the solver using the reinforcement learning method. With the solver, we first give two scheduling strategies to maximize transaction throughput, and then design a PCN simulator for performance evaluation. Extensive experiments are conducted to show the superiority and various performances of our proposal and illustrate that our proposal can get a significant advantage in terms of the transaction throughput compared to the existing works. © 2024 IEEE.

关键词： Deep reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Sequence-to-Sequence Knowledge Graph Completion Based On Gated Attention Unit

Sequence-to-Sequence Knowledge Graph Completion Based On Gat...

引用

International Conference on Parallel and Distributed Systems (ICPADS)

作者： Fengge Yi Xiumei Wei Xiaojing Liu Xuesong Jiang Key Laboratory of Computing Power Network and Information Security Ministry of Education Shandong Computer Science Center Qilu University of Technology (Shandong Academy of Sciences) Jinan China Shandong Engineering Research Center of Big Data Applied Technology Faculty of Computer Science and Technology Qilu University of Technology (Shandong Academy of Sciences) Jinan China Shandong Provincial Key Laboratory of Computer Networks Shandong Fundamental Research Center for Computer Science Jinan China State Key Laboratory of High-end Server & Storage Technology Jinan China

We present GauKGT5, a sequence-to-sequence model proposed for knowledge graph completion (KGC). Our research extends the KGT5 model, a recent sequence-to-sequence link prediction (LP) model. GauKGT5 takes advantage of textual characteristics inherent in the knowledge graph, exhibiting a small model size. However, KGT5’s proficiency in link prediction necessitates the ensemble with a knowledge graph embedding model, which itself poses challenges due to its substantial size and expense. By integrating the Gated Attention Unit into the KGT5 model and directly applying it to the encoder-decoder structure, we achieve improved contextual dependency capturing within the sequence, resulting in enhanced prediction accuracy, accelerated training speed, and enhanced computational efficiency. At the same time, we introduce parallel computing as a means to enhance the efficiency of model training and inference within the XPU distributed computing environment.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：