Search Results - Inner Mongolia University Library

Frontiers of Computer Science, 2025, Vol. 19, No. 4, pp. 43-57

Authors: Xiao MA; Shen-Yi ZHAO; Zhao-Heng YIN; Wu-Jun LI. National Key Laboratory for Novel Software Technology, Department of Computer Science and Technology, Nanjing University, Nanjing 210023, China; Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720-1770, USA

Exploration strategy design is a challenging problem in reinforcement learning (RL), especially when the environment contains a large state space or sparse *** exploration, the agent tries to discover unexplored (novel) areas or high reward (quality) *** existing methods perform exploration by only utilizing the novelty of *** novelty and quality in the neighboring area of the current state have not been well utilized to simultaneously guide the agent’s *** address this problem, this paper proposes a novel RL framework, called clustered reinforcement learning (CRL), for efficient exploration in *** adopts clustering to divide the collected states into several clusters, based on which a bonus reward reflecting both novelty and quality in the neighboring area (cluster) of the current state is given to the *** leverages these bonus rewards to guide the agent to perform efficient ***, CRL can be combined with existing exploration strategies to improve their performance, as the bonus rewards employed by these existing exploration strategies solely capture the novelty of *** on four continuous control tasks and six hard-exploration Atari-2600 games show that our method can outperform other state-of-the-art methods to achieve the best performance.

Keywords: deep reinforcement learning; exploration; count-based method; clustering; K-means
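The cluster-based bonus described in the abstract above can be illustrated with a small sketch. The abstract does not give the bonus formula, so everything specific here is an assumption: the function names (`nearest`, `kmeans`, `cluster_bonus`), the parameters `beta` and `eta`, and the bonus itself, which is taken to be a count-based novelty term plus the cluster's average reward as a quality term. It is not the authors' CRL implementation.

```python
from math import dist, sqrt


def nearest(centroids, point):
    """Index of the centroid closest to point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda i: dist(centroids[i], point))


def kmeans(points, k, iters=20):
    """Plain Lloyd's K-means; centroids start at the first k points."""
    centroids = [tuple(p) for p in points[:k]]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(centroids, p)].append(p)
        # Move each centroid to the mean of its group; keep it if the group is empty.
        centroids = [
            tuple(sum(coord) / len(g) for coord in zip(*g)) if g else centroids[i]
            for i, g in enumerate(groups)
        ]
    return centroids


def cluster_bonus(states, rewards, centroids, beta=0.1, eta=1.0):
    """Hypothetical bonus per state: beta * (novelty + eta * quality).

    novelty = 1 / sqrt(number of collected states in the state's cluster)
    quality = average reward observed in that cluster
    Both terms are assumptions made for illustration only.
    """
    k = len(centroids)
    counts = [0] * k
    reward_sums = [0.0] * k
    labels = [nearest(centroids, s) for s in states]
    for label, r in zip(labels, rewards):
        counts[label] += 1
        reward_sums[label] += r
    bonuses = []
    for label in labels:
        novelty = 1.0 / sqrt(counts[label])
        quality = reward_sums[label] / counts[label]
        bonuses.append(beta * (novelty + eta * quality))
    return bonuses


states = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (0.0, 0.5)]
rewards = [0.0, 0.0, 1.0, 0.0]
centroids = kmeans(states, k=2)
bonuses = cluster_bonus(states, rewards, centroids)
```

In this toy data the isolated, rewarded state at (10, 10) sits alone in its cluster, so it receives a larger bonus than the three states in the crowded, unrewarded cluster.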

Question Selection for Multi-Modal Code Search Synthesis using Probabilistic Version Spaces

IEEE Transactions on Software Engineering, 2025, Vol. 51, No. 6, pp. 1724-1744

Authors: Wu, Jiarong; Jiang, Yanyan; Wei, Lili; Xu, Congying; Cheung, Shing-Chi; Xu, Chang. The Hong Kong University of Science and Technology, Department of Computer Science and Engineering, Hong Kong; McGill University, Department of Electrical and Computer Engineering, Montreal, Canada; Nanjing University, State Key Laboratory for Novel Software Technology, Department of Computer Science and Technology, Nanjing, China

Searching the occurrences of specific code patterns (code search) is a common task in software engineering, and programming by example (PBE) techniques have been applied to ease customizing code patterns. However, previous PBE tools only synthesize programs meeting the input-output examples, which may not always align with the user intent. To bridge this gap, this paper proposes Excalibur, a multi-modal (example and natural language description) and interactive synthesizer for code search. Excalibur ensures that the generated programs are correct for the provided examples (soundness) and include the user-intended program (bounded completeness). Furthermore, Excalibur helps the user identify the user-intended program through question-answer interaction. To minimize the required interaction efforts, question selection is crucial. To improve question selection for code search, we propose probabilistic version spaces (ProbVS), in which the user-intended program’s probability is high and others are low. ProbVS combines traditional version spaces for compactly representing extensive programs and large language models (on the user-provided natural language description) for adjusting programs’ probabilities to align with users’ intents. Extensive experiments on a benchmark of 44 tasks demonstrated the effectiveness of Excalibur and ProbVS and demystified how ProbVS affects probability distributions and how the configurable parameters affect ProbVS. © 1976-2012 IEEE.

Keywords: Normal distribution

Security experimental framework of trajectory planning for autonomous vehicles

International Journal of Intelligent Networks, 2024, Vol. 5, No. 1, pp. 315-324

Authors: Al-sheyab, Sujoud; Al-shara, Zakarea; Al-khaleel, Osama. Department of Computer Engineering, Jordan University of Science and Technology, Irbid, Jordan; Department of Software Engineering, Jordan University of Science and Technology, Irbid, Jordan

In the contemporary landscape, autonomous vehicles (AVs) have emerged as a prominent technological advancement globally. Despite their widespread adoption, significant hurdles remain, with security standing out as a critical concern. The potential for attacks within AV networks, exemplified by the Trajectory Privacy Attack on Autonomous Driving (T-PAAD), underscores the urgency for robust security measures. Unfortunately, existing simulations for preemptively assessing the T-PAAD attack's impact are scarce. This paper introduces the Security Experimental Framework for Autonomous Vehicles (SEFAV), designed to address this gap by providing a versatile platform for simulating security scenarios in AV environments. SEFAV is cross-platform and compatible with different operating systems such as Windows and Linux, enhancing accessibility for researchers and practitioners. Our primary focus lies in showcasing the T-PAAD attack within our framework, highlighting its efficacy in evaluating and fortifying AV security. © 2024 The Authors

Keywords: Vehicle routing

Predicting Diabetes Disease Occurrence Using Logistic Regression: An Early Detection Approach

Iraqi Journal for Computer Science and Mathematics, 2024, Vol. 5, No. 1, pp. 160-167

Authors: Abdalrada, Ahmad Shaker; Neamah, Ali Fahem; Murad, Hayder. Department of Software, Faculty of Computer Science and Information Technology, Wasit University, Iraq; Department of Computer, Faculty of Computer Science and Information Technology, Wasit University, Iraq; College of Medicine, Wasit University, Iraq

Diabetes disease is prevalent worldwide, and predicting its progression is crucial. Several models have been proposed to predict the disease. Those models only determine the disease label, leaving the likelihood of developing the disease unclear. Proposing a model for predicting the progression of the disease therefore becomes essential. This article proposes a logistic regression model to anticipate the likelihood of Diabetes syndrome incidence. The model exploits the capabilities of logistic regression by using the sigmoid function. The model's performance was evaluated using the Pima Indians Diabetes dataset and demonstrated high accuracy, sensitivity, and specificity. The prediction accuracy rate was 77.6%, with a sensitivity of 72.4%, specificity of 79.6%, Type I Error of 27.6%, and Type II Error of 20.4%. Furthermore, the model indicates the feasibility of using laboratory tests, such as Pregnancies, Glucose, Blood Pressure, BMI, and DiabetesPedigreeFunction, to predict disease progress. The proposed model can aid patients and physicians in understanding the disease's progression and implementing timely interventions. © 2024 College of Education, Al-Iraqia University. All rights reserved.

Keywords: Forecasting
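As a rough illustration of how a logistic regression model turns features into a likelihood through the sigmoid function, and how the reported rates relate to a confusion matrix, consider the sketch below. The function names, weights, and confusion-matrix counts are hypothetical and are not taken from the paper; the counts are chosen only so that sensitivity and specificity match the percentages quoted in the abstract.

```python
import math


def sigmoid(z: float) -> float:
    """Numerically stable logistic function, mapping any real z into (0, 1)."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict_proba(weights, bias, features):
    """Likelihood of the positive class: sigmoid(bias + w . x)."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)


def rates(tp, fp, tn, fn):
    """Accuracy, sensitivity, and specificity from confusion-matrix counts."""
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": tp / (tp + fn),  # true-positive rate
        "specificity": tn / (tn + fp),  # true-negative rate
    }


# Hypothetical weights for two standardized features (illustration only).
p = predict_proba(weights=[0.9, 0.4], bias=-0.5, features=[1.2, 0.3])

# Hypothetical counts giving sensitivity 72.4% and specificity 79.6%.
r = rates(tp=724, fp=204, tn=796, fn=276)
miss_rate = 1.0 - r["sensitivity"]         # 27.6% of positives are missed
false_alarm_rate = 1.0 - r["specificity"]  # 20.4% of negatives are flagged
```

The two error rates quoted in the abstract, 27.6% and 20.4%, are the complements of the reported sensitivity and specificity respectively.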

Solving Sparse Reward Tasks Using Self-Balancing Exploration and Exploitation

Journal of Internet Technology, 2025, Vol. 26, No. 3, pp. 293-301

Authors: Kong, Yan; Wei, Junfeng; Hsia, Chih-Hsien. School of Computer and Software, Nanjing University of Information Science and Technology, China; Department of Computer Science and Information Engineering, National Ilan University, Taiwan; Department of Business Administration, Chaoyang University of Technology, Taiwan

A core challenge in applying deep reinforcement learning (DRL) to real-world tasks is the sparse reward problem, and reward shaping has been one effective method to solve it. However, due to the enormous state space and sparse rewards in the real world, a large number of useless samples may be generated, leading to reduced sample efficiency and potential local optima. To address this issue, this study proposes a self-balancing method of exploration and exploitation to solve the issue of sparse rewards. Firstly, we shape the reward function according to the evaluated progress, to guide the agent’s learning of high-reward samples. Secondly, we construct a dual-trajectory exploration network, which provides intrinsic rewards based on the novelty of states and the trajectory difference of sibling agents to encourage the agent to explore and adjust the balance between exploration and exploitation. This method effectively prevents the generation of a large amount of useless training data during the interaction between the agent and the environment, resolves local optimal dilemmas through state novelty, and adjusts the strategy in a timely manner to solve sparse reward tasks. Our method outperforms basic reinforcement learning (RL) and curiosity-driven incentives in these experimental tasks. The self-balancing exploration and exploitation approach in our research provides a new perspective and effective solution for addressing the problem of sparse rewards, thereby advancing the application of DRL in real-world problems and achieving greater success. © 2025 Taiwan Academic Network Management Committee. All rights reserved.

Keywords: Deep learning

Understanding and Detecting Inefficient Image Displaying Issues in Android Apps

Journal of Computer Science & Technology, 2024, Vol. 39, No. 2, pp. 434-459

Authors: 李文杰; 马骏; 蒋炎岩; 许畅; 马晓星. State Key Laboratory of Novel Software Technology, Nanjing University, Nanjing 210023, China; Department of Computer Science and Technology, Nanjing University, Nanjing 210023, China

Mobile applications (apps for short) often need to display ***, inefficient image displaying (IID) issues are pervasive in mobile apps, and can severely impact app performance and user *** paper first establishes a descriptive framework for the image displaying procedures of IID *** on the descriptive framework, we conduct an empirical study of 216 real-world IID issues collected from 243 popular open-source Android apps to validate the presence and severity of IID issues, and then shed light on these issues’ characteristics to support research on effective issue *** the findings of this study, we propose a static IID issue detection tool TAPIR and evaluate it with 243 real-world Android ***, 49 and 64 previously-unknown IID issues in two different versions of 16 apps reported by TAPIR are manually confirmed as true positives, respectively, and 16 previously-unknown IID issues reported by TAPIR have been confirmed by developers and 13 have been ***, we further evaluate the performance impact of these detected IID issues and the performance improvement if they are *** results demonstrate that the IID issues detected by TAPIR indeed cause significant performance degradation, which further shows the effectiveness and efficiency of TAPIR.

Keywords: Android application (app); inefficient image displaying (IID); performance; empirical study; static analysis

A survey on cross-user federated recommendation

Science China (Information Sciences), 2025, Vol. 68, No. 4, pp. 7-32

Authors: Enyue YANG; Yudi XIONG; Wei YUAN; Weike PAN; Qiang YANG; Zhong MING. College of Computer Science and Software Engineering, Shenzhen University; School of Electrical Engineering and Computer Science, The University of Queensland; WeBank AI Lab, WeBank; Department of Computer Science and Engineering, Hong Kong University of Science and Technology; College of Big Data and Internet, Shenzhen Technology University; Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ)

Recommender systems are effective in mitigating information overload, yet the centralized storage of user data raises significant privacy concerns. Cross-user federated recommendation (CUFR) provides a promising distributed paradigm to address these concerns by enabling privacy-preserving recommendations directly on user devices. In this survey, we review and categorize current progress in CUFR, focusing on four key aspects: privacy, security, accuracy, and efficiency. Firstly, we conduct an in-depth privacy analysis, discuss various cases of privacy leakage, and then review recent methods for privacy protection. Secondly, we analyze security concerns and review recent methods for untargeted and targeted *** untargeted attack methods, we categorize them into data poisoning attack methods and parameter poisoning attack methods. For targeted attack methods, we categorize them into user-based methods and item-based methods. Thirdly, we provide an overview of the federated variants of some representative methods, and then review the recent methods for improving accuracy from two categories: data heterogeneity and high-order information. Fourthly, we review recent methods for improving training efficiency from two categories: client sampling and model compression. Finally, we conclude this survey and explore some potential future research topics in CUFR.

Keywords: cross-user federated recommendation; federated recommendation; federated learning; recommender systems; user privacy

An aspect-based sentiment analysis model for Arabic game reviews based on hybrid transformers models

Neural Computing and Applications, 2025, Vol. 37, No. 16, pp. 10309-10331

Authors: Hammad, Mahmoud; AbuEnnab, Noor; Al-Refai, Mohammed. IT Department, Ajman University, Ajman, United Arab Emirates; Software Engineering Department, Jordan University of Science and Technology, Irbid, Jordan; Computer Science Department, Jordan University of Science and Technology, Irbid, Jordan

Aspect-based sentiment analysis (ABSA) is a natural language processing (NLP) technique to determine the various sentiments of a customer in a single comment regarding different aspects. The increasing online data content generated by interested customers and reviewers motivated researchers and data scientists to conduct ABSA. ABSA has become increasingly popular in recent years due to its versatility in e-commerce, social media, and customer feedback analysis. However, ABSA faces several significant challenges, including determining the aspects and their sentiment polarities (positive, negative, or neutral) in a given text. Moreover, ABSA faces particular challenges in non-English languages such as Arabic due to the lack of resources and mature models. Typically, ABSA tackles one or more of the ABSA research tasks: (T1) aspect term extraction, (T2) aspect term polarity, (T3) aspect category identification, and (T4) aspect category polarity. To identify the aspects and their corresponding sentiment polarities in a given text, accurate and efficient NLP techniques are required. Despite growing interest in Arabic ABSA, the lack of annotated datasets and pre-trained models has hindered its development. In this research, we have collected a dataset of Arabic game reviews and annotated them using three annotators, and then we trained an ABSA deep learning model based on the BERT pre-trained model combined with zero-shot learning (ZSL) to tackle all the four aforementioned tasks. Our best performing model achieved a high accuracy on all four tasks with an accuracy of 91.61% on T1, 90.99% on T2, 79.08% on T3, and 88.17% on T4. Finally, we compared our model’s accuracy with the state-of-the-art Arabic-based ABSA models on different datasets. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2025.

Keywords: Sales

DERLight: A Deep Reinforcement Learning Traffic Light Control Algorithm with Dual Experience Replay

Journal of Internet Technology, 2024, Vol. 25, No. 1, pp. 79-86

Authors: Yang, Zhichao; Kong, Yan; Hsia, Chih-Hsien. School of Computer and Software, Nanjing University of Information Science and Technology, China; Department of Computer Science and Information Engineering, National Ilan University, Taiwan; Department of Business Administration, Chaoyang University of Technology, Taiwan

In recent years, with the increasingly severe traffic environment, most cities are facing various traffic congestion problems, and the demand for intelligent regulation of traffic signals is also increasing. In this study, we propose a new intelligent traffic light control algorithm, dual experience replay light (DERLight), which designs a dual experience replay training mechanism based on the classic deep Q network (DQN) framework and considers the dynamic epoch function. The results show that, compared with some state-of-the-art algorithms, DERLight can shorten the average travel time of vehicles, increase the throughput at intersections, and speed up the convergence of the network. In addition, the design of this algorithm framework is not limited to the field of intelligent transportation and is transferable to some other fields. © 2024 Taiwan Academic Network Management Committee. All rights reserved.

Keywords: Reinforcement learning

Anomaly Detection Model of Time Segment Power Usage Behavior Using Unsupervised Learning

Journal of Internet Technology, 2024, Vol. 25, No. 3, pp. 455-463

Authors: Ho, Wen-Jen; Hsieh, Hsin-Yuan; Tsai, Chia-Wei. Software Technology Institute, Institute for Information Industry, Taiwan; Department of Computer Science and Information Engineering, National Taitung University, Taiwan; Department of Computer Science and Information Engineering, National Taichung University of Science and Technology, Taiwan

In Taiwan, the current electricity prices for residential users remain relatively low. This results in a diminished incentive for these users to invest in energy-saving improvements. Consequently, devising strategies to encourage residential users to adopt energy-saving measures becomes a vital research area. Grounded in behavioral science, this study introduces a feasible approach where an energy management system provides alerts and corresponding energy-saving recommendations to residential users upon detecting abnormal electricity consumption behavior. To pinpoint anomalous electricity usage within specific time segments, this research employs an unsupervised machine learning method, developing an anomaly detection model for the overall electricity consumption behavior of residential users. The model focuses on analyzing 2-hour intervals of electricity consumption, enabling more effective detection of abnormal usage patterns. It is trained using power consumption data collected from five actual residential users as part of an experimental study. The results indicate that the proposed anomaly detection model achieves performance metrics such as Precision, Recall, and F1-score of 0.90 or above, showcasing its potential for practical implementation. © 2024 Taiwan Academic Network Management Committee. All rights reserved.

Keywords: Energy conservation
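The idea of flagging abnormal consumption within 2-hour segments can be sketched as follows. The abstract does not name the unsupervised method used, so this example substitutes a simple robust z-score (median and median absolute deviation) as a stand-in. The function names, the threshold of 3.5, and the sample data are all assumptions for illustration, not the paper's model.

```python
from statistics import median


def segment_totals(hourly_kwh):
    """Sum 24 hourly readings into twelve 2-hour segment totals."""
    if len(hourly_kwh) != 24:
        raise ValueError("expected 24 hourly readings")
    return [hourly_kwh[i] + hourly_kwh[i + 1] for i in range(0, 24, 2)]


def anomalous_segments(history, today, threshold=3.5):
    """Return indices of today's 2-hour segments that deviate from history.

    history: list of past days, each a list of 12 segment totals.
    today:   list of 12 segment totals for the day being checked.
    A segment is flagged when its robust z-score exceeds threshold
    (stand-in technique; the threshold value is an assumption).
    """
    flagged = []
    for seg in range(12):
        past = [day[seg] for day in history]
        med = median(past)
        mad = median(abs(x - med) for x in past)
        # 1.4826 rescales the MAD to match a standard deviation for normal data;
        # the tiny floor avoids division by zero when history is constant.
        scale = max(1.4826 * mad, 1e-9)
        if abs(today[seg] - med) / scale > threshold:
            flagged.append(seg)
    return flagged


# Five hypothetical past days with slowly varying, flat consumption.
history = [segment_totals([0.5 + 0.01 * d] * 24) for d in range(5)]

# A hypothetical day with a spike between 18:00 and 20:00 (segment index 9).
hourly_today = [0.5 + 0.01 * 2] * 24
hourly_today[18] = 1.5
hourly_today[19] = 1.5
flagged = anomalous_segments(history, segment_totals(hourly_today))
```

Comparing each segment only against the same time-of-day segment on earlier days keeps normal daily rhythms, such as an evening peak, from being flagged as anomalies.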
