This paper presents a strategy for discovering flaws in web applications through machinelearning (ML). Web-based applications are especially troublesome to examine attributed to their variety and extensive usage of c...
详细信息
software defect prediction is an essential activity for software quality assurance. Various researchers have developed a large number of models for software defect prediction based on machinelearning. Different model...
详细信息
The memory dirty page prediction technology can effectively predict whether a memory page will be modified (dirty) at the next moment, and is widely used in virtual machine migration, container migration and other fie...
详细信息
In our increasingly digital world, cyber security has become a paramount concern, with threats evolving from malicious software to sophisticated hacking techniques. To effectively combat these challenges, the integrat...
详细信息
The quality of a software can be defined with respect to the parameters of reliability and consistency. Both these parameters are strongly associated with software errors. software engineers ensure the quality by prio...
详细信息
Aiming at the problem of unstable machine vision recognition accuracy caused by various kinds and inconspicuous features of workpiece defects, this paper puts forward a workpiece defect recognition algorithm based on ...
详细信息
Spectrum scarcity is a major problem in this millennial communication engineering. machinelearning based spectrum sensing approaches are getting more attention among research community. The spectrum sensing technique...
详细信息
In recent years, there has been a significant rise in the number of software startups globally, driven by advances in technology and increasing reliance on digital solutions. These startups are crucial in shaping the ...
详细信息
Deep reinforcement learning has made significant development in recent years, and it is currently applied not only in simulators and games but also in embedded systems. However, when implemented in a real-world contex...
详细信息
ISBN:
(纸本)9781665464598
Deep reinforcement learning has made significant development in recent years, and it is currently applied not only in simulators and games but also in embedded systems. However, when implemented in a real-world context, reinforcement learning is frequently shown to be unstable and incapable of adapting to realistic situations, particularly when directing a large number of agents. In this paper, we develop a decentralized architecture for reinforcement learning to allow multiple agents to learn optimal control policies on their own devices of the same kind but in varied environments. For such multiple agents, the traditional centralized learning algorithm usually requires a costly or time-consuming effort to develop the best-regulating policy and is incapable of scaling to a large-scale system. To address this issue, we propose a decentralized reinforcement learning algorithm (DecRL) and information exchange scheme for each individual device, in which each agent shares the individual learning experience and information with other agents based on local model training. We incorporate the algorithm into each agent in the proposed collaborative architecture and validate it in the telecommunication domain under emergency conditions, in which a macro base station (BS) is broken due to a natural disaster, and three unmanned aerial vehicles carrying BSs (UAV-BSs) are deployed to provide temporary coverage for missioncritical (MC) users in the disaster area. Based on the findings, we show that the proposed decentralized reinforcement learning algorithm can successfully support multi-agent learning, while the learning speed and service quality can be further enhanced.
The most practical configurable network is software Defined Networking (SDN). SDN delineates the network's control layer from the data layer, while operationally integrating them. With global visibility into netwo...
详细信息
暂无评论