In this paper, a method using deep reinforcement learning is proposed to deal with the 3D online bin packing problem. The packing objects are not limited to several specific or fixed cuboid objects, but are composed o...
In this paper, a method using deep reinforcement learning is proposed to deal with the 3D online bin packing problem. The packing objects are not limited to several specific or fixed cuboid objects, but are composed of more than a thousand objects and randomly generated cuboids, which make the trained policy network can handle novel unknown objects. In addition, the posture of the object in the box can be any angle, not limited to horizontal and vertical. In the proposed method, four voxel maps are used as inputs, and a Soft Actor-Critic (SAC) algorithm is used to train a policy network. On the other hand, in order to deal with various objects with irregular shapes, a packing task simulator with physics engine enable the policy network to learn the state of falling and stacking objects. In terms of training environment of deep reinforcement learning, the proposed method can be applied to boxes of different sizes because of the scalable image information. Moreover, a reward function and a training strategy with gradually increasing difficulty are proposed to effectively improve the learning of policy network. In terms of experimental results, the results on a random object bin packing task in a simulator illustrate the effectiveness of the proposed method.
The placement of distributed generation (DG) units in power systems is an efficient way for energy loss reduction, especially when the penetration of DG in modern systems is growing due to their impacts on environment...
详细信息
The placement of distributed generation (DG) units in power systems is an efficient way for energy loss reduction, especially when the penetration of DG in modern systems is growing due to their impacts on environmental sustainability. On the other hand, load variations and methods of electricity consumption affect energy losses amount. Therefore, power demand variations have an essential role in the determination of energy loss amount and optimal generation of DG. However, considering the variability of load level in the DG allocation problem increases the burden and computational time, and neglecting it causes the energy losses to be calculated inaccurately. Therefore, this paper aims to evaluate the effect of load patterns on renewable DG allocation plans in order to find out the importance of considering load variations in energy loss minimization via DG placement. The analysis has been conducted on 7-, 12-, 16-, 28-, 30-, 33-, 59-, 69-, 70-, 84-, and 119-bus distribution systems by a classic optimization tool named AMPL.
Virtualization technologies are still growing bigger and faster. Despite the greatness of its advancement, the costume industry is still very accessible when it comes to real trials. Off-the-shelf stuff are inadequate...
Virtualization technologies are still growing bigger and faster. Despite the greatness of its advancement, the costume industry is still very accessible when it comes to real trials. Off-the-shelf stuff are inadequate details for the desired individual to assess its in-depth utility for each garment trying on for a second, including custom stuff are much harder to try out right away. To this end, 2D image-based 3D reconstruction inclusive of touchable-virtualized space is accessible easier to stuff details for mans' decision making in purchasing. We establish the overall end-to-end pipeline from reconstruction until visualization for one instance to be triable on its stuff for a moment. As an expectation, our proposed approach can bring objects into the experimental area and use them immediately without any obstacle.
As the complexity of tasks addressed through reinforcement learning (RL) increases, the definition of reward functions also has become highly complicated. We introduce an RL method aimed at simplifying the reward-shap...
详细信息
The article introduces a two-dimensional polynomial regression model for the predictive analysis of glucose concentration in a fractal microwave sensor NP model, utilizing frequency and transmission coefficient differ...
详细信息
A membership inference attack (MIA) identifies if an instance was included in the victim model's train dataset. Without an appropriate defense mechanism, MIA can result in serious privacy breaches. Although severa...
详细信息
Cobalt titanate, CoTiO3, is a honeycomb antiferromagnet recently confirmed experimentally to host Dirac magnons, topological spin-orbit excitons, and chiral phonons. Here, we investigate a magnon gap at the zone cente...
详细信息
Cobalt titanate, CoTiO3, is a honeycomb antiferromagnet recently confirmed experimentally to host Dirac magnons, topological spin-orbit excitons, and chiral phonons. Here, we investigate a magnon gap at the zone center which calls for a refined spin Hamiltonian. We propose a microscopic model for the magnon gap and attribute it to a lattice-distortion (phonon)-induced higher-order spin interaction. Strong magnetoelastic coupling in CoTiO3 is also evident in Raman spectra, in which the magnetic order exerts a stronger influence on phonons corresponding to in-plane ionic motions than those with out-of-plane motions. We further examine the evolution of the zone-center magnons in a high magnetic field up to 18.5 T via THz absorption spectroscopy measurements. Based on this field dependence, we propose a spin Hamiltonian that not only agrees with magnon dispersion measured by inelastic neutron scattering but also includes fewer exchange constants and a realistic anisotropy term. Our work highlights the broad implications of magnetoelastic coupling in the study of topologically protected bosonic excitations.
This paper proposes an economic model predictive control (EMPC) design for a Direct Contact Membrane Distillation powered by a solar collector system which aims at enhancing its economical performances. A differential...
详细信息
This paper proposes an economic model predictive control (EMPC) design for a Direct Contact Membrane Distillation powered by a solar collector system which aims at enhancing its economical performances. A differential algebraic equations-based model is used for the design of the EMPC control. Moreover, a nonlinear observer is developed for the estimation of the unmeasured state. A neural network is proposed to predict the unknown solar irradiance for future horizon where a solar model provides temperature predictions. The proposed control design has been validated in simulation using data provided by a partial differential equation-based model mimicking the real plant.
Load power can be changed by voltage fluctuation of network buses, in which reconfiguring the topology of distribution systems impacts the magnitude of bus voltages. Only a few papers have considered voltage-dependent...
详细信息
Load power can be changed by voltage fluctuation of network buses, in which reconfiguring the topology of distribution systems impacts the magnitude of bus voltages. Only a few papers have considered voltage-dependent loads in their reconfiguration models but introduced nonlinear formulations or highly approximated linear approaches. While nonlinear models can be solved by metaheuristic algorithms without guaranteeing the optimality of solutions or may be implemented by commercial nonlinear solvers, they require intensive and time-consuming computations. Moreover, linearized models are highly approximated by piecewise linear functions with many unknown parameters. Thus, this paper presents the effective design of the reconfiguration problems, including voltage dependency of loads, which can be solved using commercial linear solvers. The proposed models are precise enough to find accurate results for the reconfiguration problem and are adequately fast to converge to optimal solutions. The results show that the proposed solutions not only reduce active power losses significantly but increase the minimum voltage of the system effectively.
In the realm of autonomous agents, ensuring safety and reliability in complex and dynamic environments remains a paramount challenge. Safe reinforcement learning addresses these concerns by introducing safety constrai...
详细信息
暂无评论