We demonstrate 6-bit DAC resolution with an ultra-compact slow-light electro-optic modulator. The 10× modulation length reduction enables 31 × compute density, 1.17 × energy efficiency, and 36.1 × ...
详细信息
We devise a control-theoretic reinforcement learning approach to support direct learning of the optimal policy. We establish various theoretical properties of our approach, such as convergence and optimality of our an...
详细信息
In this paper, the problem of backward compatibility of active disturbance rejection control (ADRC) is investigated. The goal is to contextualize ADRC to deliver its interpretations from the established field of linea...
详细信息
Contrary to traditional deterministic notions of algorithmic fairness, this paper argues that fairly allocating scarce resources using machine learning often requires randomness. We address why, when, and how to rando...
Contrary to traditional deterministic notions of algorithmic fairness, this paper argues that fairly allocating scarce resources using machine learning often requires randomness. We address why, when, and how to randomize by proposing stochastic procedures that more adequately account for all of the claims that individuals have to allocations of social goods or opportunities.
Non-orthogonal multiple access is one of the best methods for addressing the needs of 5G wireless services (NOMA). The proliferation of mobile devices in recent years has increased the importance of cooperative spectr...
详细信息
In the last few years Artificial Intelligence (AI) is emerging as a game changer in many areas of society and, in particular, its integration in medicine heralds a transformative approach towards personalized healthca...
详细信息
In the field of autonomous Unmanned Aerial Vehicles (UAVs) landing, conventional approaches fall short in delivering not only the required precision but also the resilience against environmental disturbances. Yet, lea...
详细信息
ISBN:
(数字)9798331516239
ISBN:
(纸本)9798331516246
In the field of autonomous Unmanned Aerial Vehicles (UAVs) landing, conventional approaches fall short in delivering not only the required precision but also the resilience against environmental disturbances. Yet, learning-based algorithms can offer promising solutions by leveraging their ability to learn the intelligent behaviour from data. On one hand, this paper introduces a novel multimodal transformer-based Deep Learning detector, that can provide reliable positioning for precise autonomous landing. It surpasses standard approaches by addressing individual sensor limitations, achieving high reliability even in diverse weather and sensor failure conditions. It was rigorously validated across varying environments, achieving optimal true positive rates and average precisions of up to 90%. On the other hand, it is proposed a Reinforcement Learning (RL) decision-making model, based on a Deep Q-Network (DQN) rationale. Initially trained in simulation, its adaptive behaviour is successfully transferred and validated in a real outdoor scenario. Furthermore, this approach demonstrates rapid inference times of approximately 5ms, validating its applicability on edge devices.
In India, water wastage in agricultural fields becomes a challengingissue and it is needed to minimize the loss of water in the irrigation *** the conventional irrigation system needs massive quantity of waterutilizat...
详细信息
In India, water wastage in agricultural fields becomes a challengingissue and it is needed to minimize the loss of water in the irrigation *** the conventional irrigation system needs massive quantity of waterutilization, a smart irrigation system can be designed with the help of recenttechnologies such as machine learning (ML) and the Internet of Things (IoT).With this motivation, this paper designs a novel IoT enabled deep learningenabled smart irrigation system (IoTDL-SIS) technique. The goal of theIoTDL-SIS technique focuses on the design of smart irrigation techniquesfor effectual water utilization with less human interventions. The proposedIoTDL-SIS technique involves distinct sensors namely soil moisture, temperature, air temperature, and humidity for data acquisition purposes. The sensordata are transmitted to the Arduino module which then transmits the sensordata to the cloud server for further process. The cloud server performs the dataanalysis process using three distinct processes namely regression, clustering,and binary classification. Firstly, deep support vector machine (DSVM) basedregression is employed was utilized for predicting the soil and environmentalparameters in advances such as atmospheric pressure, precipitation, solarradiation, and wind speed. Secondly, these estimated outcomes are fed intothe clustering technique to minimize the predicted error. Thirdly, ArtificialImmune Optimization Algorithm (AIOA) with deep belief network (DBN)model receives the clustering data with the estimated weather data as inputand performs classification process. A detailed experimental results analysisdemonstrated the promising performance of the presented technique over theother recent state of art techniques with the higher accuracy of 0.971.
Terahertz (THz) sensing technology enables 6G communication, detection of biological and chemical hazardous agents, cancer detection, monitoring of industrial processes and products, and detection of mines and explosi...
详细信息
This paper studies the transfer reinforcement learning (RL) problem where multiple RL problems have different reward functions but share the same underlying transition *** this setting, the Q-function of each RL probl...
详细信息
This paper studies the transfer reinforcement learning (RL) problem where multiple RL problems have different reward functions but share the same underlying transition *** this setting, the Q-function of each RL problem (task) can be decomposed into a successor feature (SF) and a reward mapping: the former characterizes the transition dynamics, and the latter characterizes the task-specific reward *** Q-function decomposition, coupled with a policy improvement operator known as generalized policy improvement (GPI), reduces the sample complexity of finding the optimal Q-function, and thus the SF & GPI framework exhibits promising empirical performance compared to traditional RL methods like ***, its theoretical foundations remain largely unestablished, especially when learning the successor features using deep neural networks (SF-DQN).This paper studies the provable knowledge transfer using SFs-DQN in transfer RL *** establish the first convergence analysis with provable generalization guarantees for SF-DQN with *** theory reveals that SF-DQN with GPI outperforms conventional RL approaches, such as deep Q-network, in terms of both faster convergence rate and better *** experiments on real and synthetic RL tasks support the superior performance of SF-DQN & GPI, aligning with our theoretical findings. Copyright 2024 by the author(s)
暂无评论