检索结果-内蒙古大学图书馆

IEEE Transactions on Intelligent Vehicles 2023年 1-10页

作者： Xue, Jun Liu, Ziniu Liu, Guanjun Zhou, Ziyuan Zhang, Kaiwen Tang, Ying Wang, Jiacun Department of Control Science and Engineering Tongji University Shanghai China Department of Computer Science Tongji University Shanghai China Department of Electrical and Computer Engineering Rowan University Glassboro NJ USA Department of Computer Science and Software Engineering Monmouth University West Long Branch NJ USA

Unmanned Aerial Vehicles (UAVs) have extensive applications such as logistics transportation and aerial photography. However, UAVs are sensitive to winds. Traditional control methods, such as proportional- integral-derivative controllers, generally fail to work well when the strength and direction of winds are changing frequently. In this work deep reinforcement learning algorithms are combined with a domain randomization method to learn robust wind-resistant hovering policies. A novel reward function is designed to guide learning. This reward function uses a constant reward to maintain a continuous flight of a UAV as well as a weight of the horizontal distance error to ensure the stability of the UAV at altitude. A five-dimensional representation of actions instead of the traditional four dimensions is designed to strengthen the coordination of wings of a UAV. We theoretically explain the rationality of our reward function based on the theories of Q-learning and reward shaping. Experiments in the simulation and real-world application both illustrate the effectiveness of our method. To the best of our knowledge, it is the first paper to use reinforcement learning and domain randomization to explore the problem of robust wind-resistant hovering control of quadrotor UAVs, providing a new way for the study of wind-resistant hovering and flying of UAVs. IEEE

关键词： Heuristic algorithms

来源：评论

学校读者我要写书评

暂无评论

Meta-ETI: Meta-Reinforcement Learning with Explicit Task Inference for UAV-IoT Coverage

引用

IEEE Internet of Things Journal 2025年第13期12卷 23852-23865页

作者： Huang, Songjun Sun, Chuanneng Pompili, Dario Rutgers University-New Brunswick Department of Electrical and Computer Engineering NJ United States

To better enhance the network service for different user devices in various scenarios, unmanned aerial vehicles (UAVs) are increasingly used as aerial base stations (ABSs). However, optimizing coverage for user devices via UAV team control is an NP-hard problem and escalates exponentially in complexity with the growing number of user devices. To address this challenge, researchers have turned to reinforcement learning (RL) for a more practical solution. With the growing prevalence of the Internet of Things (IoT), the diversity of user devices increases, posing challenges for traditional RL, as i) the spatial distribution of devices becomes more complex;ii) variations in device types and device mobility increase the training latency;iii) the high-speed movement of IoT devices can lead to performance deterioration in widely used RL algorithms with discrete action space;and iv) traditional RL struggles to adapt to new environments. To solve these problems, we propose a new meta-RL framework, Meta-RL with Explicit Task Inference (Meta-ETI). Then, we apply this framework to efficiently train an energy-efficient UAV control policy for fair and effective coverage in 3D dynamic environments. Meta-ETI is evaluated in both theoretical and application-related aspects and demonstrates superior performance compared to the baseline frameworks. The result shows that Meta-ETI demonstrates 2 to 3 times faster adaptation speed and a decent performance in sample efficiency. Furthermore, in the UAV-IoT coverage application, Meta-ETI shows 30% to 50% better in energy efficiency and 40% to 60% more served devices because of the fair coverage. © 2014 IEEE.

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

ESG Investing: A Statistically Valid Approach to Data-Driven Decision Making and the Impact of ESG Factors on Stock Returns and Risk

引用

IEEE Access 2024年 12卷 69434-69444页

作者： Teja, Kamurthi Ravi Liu, Chuan-Ming National Taipei University of Technology International Program of Electrical Engineering and Computer Science Taipei10608 Taiwan National Taipei University of Technology Department of Computer Science and Information Engineering Taipei10608 Taiwan

This study examines the impact of environmental, social, and governance (ESG) factors on economic investment from a statistical perspective, aiming to develop a tested investment strategy that capitalizes on the connection between ESG factors and financial performance. ESG investing: A statistically valid approach to data-driven decision-making (ESGI-SVADDM) investment strategy is based on a rigorous, statistically valid approach that utilizes data, math, statistics, and data science libraries to drive investment decisions, eliminating the need for personal opinions and subjectivity. The process includes establishing an investment thesis, formulating testable hypotheses (HPS), retrieving and refining relevant data, calculating relevant measures, and testing and validating all hypotheses. The study uses S&P 500 stock data and ESG data from Sustainalytics to test the hypotheses. The empirical tests conducted revealed a negative correlation between ESG risk and expected returns, as well as a positive trend in the relationship between ESG risk and the overall risk of stocks. Moreover, the study found that higher ESG risk scores are associated with lower returns for investors, and that adopting a strategy of investing in stocks with low ESG risk and shorting stocks with high ESG risk yields superior returns compared to the market portfolio. © 2013 IEEE.

关键词： Investments

来源：评论

学校读者我要写书评

暂无评论

Optimal Offering of Energy Storage in UK Day-Ahead Energy and Frequency Response Markets

引用

Journal of Modern Power Systems and Clean Energy 2024年第2期12卷 415-426页

作者： Makedon Karasavvidis Andreas Stratis Dimitrios Papadaskalopoulos Goran Strbac Department of Electrical and Electronic Engineering Imperial College LondonLondonUK Statkraft UK Ltd. LondonUK Department of Electrical and Computer Engineering University of PatrasPatrasGreece IEEE

The offering strategy of energy storage in energy and frequency response(FR) markets needs to account for country-specific market regulations around FR products as well as FR utilization factors, which are highly uncertain. To this end, a novel optimal offering model is proposed for stand-alone price-taking storage participants, which accounts for recent FR market design developments in the UK, namely the trade of FR products in time blocks, and the mutual exclusivity among the multiple FR products. The model consists of a day-ahead stage, devising optimal offers under uncertainty, and a real-time stage, representing the storage operation after uncertainty is materialized. Furthermore, a concrete methodological framework is developed for comparing different approaches around the anticipation of uncertain FR utilization factors(deterministic one based on expected values, deterministic one based on worst-case values, stochastic one, and robust one), by providing four alternative formulations for the real-time stage of the proposed offering model, and carrying out an out-of-sample validation of the four model instances. Finally, case studies employing real data from UK energy and FR markets compare these four instances against achieved profits, FR delivery violations, and computational scalability.

关键词： Energy markets energy storage frequency response optimal offering robust optimization stochastic programming

来源：评论

学校读者我要写书评

暂无评论

Network life-time maximisation with low-power consumption by the usage of ANFIS-based technique in wireless sensor networks

引用

International Journal of Wireless and Mobile Computing 2024年第1期26卷 1-8页

作者： Rao, N. Srinivas Rama Rao, K.V.S.N. Department of Computer Science and Engineering KLEF Deemed to be University Andhra Pradesh Guntur India Department of Computer Science and Engineering KLEF Deemed to be University Telangana Hyderabad India

Clustering strategies for reducing the energy consumption and extending the network life have been employed widely in Wireless Sensor Network (WSN). The clustering mechanism can extend the network’s service life and network failure in WSN. In the study, we proposed the technique for improving network performance with a new energy efficient Adaptive Neuro-Fuzzy Inference System (ANFIS)-based routing approach for WSN. A new distributed cluster creation methodology that enables the self-organisation of local nodes, a novel method for the adjustment of clusters and the turning of the Cluster Head (CH) centre location to distribute energy burden equally through all sensing nodes incorporates the suggested ANFIS-based routing. The simulation result shows that the proposed scheme outperforms conventional methods with an improvement of 80% in network lifetime. Copyright © 2024 Inderscience Enterprises Ltd.

关键词： Base stations

来源：评论

学校读者我要写书评

暂无评论

Learning-Based Compress-and-Forward Schemes for the Relay Channel

引用

IEEE Journal on Selected Areas in Communications 2025年第7期43卷 2393-2404页

作者： Ozyilkan, Ezgi Carpi, Fabrizio Garg, Siddharth Erkip, Elza New York University Department of Electrical and Computer Engineering BrooklynNY United States

The relay channel, consisting of a source-destination pair along with a relay, is a fundamental component of cooperative communications. While the capacity of a general relay channel remains unknown, various relaying strategies, including compress-and-forward (CF), have been proposed. In CF, the relay forwards a quantized version of its received signal to the destination. Given the correlated signals at the relay and destination, distributed compression techniques, such as Wyner–Ziv coding, can be harnessed to utilize the relay-to-destination link more efficiently. Leveraging recent advances in neural network-based distributed compression, we revisit the relay channel problem and integrate a learned task-aware Wyner–Ziv compressor into a primitive relay channel with a finite-capacity out-of-band relay-to-destination link. The resulting neural CF scheme demonstrates that our compressor recovers binning of the quantized indices at the relay, mimicking the optimal asymptotic CF strategy, although no structure exploiting the knowledge of source statistics was imposed into the design. The proposed neural CF, employing finite order modulation, operates closely to the rate achievable in a primitive relay channel with a Gaussian codebook. We showcase the advantages of exploiting the correlated destination signal for relay compression through various neural CF architectures that involve end-to-end training of the compressor and the demodulator components. Our learned task-oriented compressors provide the first proof-of-concept work toward interpretable and practical neural CF relaying schemes. © 1983-2012 IEEE.

关键词： Compressors

来源：评论

学校读者我要写书评

暂无评论

Matrix Completion from One-Bit Dither Samples

引用

IEEE Transactions on Signal Processing 2024年 1-14页

作者： Eamaz, Arian Yeganegi, Farhang Soltanalian, Mojtaba Department of Electrical and Computer Engineering University of Illinois Chicago Chicago IL USA

We explore the impact of coarse quantization on matrix completion in the extreme scenario of dithered one-bit sensing, where the matrix entries are compared with random dither levels. In particular, instead of observing a subset of high-resolution entries of a low-rank matrix, we have access to a small number of one-bit samples, generated as a result of these comparisons. In order to recover the low-rank matrix using its coarsely quantized known entries, we begin by transforming the problem of one-bit matrix completion (one-bit MC) with random dithering into a nuclear norm minimization problem. The one-bit sampled information is represented as linear inequality feasibility constraints. We then develop the popular singular value thresholding (SVT) algorithm to accommodate these inequality constraints, resulting in the creation of the One-Bit SVT (OBSVT). Our findings demonstrate that incorporating multiple random dither sequences in one-bit MC can significantly improve the performance of the matrix completion algorithm. In pursuit of achieving this objective, we utilize diverse dithering schemes, namely uniform, Gaussian, and discrete dithers. To accelerate the convergence of our proposed algorithm, we introduce three variants of the OB-SVT algorithm. Among these variants is the randomized sketched OB-SVT, which departs from using the entire information at each iteration, opting instead to utilize sketched data. This approach effectively reduces the dimension of the operational space and accelerates the convergence. We perform numerical evaluations comparing our proposed algorithm with the maximum likelihood estimation method previously employed for one-bit MC, and demonstrate that our approach can achieve a better recovery performance. Authors

关键词： Linear matrix inequalities

来源：评论

学校读者我要写书评

暂无评论

A Novel Access Point Deployment Framework for mmWave Cell-Free Massive MIMO Networks

引用

IEEE Transactions on Wireless Communications 2025年第6期24卷 4581-4597页

作者： Topal, Ozan Alp Demir, Ozlem Tugfe Bjornson, Emil Cavdar, Cicek School of Electrical Engineering and Computer Science KTH Royal Institute of Technology Stockholm Sweden Department of Electrical and Electronics Engineering TOBB University of Economics and Technology Ankara Turkey

Millimeter-wave network deployment is an essential and ongoing problem due to the limited coverage and expensive network infrastructure. In this work, we solve a joint network deployment and resource allocation optimization problem for a mmWave cell-free massive MIMO network considering indoor environments. The objective is to minimize the number of deployed access points (APs) for a given environment, bandwidth, AP cooperation, and precoding scheme while guaranteeing the rate requirements of the user equipments (UEs). Considering coherent joint transmission (C-JT) and non-coherent joint transmission (NC-JT), we solve the problem of AP placement, UE-AP association, and power allocation among the UEs and resource blocks jointly. For numerical analysis, we model a mid-sized airplane cabin in ray-tracing as an exemplary case for IDS. Results demonstrate that a minimum data rate of 1 Gbps can be guaranteed with less than 10 APs with C-JT. From a holistic network design perspective, we analyze the trade-off between the required fronthaul capacity and the processing capacity per AP, under different network functional split options. We observe an above 600 Gbps fronthaul rate requirement, once all network operations are centralized, which can be reduced to 200 Gbps under physical layer functional splits. 2002-2012 IEEE.

关键词： Integer programming

来源：评论

学校读者我要写书评

暂无评论

A Scalable Unsupervised and Back Propagation Free Learning With SACSOM: A Novel Approach to SOM-Based Architectures

IEEE Transactions on Artificial Intelligence

引用

IEEE Transactions on Artificial Intelligence 2025年第4期6卷 955-967页

作者： Hirani, Gaurav R. Wang, Kevin I-Kai Abdulla, Waleed H. The University of Auckland Department of Electrical Computer and Software Engineering Auckland1010 New Zealand

The field of computer vision is predominantly driven by supervised models, which, despite their efficacy, are computationally expensive and often intractable for many applications. Recently, research has expedited alternative avenues such as self-organizing maps (SOM)-based architectures, which offer significant advantages such as tractability, the absence of back-propagation, and feed-forward unsupervised learning. However, these SOM-based approaches frequently suffer from lower accuracy and limited generalization capabilities. To address these shortcomings, we propose a novel model called split and concur SOM (SACSOM). SACSOM overcomes the limitations of closely related SOM-based algorithms by utilizing multiple parallel branches, each equipped with its own SOM modules that process data independently with varying patch sizes. Furthermore, by creating groups of classes and using respective training samples to train independent subbranches in each branch, our approach accommodates datasets with a large number of classes. SACSOM employs a simple yet effective labeling technique requiring minimal labeled samples. The outputs from each branch, filtered by a threshold, contribute to the final prediction. Experimental validation on MNIST-digit, Fashion-MNIST, CIFAR-10, and CIFAR-100 demonstrates that SACSOM achieves competitive accuracy with significantly reduced computation time. Furthermore, it exhibits superior performance and generalization capabilities, even in high-noise scenarios. The weights of the single-layered SACSOM provide meaningful insights into the patch-based learning pattern, enhancing its tractability and making it ideal from the perspective of explainable AI. This study addresses the limitations of current clustering techniques, such as K-means and traditional SOMs, by proposing a lightweight, manageable, and fast architecture that does not require a GPU, making it suitable for low-powered devices. © 2024 IEEE.

关键词： Unsupervised learning

来源：评论

学校读者我要写书评

暂无评论

Consistent Action for Stable Training in Reinforcement Learning–based Gain Tuning of Linear Feedback Controller

引用

Journal of Institute of Control, Robotics and Systems 2024年第9期30卷 965-972页

作者： Byun, Hyungjo ASRI Department of Electrical and Computer Engineering Seoul National University Korea Republic of

Controlling nonlinear systems with linear feedback controller after linearization is a widely used method. This paper proposes a new method to efficiently train a reinforcement learning agent to select the control gain of the linear feedback controller. The proposed method involves the agent selecting a consistent gain by choosing between a new and an old gain. Since the linear feedback controller can operate with this consistent gain, the agent does not need to select the optimal gain, thus simplifying the training process. Numerical simulations were performed by applying the proposed method to a longitudinal model of a nonlinear missile with a 3-loop autopilot. The proposed method enables training with a wide range of gains and enhances the reference tracking performance compared to classical gain scheduling. © ICROS 2024.

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：