This Volume 4555 of the conference proceedings contains 29 papers. Topics discussed include neural network and distributed processing, hardware parallel, character recognition, image segmentation and image classificat...
详细信息
This Volume 4555 of the conference proceedings contains 29 papers. Topics discussed include neural network and distributed processing, hardware parallel, character recognition, image segmentation and image classification.
This paper proposes a novel downlink precoding method for a cell-free massive multiple-input multiple-output (CF-mMIMO) network, requiring no channel state information sharing between the access points via fronthaul l...
详细信息
This paper proposes a novel downlink precoding method for a cell-free massive multiple-input multiple-output (CF-mMIMO) network, requiring no channel state information sharing between the access points via fronthaul links. By drawing analogies between a CF-mMIMO network and an artificial neuralnetwork, the proposed algorithm borrows the idea of backpropagation to train the precoders and the combiners through over-the-air ping-pong signaling between the access points and user equipments. It utilizes manifolds optimization to meet the per-AP power constraint and is named as distributed quasi-neuralnetwork precoding on manifold (DQNPM). The DQNPM algorithm can accommodate a large category of objective functions for fully distributed implementation. Numerical simulations show that our method outperforms the state-of-the-art approaches, and is robust against pilot contamination.
This paper proposes a distributed learning-based framework to tackle the sum ergodic rate maximization problem in cell-free massive multiple-input multiple-output (MIMO) systems by utilizing the graph neuralnetwork (...
详细信息
This paper proposes a distributed learning-based framework to tackle the sum ergodic rate maximization problem in cell-free massive multiple-input multiple-output (MIMO) systems by utilizing the graph neuralnetwork (GNN). Different from centralized schemes, which gather all the channel state information (CSI) at the central processing unit (CPU) for calculating the resource allocation, the local resource of access points (APs) is exploited in the proposed distributed GNN-based framework to allocate transmit powers. Specifically, APs can use a unique GNN model to allocate their power based on the local CSI. The GNN model is trained at the CPU using the local CSI of one AP, with partially exchanged information from other APs to calculate the loss function to reflect system characteristics, capturing comprehensive network information while avoiding computation burden. Numerical results show that the proposed distributed learning-based approach achieves a sum ergodic rate close to that of centralized learning while outperforming the model-based optimization.
Pixel similarity measurement is a critical step in distributed scatterer (DS) interferometry, directly affecting DS phase estimation. Despite considerable efforts to improve its accuracy, existing methods still suffer...
详细信息
Pixel similarity measurement is a critical step in distributed scatterer (DS) interferometry, directly affecting DS phase estimation. Despite considerable efforts to improve its accuracy, existing methods still suffer from unsatisfactory performance, especially with small stack sizes. In recent years, deep neuralnetworks have achieved remarkable breakthroughs in interferometric synthetic aperture radar (InSAR) processing. However, their potential for measuring pixel similarity in multitemporal InSAR remains unexplored. This article proposes a neuralnetwork-driven pixel similarity measurement approach, termed PSMNet. To address the challenge of accurately defining true data, a supervised learning strategy is designed. The proposed network consists of two main modules: 1) a feature extraction module that generates high-level feature images with enhanced representation and reduced noise and 2) a similarity measurement module that evaluates pixel similarity without relying on assumptions about data distribution. The network is trained on synthetic data, enabling it to generalize for different stack sizes and target characteristics. Extensive experiments on simulated and real TanDEM-X images demonstrate a significant accuracy improvement of the proposed approach, highlighting its robust performance for varying stack sizes and computational efficiency advantage compared to traditional methods. The proposed approach further enhances DS phase estimation and increases the number of measurement points, showing great promise for ground surface deformation monitoring.
The empirical mode decomposition (EMD) and artificial neuralnetwork (ANN) are used in this paper to solve an islanding detection problem. In light of this, the voltage signal parameter is obtained or measured at the ...
详细信息
Natural gas pipeline integrity monitoring is crucial to detect potential leaks, find structural issues, and prevent environmental damage. This article presents a system of natural gas pipeline monitoring that uses a s...
详细信息
Natural gas pipeline integrity monitoring is crucial to detect potential leaks, find structural issues, and prevent environmental damage. This article presents a system of natural gas pipeline monitoring that uses a specialized double Brillouin peak sensing fiber along with the Brillouin optical time domain analysis (BOTDAs) technique. The calibrated sensing fiber coefficients for strain and temperature are 41.8 kHz/mu epsilon and 0.9 MHz/degrees C for peak 1;and 47.2 kHz/mu epsilon, and 1.11 MHz/degrees C for peak 2, respectively. Initially, lab tests were performed by installing a short section of double Brillouin peak fiber (DBPF) on a 1-in steel pipe under pressure up to 1000 per square inch (psi) at elevated temperatures. Simultaneous distributed measurements of temperature and pressure-induced hoop strain were successfully measured. Considering the long processing speed to extract Brillouin frequency shift (BFS), we employ a novel probabilistic deep neuralnetwork (PDNN) framework for rapid BFS prediction. Additionally, using the Finite Element Method, the effects of the pipeline pressure on hoop strain were modeled and compared to the experimental hoop strain under the same set of pipeline conditions. Finally, an actual 4-in outer diameter steel natural gas pipeline was used for pilot-scale tests, where hoop strain was measured at various pressure levels. Leaks were simulated to demonstrate accurate pipeline integrity monitoring. At an internal pipe pressure of 1000 psi, hoop strain of approximately 300 mu epsilon was observed, and the sensitivity was calculated as0.28 mu epsilon/psi. The results of this pilot-scale study demonstrated that the system is capable of performing distributed monitoring sufficient to detect pipeline pressure and the presence of leaks to ensure the safe operation of gas pipelines in the field.
Graph neuralnetworks have drawn tremendous attention in the past few years due to their convincing performance and high interpretability in various graph-based tasks like link prediction and node classification. With...
详细信息
Graph neuralnetworks have drawn tremendous attention in the past few years due to their convincing performance and high interpretability in various graph-based tasks like link prediction and node classification. With the ever-growing graph size in the real world, especially for industrial graphs at a billion-level, the storage of graphs can easily consume Terabytes so that the process of GNNs has to be processed in a distributed manner. As a result, the execution could be inefficient due to the expensive cross-node communication and irregular memory access. Various GNN accelerators have been proposed for efficient GNN processing. They, however, mainly focused on small and medium-size graphs, which is not applicable to large-scale distributed graphs. In this paper, we present a practical Near-Data-processing architecture based on a memory-pool system for large-scale distributed GNNs. We propose a customized memory fabric interface to construct the memory pool for low-latency and high throughput cross-node communication, which can provide flexible memory allocation and strong scalability. A practical Near-Data-processing design is proposed for efficient work offloading and bandwidth utilization improvement. Moreover, we also introduce a partition and scheduling scheme to further improve performance and achieve workload balance. Comprehensive evaluations demonstrate that the proposed architecture can achieve up to 27 x and 8 x higher training speed compared to two state-of-the-art distributed GNN frameworks: Deep Graph Library and P-3, respectively.
作者:
Strypsteen, ThomasBertrand, AlexanderKU Leuven
Department of Electrical Engineering STADIUS Center for Dynamical Systems Signal Processing and Data Analytics LeuvenB-3001 Belgium Leuven.AI
KU Leuven Institute for AI LeuvenB-3001 Belgium
We propose a dynamic sensor selection approach for deep neuralnetworks (DNNs), which is able to derive an optimal sensor subset selection for each specific input sample instead of a fixed selection for the entire dat...
详细信息
Big data generated by social media stands for a valuable source of information, which offers an excellent opportunity to mine valuable insights. Particularly, User-generated contents such as reviews, recommendations, ...
详细信息
Big data generated by social media stands for a valuable source of information, which offers an excellent opportunity to mine valuable insights. Particularly, User-generated contents such as reviews, recommendations, and users' behavior data are useful for supporting several marketing activities of many companies. Knowing what users are saying about the products they bought or the services they used through reviews in social media represents a key factor for making decisions. Sentiment analysis is one of the fundamental tasks in Natural Language processing. Although deep learning for sentiment analysis has achieved great success and allowed several firms to analyze and extract relevant information from their textual data, but as the volume of data grows, a model that runs in a traditional environment cannot be effective, which implies the importance of efficient distributed deep learning models for social Big Data analytics. Besides, it is known that social media analysis is a complex process, which involves a set of complex tasks. Therefore, it is important to address the challenges and issues of social big data analytics and enhance the performance of deep learning techniques in terms of classification accuracy to obtain better decisions. In this paper, we propose an approach for sentiment analysis, which is devoted to adopting fastText with Recurrent neuralnetwork variants to represent textual data efficiently. Then, it employs the new representations to perform the classification task. Its main objective is to enhance the performance of well-known Recurrent neuralnetwork (RNN) variants in terms of classification accuracy and handle large scale data. In addition, we propose a distributed intelligent system for real-time social big data analytics. It is designed to ingest, store, process, index, and visualize the huge amount of information in real-time. The proposed system adopts distributed machine learning with our proposed method for enhancing decision-making
Edge intelligence has arisen as a promising computing paradigm for supporting miscellaneous smart applications that rely on machine learning techniques. While the community has extensively investigated multi-tier edge...
详细信息
Edge intelligence has arisen as a promising computing paradigm for supporting miscellaneous smart applications that rely on machine learning techniques. While the community has extensively investigated multi-tier edge deployment for traditional deep learning models (e.g. CNNs, RNNs), the emerging Graph neuralnetworks (GNNs) are still under exploration, presenting a stark disparity to its broad edge adoptions such as traffic flow forecasting and location-based social recommendation. To bridge this gap, this paper formally studies the cost optimization for distributed GNN processing over a multi-tier heterogeneous edge network. We build a comprehensive modeling framework that can capture a variety of different cost factors, based on which we formulate a cost-efficient graph layout optimization problem that is proved to be NP-hard. Instead of trivially applying traditional data placement wisdom, we theoretically reveal the structural property of quadratic submodularity implicated in GNN's unique computing pattern, which motivates our design of an efficient iterative solution exploiting graph cuts. Rigorous analysis shows that it provides parameterized constant approximation ratio, guaranteed convergence, and exact feasibility. To tackle potential graph topological evolution in GNN processing, we further devise an incremental update strategy and an adaptive scheduling algorithm for lightweight dynamic layout optimization. Evaluations with real-world datasets and various GNN benchmarks demonstrate that our approach achieves superior performance over de facto baselines with more than 95.8% cost reduction in a fast convergence speed.
暂无评论