In this paper, we develop a novel beam training scheme for extremely large-scale multiple-input-multiple-output (XL-MIMO) system by exploiting the visual image information. Different from the conventional beam trainin...
详细信息
ISBN:
(纸本)9798350304060;9798350304053
In this paper, we develop a novel beam training scheme for extremely large-scale multiple-input-multiple-output (XL-MIMO) system by exploiting the visual image information. Different from the conventional beam training schemes that consumes a large number of in-band (time/frequency) resources, the proposed scheme only leverages the out-of-band (vision image) information, which can efficiently reduce the training overhead. Specifically, we proposed a vision image-aided beam training cascaded framework integrating YOLOv5 and ResNet18 networks, where the YOLOv5 uses the object detection technique to extract the size and location information of the mobile vehicles (MVs) and the ResNet18 based the extracted information infers the optimal beam index without occupying in-band overhead. The simulation results demonstrate that the proposed vision image-aided beam training scheme outperforms the benchmark scheme.
This paper introduces a method to automatically learn the unary and pairwise potentials of a conditional random field (CRF) from the input data in a non-parametric fashion, within the framework of the semantic segment...
详细信息
The development of Industrial internet of Things (IIoT) technology and network infrastructures has enabled the acquisition of substantial data, enabling data-driven condition monitoring and analysis. Detecting anomali...
详细信息
The development of Industrial internet of Things (IIoT) technology and network infrastructures has enabled the acquisition of substantial data, enabling data-driven condition monitoring and analysis. Detecting anomalies in machinery equipment is crucial in IIoT environments for safety enhancement, productivity, and reliability. To provide effective anomaly detection at IIoT edge nodes without delay, it is necessary to efficiently collect and process vast amounts of data from various sensors. While this demands a significant amount of computing resources, edge nodes only have limited data storage and processing capabilities. Therefore, our focus is on developing a lightweight anomaly detection algorithm for acoustic signal processing, considering the computational resources of the IIoT edge node. In this article, we propose the parallel discrete wavelet transform (PDWT) as an efficient method for compressing and processing acoustic signals received at edge nodes. This approach significantly alleviates memory consumption and reduces the computational time at the edge. In addition, by harnessing preprocessed features through PDWT, we can develop lightweight anomaly detection models suitable for deployment at the edge, making them highly practical for real-world implementation. The experimental results using real-world data collected from industrial machines confirm the effectiveness of the proposed solution.
Multiview representation learning techniques based on deep correlation maximization have become increasingly popular for learning meaningful and compact representations from multiview data. Even though their performan...
详细信息
Window setting in CT brain images is the crucial pre-processing step to examine the abnormalities for diagnosing disease. Recently, many methods have been proposed to determine the suitable window automatically instea...
详细信息
In the rapid development of the internet of Things technology, image recognition and detection technology is used in all walks of life. In order to solve the limitations of traditional image detection methods in pract...
详细信息
Multimodal image fusion aims to merge features from different modalities to create a comprehensively representative image. However, existing medical image fusion methods often struggle to handle noise generated during...
详细信息
images captured in low light conditions usually suffer from poor visibility, a high amount of noise, and little information stored in the dark image, which has a negative impact on subsequent processing for outdoor co...
详细信息
ISBN:
(纸本)9798350374292;9798350374285
images captured in low light conditions usually suffer from poor visibility, a high amount of noise, and little information stored in the dark image, which has a negative impact on subsequent processing for outdoor computer vision applications. Presently, numerous deep learning based methods achieved superior performance with multi-exposure paired training data or additional information. However, obtaining multi-exposure data samples is a tedious task in real-time scenarios. To mitigate this challenge, we propose a zero reference based learnable wavelet approach without multi-exposure paired training data requirement for low-light image enhancement. Our proposed approach generates the low light image and learns to project an image into noise free similar looking image, then we enhance the image using retinex theory. Further, we have proposed learnable wavelet block to remove the hidden noise amplified while enhancement. We introduce Gaussian-based supervision to improve the smoothness of the image. Extensive experimental analysis on synthetic as well as real-world images, along with thorough ablation study demonstrate the effectiveness of our proposed method over the existing state-of-the-art methods for low-light image enhancement. The code is provided at https://***/vision-lab-sggsiet/Zero-Reference-based-Low-light-Enhancement-with-Wavelet-Optimization.
Person re-identification aims to recognize a target pedestrian across non-overlapping camera views based on source information. The internet of Things (IoT) provides a wide range of application scenarios for pedestria...
详细信息
ISBN:
(纸本)9798350366457;9798350366440
Person re-identification aims to recognize a target pedestrian across non-overlapping camera views based on source information. The internet of Things (IoT) provides a wide range of application scenarios for pedestrian re-identification technology-smart city management, resource optimization, and multi-source data fusion. It is crucial for IoT applications like intelligent video surveillance but remains challenging due to factors like low image resolution, varying angles, lighting changes, and occlusion. In this paper, we propose a multi-task learning approach that integrates text information to enhance recognition accuracy. Using a dual-stream Transformer encoder, we extract both image and text features. To improve feature interaction and learning, we perform multimodal interaction for fine-grained alignment and share feature for modality-invariant feature representation and learning. Our method, TFTI, outperforms state-of-the-art techniques in person re-identification, as validated on the CUHK-PEDES dataset.
In recent years, with the continuous development of computer technology, internet of Things (IoT) technology has been widely used in various fields and has played an important role in various industries. The internet ...
详细信息
暂无评论