Solar photo voltaic (PV) energy system backbone of the renewable energy system. Energy system is depended on weather conditions such as temperature and radiation intensity. The role of machine learning (ML) for solar ...
详细信息
Transformer neural networks (TNN) have been widely utilized on a diverse range of applications, including natural language processing (NLP), machine translation, and computer vision (CV). Their widespread adoption has...
详细信息
ISBN:
(数字)9798350355543
ISBN:
(纸本)9798350355550
Transformer neural networks (TNN) have been widely utilized on a diverse range of applications, including natural language processing (NLP), machine translation, and computer vision (CV). Their widespread adoption has been primarily driven by the exceptional performance of their multi-head self-attention block used to extract key features from sequential data. The multi-head self-attention block is followed by feedforward neural networks, which play a crucial role in introducing non-linearity to assist the model in learning complex patterns. Despite the popularity of TNNs, there has been limited numbers of hardware accelerators targeting these two critical blocks. Most prior works have concentrated on sparse architectures that are not flexible for popular TNN variants. This paper introduces ProTEA, a runtime programmable accelerator tailored for the dense computations of most of state-of-the-art transformer encoders. ProTEA is designed to reduce latency by maximizing parallelism. We introduce an efficient tiling of large matrices that can distribute memory and computing resources across different hardware components within the FPGA. We provide run time evaluations of ProTEA on a Xilinx Alveo U55C high-performance data center accelerator card. Experimental results demonstrate that ProTEA can host a wide range of popular transformer networks and achieve near optimal performance with a tile size of 64 in the multi-head self-attention block and 6 in the feedforward networks block when configured with 8 parallel attention heads, 12 layers, and an embedding dimension of 768 on the U55C. Comparative results are provided showing ProTEA is 2.5× faster than an NVIDIA Titan XP GPU. Results also show that it achieves 1.3 – 2.8× speed up compared with current state-of-the-art custom designed FPGA accelerators.
WiFi backscatter communication has gained many applications, but its performance characteristics remain to be analyzed. While existing research has investigated the success probability of backscatter tags in contentio...
详细信息
Power transformers are supposed to be an expensive and critical component of a power system and so its schedule maintenance is an important aspect near the utilities. The cellulose paper used as the solid insulating m...
详细信息
Text extraction from images using the traditional techniques of image collecting,and pattern recognition using machine learning consume time due to the amount of extracted features from the *** Neural Networks introdu...
详细信息
Text extraction from images using the traditional techniques of image collecting,and pattern recognition using machine learning consume time due to the amount of extracted features from the *** Neural Networks introduce effective solutions to extract text features from images using a few techniques and the ability to train large datasets of images with significant *** study proposes using Dual Maxpooling and concatenating convolution Neural Networks(CNN)layers with the activation functions Relu and the Optimized Leaky Relu(OLRelu).The proposed method works by dividing the word image into slices that contain *** pass them to deep learning layers to extract feature maps and reform the predicted *** Short Memory(BiLSTM)layers extractmore compelling features and link the time sequence fromforward and backward directions during the training *** Connectionist Temporal Classification(CTC)function calcifies the training and validation loss *** addition to decoding the extracted feature to reform characters again and linking them according to their time *** proposed model performance is evaluated using training and validation loss errors on the Mjsynth and Integrated Argument Mining Tasks(IAM)*** result of IAM was 2.09%for the average loss errors with the proposed dualMaxpooling and *** the Mjsynth dataset,the best validation loss rate shrunk to 2.2%by applying concatenating CNN layers,and Relu.
In this paper, a current injection-based QSTS load flow approach has been developed to simulate and analyze the interactions between Distributed Energy Resources (DERs) and voltage-regulating devices over a given time...
详细信息
This study looks at the influence of sample points on SINR-based m-coverage probability in Urban and Suburban environments within the context of 6G networks. SINR is a crucial factor impacting network coverage and rel...
详细信息
In this paper, we present a framework that integrates digital twin (DT) technology into space-air-ground integrated networks (SAGINs) to enhance vehicular edge computing (VEC). Our objective is to efficiently offload ...
详细信息
Graphene oxide(GO)is a 2D coating material used to improve fiber optics sensors’response to relative *** resonators(MBRs)have garnered more attention as sensing media *** MBR with a 190μm diameter was coated with **...
详细信息
Graphene oxide(GO)is a 2D coating material used to improve fiber optics sensors’response to relative *** resonators(MBRs)have garnered more attention as sensing media *** MBR with a 190μm diameter was coated with ***,tapered fiber light coupling was used to investigate the relative humidity sensing performance in the range of 35—70%RH at 25℃.The MBR showed a higher Q factor before and after GO *** sensitivity of 0.115 dB/%RH was recorded with the 190μm GO-coated MBR sample compared to a sensitivity of 0.022 dB/%RH for the uncoated MBR *** results show that the MBR can be used in fiber optic sensing applications for environmental sensing.
This study introduces a long-short-term memory(LSTM)-based neural network model developed for detecting anomaly events in care-independent smart homes,focusing on the critical application of elderly fall *** balances ...
详细信息
This study introduces a long-short-term memory(LSTM)-based neural network model developed for detecting anomaly events in care-independent smart homes,focusing on the critical application of elderly fall *** balances the dataset using the Synthetic Minority Over-sampling Technique(SMOTE),effectively neutralizing bias to address the challenge of unbalanced datasets prevalent in time-series classification *** proposed LSTM model is trained on the enriched dataset,capturing the temporal dependencies essential for anomaly *** model demonstrated a significant improvement in anomaly detection,with an accuracy of 84%.The results,detailed in the comprehensive classification and confusion matrices,showed the model’s proficiency in distinguishing between normal activities and *** study contributes to the advancement of smart home safety,presenting a robust framework for real-time anomaly monitoring.
暂无评论