Particles in the atmosphere, such as dust and smoke, can cause visual clarity problems in both images and videos. Haze is the result of the interaction between airborne particles and light, which is scattered and atte...
详细信息
Particles in the atmosphere, such as dust and smoke, can cause visual clarity problems in both images and videos. Haze is the result of the interaction between airborne particles and light, which is scattered and attenuated. Hazy media present difficulties in a variety of applications due to the reduced contrast and loss of essential information. In response, dehazing techniques have been introduced to bring hazy videos and images back to clarity. Here, we provide a novel technique for eliminating haze. It comprises preprocessing steps before dehazing. Preprocessing is applied to hazy images through homomorphic processing and Contrast Limited Adaptive Histogram Equalization (CLAHE). We present a dehazing technique referred to as the pre-trained Feature Fusion Attention Network (FFA-Net) that directly lets dehazed images be restored from hazy or preprocessed hazy inputs without requiring the determination of atmospheric factors, such as air light and transmission maps. The FFA-Net architecture incorporates a Feature Attention (FA) method to do this task. We assess the proposed technique in a variety of circumstances, including visible frames, Near-Infrared (NIR) frames, and real-world hazy images. Evaluation criteria like entropy, correlation, and Peak Signal-to-Noise Ratio (PSNR) are used to compare the quality of dehazed frames or images to their hazy counterparts. Furthermore, histogram analysis and spectral entropy are adopted to determine the effectiveness of the proposed technique in comparison to existing dehazing techniques. Comparative results are presented for both real-world and simulated environments. The benefits of the proposed technique are demonstrated by a comparison of the results obtained from the standalone pre-trained FFA-Net and the proposed comprehensive methodology. Moreover, a thorough assessment is carried out for comparing the effectiveness of the proposed FFA-Net technique to those of some current dehazing techniques on real hazy images. T
Missing data is a widespread problem across many research fields, and it can have a major impact on the precision of statistical analyses. Imputation is one method for handling missing data, in which missing values ar...
详细信息
Indoor positioning system determine the position of objects in a closed room or story building. This system can determine not only the position but also the orientation and direction of a person's movement. This r...
详细信息
作者:
Salama, Wessam M.Aly, Moustafa H.Department of Computer Engineering
Faculty of Engineering Pharos University Canal El Mahmoudia Street Beside Green Plaza Complex 21648 Alexandria Egypt OSA Member
Department of Electronics and Communications Engineering College of Engineering and Technology Arab Academy for Science Technology and Marine Transport Alexandria1029 Egypt
Recent studies on channel estimation in wireless communication systems have focused on deep learning methods. Our primary contribution is based on the use of DenseNet121 hybrid with Random Forest (RF), Gated Recurrent...
详细信息
Recent studies on channel estimation in wireless communication systems have focused on deep learning methods. Our primary contribution is based on the use of DenseNet121 hybrid with Random Forest (RF), Gated Recurrent Units (GRU), Long Short-Term Memory Networks (LSTM), and Recurrent Neural Networks (RNN) to improve the channel estimation and lower the error rate. In order to mitigate inter-symbol interference and map the datasets, this paper introduces M-quadrature amplitude modulation (16-QAM) and orthogonal frequency division multiplexing (OFDM), which is based on quadrature phase shift keying (QPSK). Additionally, the existence or lack of cyclic prefixes forms the basis of our simulation. Additionally, the suggested models are investigated using pilot samples 2, 4, 8, and 64. Labeled OFDM signal samples, where the labels match the signal received after applying OFDM and passing through the medium, are used to train the proposed models. The DenseNet121 functions as a powerful feature extractor to extract intricate spatial information from received signal data. Sequential models like as RNN, LSTM, and GRU are used to model temporal dependencies in the retrieved features. RF is also utilized to exploit non-linear relationships and interactions between features to further increase prediction accuracy and reduce bit error rate (BER). By comparing the models using key metrics like accuracy, bit error rate (BER), and mean squared error (MSE), superior performance is attained based on the DenseNet121_RNN_GRU_RF model. Additionally, the DLMs are assessed against traditional methods like minimal mean square error (MMSE) and least squares (LS). Using the DenseNet121_RNN_GRU_RF model indicates a considerable gain over alternative architectures, with an improvement of 36.3% over DensNet121-RNN-LSTM-RF, according to a comparison of the suggested models without cyclic prefix for OFDM_QPSK. The improvement in percentages of roughly 63.3% over DensNet121-RNN-LSTM, 68.18% over De
This paper presents a method for extracting and interpreting information from diverse, unstructured dental literature using advanced AI techniques. By integrating information extraction, ontologies, and knowledge grap...
详细信息
In this paper, a model was built to compare the performance of the following machine learning (ML) models: DT, RF, SVM, and MLP, using two types of classification: binary classification and multi classification. The r...
详细信息
Autoencoders are widely recognized as non-probabilistic learning models for extracting useful information from data. Most autoencoder models assume a Euclidean geometry for the underlying nature of the data. However, ...
详细信息
The identification and classification of collective people’s activities are gaining momentum as significant themes in machine learning,with many potential applications *** need for representation of collective human ...
详细信息
The identification and classification of collective people’s activities are gaining momentum as significant themes in machine learning,with many potential applications *** need for representation of collective human behavior is especially crucial in applications such as assessing security conditions and preventing crowd *** paper investigates the capability of deep neural network(DNN)algorithms to achieve our carefully engineered pipeline for crowd *** includes three principal stages that cover crowd analysis ***,individual’s detection is represented using the You Only Look Once(YOLO)model for human detection and Kalman filter for multiple human tracking;Second,the density map and crowd counting of a certain location are generated using bounding boxes from a human detector;and Finally,in order to classify normal or abnormal crowds,individual activities are identified with pose *** proposed system successfully achieves designing an effective collective representation of the crowd given the individuals in addition to introducing a significant change of crowd in terms of activities *** results onMOT20 and SDHA datasets demonstrate that the proposed system is robust and *** framework achieves an improved performance of recognition and detection peoplewith a mean average precision of 99.0%,a real-time speed of 0.6ms non-maximumsuppression(NMS)per image for the SDHAdataset,and 95.3%mean average precision for MOT20 with 1.5ms NMS per image.
This paper presents a chopper-stabilized three-stage operational amplifier (OpAmp) with a unity gain bandwidth of 69 MHz and an input referred noise density of 3 nV√Hz. The proposed design achieves a stable unity gai...
详细信息
Speech is a fundamental means of human interaction. Speaker Identification (SI) plays a crucial role in various applications, such as authentication systems, forensic investigation, and personal voice assistance. Howe...
详细信息
Speech is a fundamental means of human interaction. Speaker Identification (SI) plays a crucial role in various applications, such as authentication systems, forensic investigation, and personal voice assistance. However, achieving robust and secure SI in both open and closed environments remains challenging. To address this issue, researchers have explored new techniques that enable computers to better understand and interact with humans. Smart systems leverage Artificial Neural Networks (ANNs) to mimic the human brain in identifying speakers. However, speech signals often suffer from interference, leading to signal degradation. The performance of a Speaker Identification System (SIS) is influenced by various environmental factors, such as noise and reverberation in open and closed environments, respectively. This research paper is concerned with the investigation of SI using Mel-Frequency Cepstral Coefficients (MFCCs) and polynomial coefficients, with an ANN serving as the classifier. To tackle the challenges posed by environmental interference, we propose a novel approach that depends on symmetric comb filters for modeling. In closed environments, we study the effect of reverberation on speech signals, as it occurs due to multiple reflections. To address this issue, we model the reverberation effect with comb filters. We explore different domains, including time, Discrete Wavelet Transform (DWT), Discrete Cosine Transform (DCT), and Discrete Sine Transform (DST) domains for feature extraction to determine the best combination for SI in case of reverberation environments. Simulation results reveal that DWT outperforms other transforms, leading to a recognition rate of 93.75% at a Signal-to-Noise Ratio (SNR) of 15 dB. Additionally, we investigate the concept of cancelable SI to ensure user privacy, while maintaining high recognition rates. Our simulation results show a recognition rate of 97.5% at 0 dB using features extracted from speech signals and their DCTs. Fo
暂无评论