Environmental sound classification(ESC) is the trending research area. ESC categorizes sounds such as dog barking, gunshots, and children playing in the surroundings. Due to overlapping sound signals, the presence of ...
详细信息
Environmental sound classification(ESC) is the trending research area. ESC categorizes sounds such as dog barking, gunshots, and children playing in the surroundings. Due to overlapping sound signals, the presence of several audio sources while recording audio, and different distances from audio sources to the microphone make this problem complex. This study proposes a robust model for ESC, which can help in crime investigation systems, security warning systems, and the development of smart homes and hearing aids. Researchers have designed numerous frameworks for classifying surrounding events. Various techniques for ESC have been used in the past, but they are either computationally intensive or provide less accuracy. A hybrid model consisting of Convolutional Neural Network and Recurrent Neural Network for ESC is proposed to provide an accuracy of 99.89%, which is the highest till now, as far as we know. The model is a combination of both models;it is called CRNN. CRNN has already been used in a few past studies, but raw waveforms are used, and the accuracy attained is quite low. The publicly available Dataset UrbanSound8 K is used. Augmentation techniques are used to overcome the scarcity of datasets. The cepstral features are extracted and input to the CRNN. CRNN is encouraged due to its ability to capture spatial and temporal dependencies of environmental sound waves. Various hyperparameters, such as the number of LSTM layers, number of filters, batch size, momentum, and number of neurons in the LSTM layer, are altered to find the best value for hyperparameters for ESC. It is found that 0.5 momentum, 128 filters, 512 neurons in the LSTM layer, 256 batch size, and one LSTM layer give the highest accuracy. Another dataset, ESC- 10, is used to validate the model. It is found that the proposed model provides considerable accuracy for ESC- 10, even though it is lower than in the case of UrbanSound8 K. In the future, the model can be applied to different applications
1 Introduction On-device deep learning(DL)on mobile and embedded IoT devices drives various applications[1]like robotics image recognition[2]and drone swarm classification[3].Efficient local data processing preserves ...
详细信息
1 Introduction On-device deep learning(DL)on mobile and embedded IoT devices drives various applications[1]like robotics image recognition[2]and drone swarm classification[3].Efficient local data processing preserves privacy,enhances responsiveness,and saves ***,current ondevice DL relies on predefined patterns,leading to accuracy and efficiency *** is difficult to provide feedback on data processing performance during the data acquisition stage,as processing typically occurs after data acquisition.
Time-sensitive networking (TSN) is widely used in industrial automation and automotive applications due to its ability to provide deterministic transmission. To meet the stringent deterministic requirements of time-se...
详细信息
This paper suggests a new mechanism from deep learning concept for personalised therapy in Clinical Decision Support Systems (CDSS). Basically, the texts used for the observation are acquired from the standard data so...
详细信息
Effective recommender systems play a crucial role in accurately capturing user and item attributes that mirror individual preferences. Some existing recommendation techniques have started to shift their focus towards ...
详细信息
Due to the complexity of ocean sensing tasks, buoy detection in traditional ocean observation methods has the disadvantages of high cost and insufficient real-time performance. Ocean mobile crowd sensing technology co...
详细信息
With rapidly expanding cloud-enabled big data environments, there is an imperative need for efficient data-sharing mechanisms that are multidimensional and balance both speed and security. In this connection, high-spe...
详细信息
Aiming at the nonlinear and dynamic characteristics of data in automotive engine systems, a fault detection method based on canonical variate analysis combined with Bhattacharyya distance (CVA-BD) is proposed in this ...
详细信息
5G has boosted the possibility of ultra-high-speed, low-latency, and reliable wireless communication systems. With 5G networks, if efficient resource management is not properly looked at, then the full potential of su...
详细信息
Kidney disease (KD) is a gradually increasing global health concern. It is a chronic illness linked to higher rates of morbidity and mortality, a higher risk of cardiovascular disease and numerous other illnesses, and...
详细信息
暂无评论