Active deception jamming is one of the common means to jam radar signals. How to effectively recognize active deception jamming is a challenge of modern radar technology. To address the accuracy and real-time of radar...
详细信息
The study of animal groups39; flocking behavior is of great significance to the study of multi-agent system. Flocking algorithm is a typical algorithm for group control based on animal group behavior. The research o...
详细信息
The fields of computer vision and special effects production often require editing and synthesis of facial expressions, such as transfer one person39;s expression to another39;s face. It is crucial to synthesize r...
详细信息
The use of Autonomous Underwater Vehicles (AUVs) for environment exploration, data collection and research developments has been growing day by day. Beach checks and surveys are documented studies about land erosion, ...
详细信息
With several deep learning approaches, the domain of automatic speech recognition (ASR) has seen notable advancements in recent times. The domains of intelligent human- computer interaction and machine translation gre...
详细信息
ISBN:
(数字)9798350372120
ISBN:
(纸本)9798350372137
With several deep learning approaches, the domain of automatic speech recognition (ASR) has seen notable advancements in recent times. The domains of intelligent human- computer interaction and machine translation greatly benefited from accurate speech recognition. The present study introduces a hybrid architecture amalgamating a convolutional neural network (CNN) and bidirectional long short-term memory (BLSTM) for speech recognition. For aligning speech input sequences with corresponding text output sequences, it uses the connectionist temporal classification (CTC) technique. The experiments were done on the LJ speech dataset, and the results showed with an increased number of training samples, the performance of the speech recognition algorithm tended to be increased but the time taken for training gradually increased over time. Moreover, a trained speech recognition algorithm exhibits a longer training time when the recognition accuracy is lower. In this study, we implemented a hybrid deep learning model CNN-BLSTM, in conjunction with the CTC loss function, which attains a word error rate (WER) of 36.97% on the testing dataset.
In this paper the Llama-2 and GPT-2 large language models are evaluated for their fundamental understanding of basic due process concepts. The reference implementations and versions fine-tuned on judicial opinions wer...
详细信息
ISBN:
(数字)9798350372977
ISBN:
(纸本)9798350372984
In this paper the Llama-2 and GPT-2 large language models are evaluated for their fundamental understanding of basic due process concepts. The reference implementations and versions fine-tuned on judicial opinions were prompted with questions addressing due process issues. The results were evaluated by an attorney.
An SaaS-based conference management system that implements facial recognition technology has been designed to address the requirement for intelligent conference management. The system leverages the SaaS model to achie...
An SaaS-based conference management system that implements facial recognition technology has been designed to address the requirement for intelligent conference management. The system leverages the SaaS model to achieve rapid deployment, enabling centralized management of multiple conference venues via a conference management service center. This center enables unified scheduling of attendees, equipment, and conference facilities, and provides customization to meet the requirements of various conference scenarios. The system uses facial recognition technology for conference check-in and data collection during the event, thereby optimizing conference automation management based on attendee information. User feedback has indicated a significant improvement in conference management efficiency.
The investigation of seismic tremor gauging utilizing man-made intelligence approaches has shown promising outcomes of late. AI calculations have had the option to distinguish examples and connections in seismic and d...
详细信息
In the process of learning English, English pronunciation has always been difficult. This article was based on the Mul Tran (Multilingual Translation) platform and used various methods such as animation, sound, images...
详细信息
ISBN:
(数字)9798350376173
ISBN:
(纸本)9798350376180
In the process of learning English, English pronunciation has always been difficult. This article was based on the Mul Tran (Multilingual Translation) platform and used various methods such as animation, sound, images, and text to learn and train English phonetics. It can provide effective pronunciation feedback for learners, guide and correct their continuous training, and improve their oral pronunciation. The system includes pronunciation demonstration, pronunciation follow-up, pronunciation comparison, and pronunciation scoring functions. The system also provides the function of “continuous following of original sounds”, which can continuously follow and score the original sounds, and comprehensively compare the pronunciation of each speech of the learners. For each pronunciation, according to the resonance peak image, the mouth shape was improved. The effective rate of correcting vowel pronunciation was 0.899, and the effective rate of correcting word pronunciation was 0.928, which had a certain guiding effect on student pronunciation correction.
Currently, drug and alcohol addiction has become a major menace to society39;s youth. As responsible citizens of this country, we must act now to keep these young brains from succumbing to this lethal addiction. In ...
详细信息
暂无评论