The research describes experimentally determine the heat transfer coefficient, Nusselt number, pressure drop, and friction factor of CoFe2O4 /water nanofluids flow in a tube during turbulent flow and then predict them...
详细信息
Handwritten Text image Recognition (HTR) in the medical profession has become the major challenge due to the complexity of doctors' handwriting styles and the critical need for precise and efficient text recogniti...
详细信息
ISBN:
(数字)9798350389692
ISBN:
(纸本)9798350389708
Handwritten Text image Recognition (HTR) in the medical profession has become the major challenge due to the complexity of doctors' handwriting styles and the critical need for precise and efficient text recognition. This study proposes an innovative approach that combines the power of Bidirectional Long Short-Term Memory networks (BLSTM) and Convolutional Neural Networks (CNN) to address these challenges effectively. The proposed model harnesses the bidirectional capabilities of BLSTMs to capture contextual dependencies within doctors' handwritten notes, enabling it to understand and interpret handwriting more accurately. Additionally, CNNs are employed for feature extraction, enabling the model to recognize salient patterns and representations within handwritten text images. This paper presents comprehensive experiments conducted on a diverse dataset of doctors' handwritten notes, demonstrating the model's superior performance compared to conventional approaches and outperforms the existing work in terms of accuracy. Two different crucial metric word error rate and character error rate are accessed through the combination of BLSTM and CNN that yields state-of-the-art accuracy, robustness to variations in handwriting styles, and remarkable adaptability across various medical documents in the context of Nepal.
Voice Cloning employs technology and algorithms to create an artificial or synthetic reproduction of a person’s voice. To understand and mimic the distinct vocal qualities of the target speaker, including tone, pitch...
详细信息
ISBN:
(数字)9798350394474
ISBN:
(纸本)9798350394481
Voice Cloning employs technology and algorithms to create an artificial or synthetic reproduction of a person’s voice. To understand and mimic the distinct vocal qualities of the target speaker, including tone, pitch, cadence, and pronunciation, a machine learning model must be trained on audio samples of the speaker. This project presents itself as a revolutionary approach to the enduring problem of successful cross-lingual interactions, the paper’s novel approach to speech-to-speech machine translation that incorporates voice cloning. This method allows us to synthesize speech in the target language while maintaining the original speaker’s vocal characteristics. The model algorithms such as Whisper AI(WER 5% for transcription, No-Language-Left-Behind-200(44% more accurate than other models) for translation, Tacotron for speech synthesis and multiple transformers for voice cloning will be incorporated. The Voice Cloning model operates effectively on unseen voices, transcending the limitations of voice cloning will be incorporated. The voice cloning model operates effectively on unseen voices, transcending the limitations of relying solely on pre-trained models. The model successfully clones users’ voices in a different language with an approximate latency of 10 seconds.
With its vast range of applications, including object recognition, image classification, and pattern recognition, the Convolutional Neural Network (CNN) algorithm has brought about a paradigm shift in the field of com...
详细信息
The trend for the need for at-home workout regimens and the potential for technology to support the trend is rising as remote employment becomes more and more common. One such activity that has benefits like increased...
详细信息
ISBN:
(数字)9798350372847
ISBN:
(纸本)9798350372854
The trend for the need for at-home workout regimens and the potential for technology to support the trend is rising as remote employment becomes more and more common. One such activity that has benefits like increased flexibility and stress alleviation is yoga. Presently, numerous self-guided yoga courses in the form of pictures and videos are available online. Consequently, there is an urgent requirement for a system that can recognize and evaluate the precision of yoga posture execution in these instructional materials. In the current scenario lightweight prediction models are very much essential because devices like smartphones, edge devices, and embedded systems that have limited processing power or battery life, lightweight models are the best option as they require less resources to train and operate. One particular approach of making the recognition system lightweight is the use of keypoint-based pose/action classifiers. In this paper, a lightweight classifier for yoga postures based on neural network architecture using MoveNet pose estimation technique is proposed. Our model is able to classify the yoga postures by learning the human joints information in a efficient manner.
Identifying what front-end library runs on a web page is challenging. Although many mature detectors exist on the market, they suffer from false positives and the inability to detect libraries bundled by packers such ...
ISBN:
(纸本)9798350329964
Identifying what front-end library runs on a web page is challenging. Although many mature detectors exist on the market, they suffer from false positives and the inability to detect libraries bundled by packers such as Webpack. Most importantly, the detection features they use are collected from developers' knowledge leading to an inefficient manual workflow and a large number of libraries that the existing detectors cannot detect. This paper introduces PTdetector, which provides the first automated method for generating features and detecting libraries on web pages. We propose a novel data structure, the pTree, which we use as a detection feature. The pTree is well-suited for automation and addresses the limitations of existing detectors. We implement PTdetector as a browser extension and test it on 200 top-traffic websites. Our experiments show that PTdetector can identify packer-bundled libraries, and its detection results outperform existing tools.
Kserve, commonly referred to as KFServing, is an open-source serving framework for machine learning (ML) models that runs on Kubernetes. enables the deployment, servicing, and management of ML models in a production e...
详细信息
Aspect-level sentiment analysis is a modern technology that enables people to share their views or emotions with products, services, or topics based on their aspects. Aspect-level sentiment analysis focuses on aspect ...
详细信息
ISBN:
(数字)9798350391770
ISBN:
(纸本)9798350391787
Aspect-level sentiment analysis is a modern technology that enables people to share their views or emotions with products, services, or topics based on their aspects. Aspect-level sentiment analysis focuses on aspect extraction, which considers aspects from user reviews. Various methods and approaches have been employed for aspect extraction. Although these methods show better performance and attempt to overcome various challenges, many unsolved problems still need to be solved, such as sentimental ambiguity, opinion misprediction, a weak sentiment lexicon, and implicit sentiment detection. Numerous sentiment lexicons have been employed for word polarity extraction. Dictionaries also play an important role in sentiment classification. There are various dictionaries, but it has been found that issues such as one dictionary might not work properly for all domains. This study analyzes various existing sentiment lexicons employed for domain-specific sentiment analysis. Various approaches have been employed to create a domain-dependent sentiment lexicon and its effect on aspect-level sentiment analyses.
TPC (Three-Phase Consolidation) is here introduced as a simple but effective approach to continually learn new classes (and/or instances of known classes) while controlling forgetting of previous knowledge. Each exper...
Interacting and understanding with text heavy visual content with multiple images is a major challenge for traditional vision models. This paper is on enhancing vision models' capability to comprehend or understan...
详细信息
暂无评论