The rise in demand for real-time applications, such as live streaming, online gaming, and Internet telephony, has highlighted the necessity for transport protocols that offer low latency and network stability. Traditi...
详细信息
RFID technology offers an affordable and user-friendly solution for contactless identification of objects and individuals. However, the widespread adoption of RFID systems raises concerns regarding security and privac...
详细信息
Medical image segmentation of anatomical structures and pathology is crucial in modern clinical diagnosis, disease study, and treatment planning. To date, great progress has been made in deep learning-based segmentati...
详细信息
Diabetes is a metabolic disorder that results in a retinal complication called diabetic retinopathy(DR)which is one of the four main reasons for sightlessness all over the *** usually has no clear symptoms before the ...
详细信息
Diabetes is a metabolic disorder that results in a retinal complication called diabetic retinopathy(DR)which is one of the four main reasons for sightlessness all over the *** usually has no clear symptoms before the onset,thus making disease identication a challenging *** healthcare industry may face unfavorable consequences if the gap in identifying DR is not lled with effective ***,our objective is to develop an automatic and cost-effective method for classifying DR *** this work,we present a custom Faster-RCNN technique for the recognition and classication of DR lesions from retinal *** pre-processing,we generate the annotations of the dataset which is required for model ***,introduce DenseNet-65 at the feature extraction level of Faster-RCNN to compute the representative set of key ***,the Faster-RCNN localizes and classies the input sample into ve *** experiments performed on a Kaggle dataset comprising of 88,704 images show that the introduced methodology outperforms with an accuracy of 97.2%.We have compared our technique with state-of-the-art approaches to show its robustness in term of DR localization and ***,we performed cross-dataset validation on the Kaggle and APTOS datasets and achieved remarkable results on both training and testing phases.
This paper presents the evolving role of artificial intelligence (AI) in improving internal control and management processes. AI-driven technologies, including Generative Adversarial Networks (GANs) and ontologies, in...
详细信息
ISBN:
(数字)9798350369106
ISBN:
(纸本)9798350369113
This paper presents the evolving role of artificial intelligence (AI) in improving internal control and management processes. AI-driven technologies, including Generative Adversarial Networks (GANs) and ontologies, in addition to enabling data-supported decision-making, and empowering local governments to execute administrative tasks more efficiently and effectively. The study explores the wide range of opportunities that AI offers, highlighting the potential for optimizing internal workflows. Besides, it delves into key challenges, models, and methods for fully integrating AI into organizational processes, providing actionable insights for achieving streamlined, data-driven management systems.
Voice cloning has numerous useful applications, including assisting individuals who have lost their ability to speak, movie dubbing, and translating voices into different languages. However, voice cloning in Bangla is...
详细信息
ISBN:
(数字)9798331519094
ISBN:
(纸本)9798331519100
Voice cloning has numerous useful applications, including assisting individuals who have lost their ability to speak, movie dubbing, and translating voices into different languages. However, voice cloning in Bangla is still in its early stages. In this work, we explore two standard approaches for zero-shot Bengali voice cloning, which can generate cloned audio from short samples, even for Bengali speakers not included in the training data. Despite using limited datasets, our models achieve cloning quality comparable to state-of-the-art models trained on much larger datasets, maximizing potential benefits for the Bengali community. We use a neural network technique called the speaker encoder (SE) voice cloning method, which achieved a Mean Opinion Score (MOS) of 3.8 and 83% cosine similarity between the original and cloned voice. This model was trained on a custom multi-speaker dataset. Additionally, we employ a speaker converter (SC) approach trained on a single-speaker dataset, which achieved a MOS of 4.0 and 82.64% cosine similarity. By combining the two models in an ensemble, we further improved performance, reaching an MOS of 4.4 and a cosine similarity of 85.21%. Our proposed approach can be an important step toward developing voice cloning technology for the Bengali language, potentially positively impacting the community. The demos can be accessed here.
Understanding students' behavior in online courses may provide teachers with useful information to improve their educational design and provide insights for content and instructional designers to develop personali...
详细信息
Picocells can enhance the capacity and spectral efficiency of heterogeneous wireless networks (HWNs) and reduce user equipment (UE) power consumption. Also, the wireless backhaul technology has emerged as an efficient...
详细信息
We are currently living in societies that are profoundly concerned about the impact of current and potential technologies on our present and future lives. How is the future of Artificial Intelligence (AI) impact perce...
详细信息
Background: As visual inspection is an inherent process during radiological screening, the associated eye gaze data can provide valuable insights into relevant clinical decision processes and facilitate computer-assis...
详细信息
暂无评论