In today’s digital world, we are committed to digitizing thousands of handwritten transcriptions to preserve their content. Historical Arabic Handwritten Text Recognition (HAHTR) remains a challenge for computer visi...
详细信息
In today’s digital world, we are committed to digitizing thousands of handwritten transcriptions to preserve their content. Historical Arabic Handwritten Text Recognition (HAHTR) remains a challenge for computer vision systems, due to the many difficulties inherently associated with document image quality and the complexity of Arabic script. In this work, we address the problem of recognizing historical Arabic documents that adapts to different writing styles and degrees of legibility. We developed a system that is able to recognize a whole page of a historical Arabic handwritten text in two consecutive steps comprising text line detection and recognition. The proposed approach performs detection using bounding boxes followed by a neural network-based model for character-level text recognition. However, the lack of data hinders the mass digitization of Arabic historical documents. Therefore, we provide a new and freely available dataset, focusing on diverse handwriting styles to facilitate a strong generalization of the trained model. This dataset will significantly benefits researchers and practitioners by accelerating progress in the field of HAHTR. Extensive experimental work demonstrates that the recognition models are effective when trained with different sources of data, and having different writing styles does not penalize the model’s ability to generalize but rather enhances it. Additionally, we define and develop a new metric to evaluate model robustness against character misclassification, particularly for characters with similar patterns. The experiments conducted demonstrated that the proposed HAHTR pipeline is accurate and highly generalizable, as well as the validity of bounding box methods for detecting text lines. The training approach with different data sources enabled us to surpass the state-of-the-art results with 5.7% of Character Error Rate (CER) on the KHATT database.
The advancement of the Internet of Medical Things (IoMT) has led to the emergence of various health and emotion care services, e.g., health monitoring. To cater to increasing computational requirements of IoMT service...
详细信息
The advancement of the Internet of Medical Things (IoMT) has led to the emergence of various health and emotion care services, e.g., health monitoring. To cater to increasing computational requirements of IoMT services, Mobile Edge Computing (MEC) has emerged as an indispensable technology in smart health. Benefiting from the cost-effectiveness of deployment, unmanned aerial vehicles (UAVs) equipped with MEC servers in Non-Orthogonal Multiple Access (NOMA) have emerged as a promising solution for providing smart health services in proximity to medical devices (MDs). However, the escalating number of MDs and the limited availability of communication resources of UAVs give rise to a significant increase in transmission latency. Moreover, due to the limited communication range of UAVs, the geographically-distributed MDs lead to workload imbalance of UAVs, which deteriorates the service response delay. To this end, this paper proposes a UAV-enabled Distributed computation Offloading and Power control method with Multi-Agent, named DOPMA, for NOMA-based IoMT environment. Specifically, this paper introduces computation and transmission queue models to analyze the dynamic characteristics of task execution latency and energy consumption. Moreover, a credit assignment scheme-based reward function is designed considering both system-level rewards and rewards tailored to each MD, and an improved multi-agent deep deterministic policy gradient algorithm is developed to derive offloading and power control decisions independently. Extensive simulations demonstrate that the proposed method outperforms existing schemes, achieving \(7.1\%\) reduction in energy consumption and \(16\%\) decrease in average delay.
This book constitutes the refereed proceedings of the 10th information Retrieval Societies Conference, AIRS 2014, held in Kuching, Malaysia, in December 2014. The 42 full papers were carefully reviewed and selected fr...
详细信息
ISBN:
(数字)9783319128443
ISBN:
(纸本)9783319128436
This book constitutes the refereed proceedings of the 10th information Retrieval Societies Conference, AIRS 2014, held in Kuching, Malaysia, in December 2014.
The 42 full papers were carefully reviewed and selected from 110 submissions. Seven tracks were the focus of the AIR 2014 and they were IR models and theories; IR evaluation, user study and interactive IR; web IR, scalability and IR in social media; multimedia IR; natural language processing for IR; machine learning and data mining for IR and IR applications.
暂无评论