Search Results - Inner Mongolia University Library

Towards Accurate Recognition of Historical Arabic Manuscripts: A Novel Dataset and a Generalizable Pipeline

ACM Transactions on Asian and Low-Resource Language Information Processing, year 1000

Authors: Hakim Bouchal, Ahror Belaid, Farid Meziane

Affiliations: Laboratory of Medical Informatics and Intelligent and Dynamic Environments (LIMED), University of Bejaia, Faculty of Technology, 06000 Bejaia, Algeria; Laboratory of Medical Informatics and Intelligent and Dynamic Environments (LIMED), University of Bejaia, Faculty of Exact Sciences, 06000 Bejaia, Algeria; Data Science & Applications Research Unit, Research Centre for Scientific and Technical Information, 06000 Bejaia, Algeria; Adamim Office for Scientific Research, 16057 Eucalyptus, Algeria; Data Science Research Centre, University of Derby, DE22 3AW, United Kingdom of Great Britain and Northern Ireland

In today’s digital world, we are committed to digitizing thousands of handwritten transcriptions to preserve their content. Historical Arabic Handwritten Text Recognition (HAHTR) remains a challenge for computer vision systems, due to the many difficulties inherently associated with document image quality and the complexity of Arabic script. In this work, we address the problem of recognizing historical Arabic documents with a system that adapts to different writing styles and degrees of legibility. We developed a system that is able to recognize a whole page of historical Arabic handwritten text in two consecutive steps: text line detection and recognition. The proposed approach performs detection using bounding boxes, followed by a neural network-based model for character-level text recognition. However, the lack of data hinders the mass digitization of Arabic historical documents. Therefore, we provide a new and freely available dataset, focusing on diverse handwriting styles to facilitate strong generalization of the trained model. This dataset will significantly benefit researchers and practitioners by accelerating progress in the field of HAHTR. Extensive experimental work demonstrates that the recognition models are effective when trained with different sources of data, and that having different writing styles does not penalize the model’s ability to generalize but rather enhances it. Additionally, we define and develop a new metric to evaluate model robustness against character misclassification, particularly for characters with similar patterns. The experiments demonstrate that the proposed HAHTR pipeline is accurate and highly generalizable, and they confirm the validity of bounding-box methods for detecting text lines. The training approach with different data sources enabled us to surpass the state-of-the-art results with a Character Error Rate (CER) of 5.7% on the KHATT database.
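The abstract reports its headline result as a Character Error Rate. As a minimal sketch, assuming the common definition of CER as character-level edit distance divided by the reference length (the abstract does not state the exact formula or any text normalization used), the metric can be computed as follows. The function names `levenshtein` and `character_error_rate` are illustrative and do not come from the paper.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of character insertions, deletions and
    substitutions needed to turn `hyp` into `ref`."""
    # prev[j] holds the distance between the first i-1 chars of ref
    # and the first j chars of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(ref: str, hyp: str) -> float:
    """CER under the assumed definition: edit distance / len(ref)."""
    if not ref:
        raise ValueError("reference text must not be empty")
    return levenshtein(ref, hyp) / len(ref)
```

Under this definition, a CER of 5.7% corresponds to roughly 57 character edits per 1,000 reference characters.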

Keywords: Text Detection; Handwritten Text Recognition; Arabic Historical Documents; CNN-BLSTM; Arabic dataset

Artificial Intelligence and Its Practical Applications in the Digital Economy 1

Series: Lecture Notes in Networks and Systems

Year: 1000

Authors: Yahya Mohamed Elhadj, Mohamedade Farouk Nanne, Anis Koubaa, Farid Meziane, Mohamed Deriche

Distributed Computation Offloading and Power Control for UAV-Enabled Internet of Medical Things

ACM Transactions on Internet Technology, year 1000

Authors: Jiakun Gao, Xiaolong Xu, Lianyong Qi, Wanchun Dou, Xiaoyu Xia, Xiaokang Zhou

Affiliations: School of Software, Nanjing University of Information Science and Technology, Nanjing, China; School of Software, Nanjing University of Information Science and Technology, Nanjing, China and Jiangsu Province Engineering Research Center of Advanced Computing and Intelligent Services, Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology (CICAEET), Nanjing University of Information Science and Technology, Nanjing, China; College of Computer Science and Technology, China University of Petroleum (East China), Qingdao, China and State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China; State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China; School of Computer Technology, RMIT University, Victoria, Australia; Faculty of Data Science, Shiga University, Hikone, Japan

The advancement of the Internet of Medical Things (IoMT) has led to the emergence of various health and emotion care services, e.g., health monitoring. To cater to the increasing computational requirements of IoMT services, Mobile Edge Computing (MEC) has emerged as an indispensable technology in smart health. Benefiting from the cost-effectiveness of deployment, unmanned aerial vehicles (UAVs) equipped with MEC servers in Non-Orthogonal Multiple Access (NOMA) have emerged as a promising solution for providing smart health services in proximity to medical devices (MDs). However, the escalating number of MDs and the limited availability of communication resources of UAVs give rise to a significant increase in transmission latency. Moreover, due to the limited communication range of UAVs, the geographically distributed MDs lead to workload imbalance among UAVs, which worsens the service response delay. To this end, this paper proposes a UAV-enabled Distributed computation Offloading and Power control method with Multi-Agent, named DOPMA, for NOMA-based IoMT environments. Specifically, this paper introduces computation and transmission queue models to analyze the dynamic characteristics of task execution latency and energy consumption. Moreover, a credit assignment scheme-based reward function is designed considering both system-level rewards and rewards tailored to each MD, and an improved multi-agent deep deterministic policy gradient algorithm is developed to derive offloading and power control decisions independently. Extensive simulations demonstrate that the proposed method outperforms existing schemes, achieving a 7.1% reduction in energy consumption and a 16% decrease in average delay.

Keywords: Internet of Medical Things (IoMT); Mobile Edge Computing (MEC); Unmanned Aerial Vehicles (UAV); Non-Orthogonal Multiple Access (NOMA)

Applied Linear Algebra, Probability and Statistics 1

Series: Indian Statistical Institute Series

Year: 1000

Authors: Ravindra B. Bapat, Samir Kumar Neogy, Manjunatha Prasad Karantha, Stephen J. Kirkland, Sukanta Pati, Simo Puntanen

Information Retrieval Technology 1

Series: Lecture Notes in Computer Science

Year: 1000

Authors: Azizah Jaafar, Nazlena Mohamad Ali, Shahrul Azman Mohd Noah, Alan F. Smeaton, Peter Bruza, Zainab Abu Bakar, Nursuriati Jamil, Tengku Mohd Tengku Sembok

ISBN (digital): 9783319128443

ISBN (print): 9783319128436

This book constitutes the refereed proceedings of the 10th Asia Information Retrieval Societies Conference, AIRS 2014, held in Kuching, Malaysia, in December 2014. The 42 full papers were carefully reviewed and selected from 110 submissions. AIRS 2014 focused on seven tracks: IR models and theories; IR evaluation, user study and interactive IR; web IR, scalability and IR in social media; multimedia IR; natural language processing for IR; machine learning and data mining for IR; and IR applications.

Keywords: Information Storage and Retrieval; Database Management; Information Systems Applications (incl. Internet); Artificial Intelligence
