咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >DeCNT: Deep Deformable CNN for... 收藏

DeCNT: Deep Deformable CNN for Table Detection

作     者:Siddiqui, Shoaib Ahmed Malik, Muhammad Imran Agne, Stefan Dengel, Andreas Ahmed, Sheraz 

作者机构:German Res Ctr Artificial Intelligence D-67663 Kaiserslautern Germany Univ Kaiserslautern Dept Comp Sci D-67663 Kaiserslautern Germany Natl Univ Sci & Technol Sch Elect Engn & Comp Sci Islamabad Pakistan 

出 版 物:《IEEE ACCESS》 (IEEE Access)

年 卷 期:2018年第6卷

页      面:74151-74161页

核心收录:

基  金:BMBF Project DeFuseNN [01IW17002] NVIDIA AI Lab Program 

主  题:Deep learning representation learning convolutional neural networks object detection deformable convolution table detection table spotting faster R-CNN FPN 

摘      要:This paper presents a novel approach for the detection of tables present in documents, leveraging the potential of deep neural networks. Conventional approaches for table detection rely on heuristics that are error prone and specific to a dataset. In contrast, the presented approach harvests the potential of data to recognize tables of arbitrary layout. Most of the prior approaches for table detection are only applicable to PDFs, whereas, the presented approach directly works on images making it generally applicable to any format. The presented approach is based on a novel combination of deformable CNN with faster R-CNN/FPN. Conventional CNN has a fixed receptive field which is problematic for table detection since tables can be present at arbitrary scales along with arbitrary transformations (orientation). Deformable convolution conditions its receptive field on the input itself allowing it to mold its receptive field according to its input. This adaptation of the receptive field enables the network to cater for tables of arbitrary layout. We evaluated the proposed approach on two major publicly available table detection datasets: ICDAR-2013 and ICDAR-2017 POD. The presented approach was able to surpass the state-of-the-art performance on both ICDAR-2013 and ICDAR-2017 POD datasets with a F-measure of 0.994 and 0.968, respectively, indicating its effectiveness and superiority for the task of table detection.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分