With the rapid development of document digitization, people have become accustomed to capturing and processing documents using electronic devices such as smartphones. However, the captured document images often suffer...
With the rapid development of document digitization, people have become accustomed to capturing and processing documents using electronic devices such as smartphones. However, the captured document images often suffer from issues like shadows and noise due to environmental factors, which can affect their readability. To improve the quality of captured document images, researchers have proposed a series of models or frameworks and applied them in distinct scenarios such as image enhancement, and document information extraction. In this paper, we primarily focus on shadow removal methods and open-source datasets. We concentrate on recent advancements in this area, first organizing and analyzing nine available datasets. Then, the methods are categorized into conventional methods and neural network-based methods. Conventional methods use manually designed features and include shadow map-based approaches and illumination-based approaches. Neural network-based methods automatically generate features from data and are divided into single-stage approaches and multi-stage approaches. We detail representative algorithms and briefly describe some typical techniques. Finally, we analyze and discuss experimental results, identifying the limitations of datasets and methods. Future research directions are discussed, and nine suggestions for shadow removal from document images are proposed. To our knowledge, this is the first survey of shadow removal methods and related datasets from document images.
Applying deep learning techniques to Electroencephalogram (EEG) data has shown great potential in the field of depression detection. However, existing EEG-based depression detection models face challenges: they strugg...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Applying deep learning techniques to Electroencephalogram (EEG) data has shown great potential in the field of depression detection. However, existing EEG-based depression detection models face challenges: they struggle to capture the complex spatiotemporal dependencies and the complementary nature of spatiotemporal information in EEG data; functional connectivity between brain regions is not sufficiently considered. To address these issues, we propose a new Spatial-Temporal Graph-Enhanced Transformer, named STGE-Former. Raw EEG signals are first mapped to Spatial-Temporal Shared Embeddings, then processed by the Spatial Attention Stream and the Temporal Graph-Enhanced Attention Stream to extract spatiotemporal complementary information, and finally classified through a classification head. Experimental results on the MODMA dataset show that our model outperforms existing methods in the task of EEG-Based MDD Detection. STGE-Former provides a promising approach for automatic depression detection. The code is available at https://***/RockyChen0205/STGE-Former.
Predicting the three-dimensional structure of proteins from amino acid sequences with only a few remote homologs,or de novo prediction,remains a major challenge in computational *** modeling of the protein backbone re...
详细信息
Predicting the three-dimensional structure of proteins from amino acid sequences with only a few remote homologs,or de novo prediction,remains a major challenge in computational *** modeling of the protein backbone represents the initial phase of a protein structure prediction *** a parallel ant colony optimization based on sharing one pheromone matrix,this report proposes a parallel approach to predict the structure of a protein *** parallel approach combines various sources of energy functions and generates protein backbones with the lowest energies jointly determined by the various energy *** free modeling targets in CASP8/9 are used to evaluate the performance of the *** 13 targets in CASP8,two out of the predicted model1s selected by our approach are the best of the published CASP8 results,and seven out of the model1s are ranked in the top *** 29 targets in CASP9,20 out of the best models from our predictions are ranked in the top 10,and 11 out of the model1s are ranked in the top 10.
Frequent counting is a very so often required operation in machine learning algorithms. A typical machine learning task, learning the structure of Bayesian network (BN) based on metric scoring, is introduced as an e...
详细信息
Frequent counting is a very so often required operation in machine learning algorithms. A typical machine learning task, learning the structure of Bayesian network (BN) based on metric scoring, is introduced as an example that heavily relies on frequent counting. A fast calculation method for frequent counting enhanced with two cache layers is then presented for learning BN. The main contribution of our approach is to eliminate comparison operations for frequent counting by introducing a multi-radix number system calculation. Both mathematical analysis and empirical comparison between our method and state-of-the-art solution are conducted. The results show that our method is dominantly superior to state-of-the-art solution in solving the problem of learning BN.
Dense contrastive representation learning (DCRL) has greatly improved the learning efficiency for image dense prediction tasks, showing its great potential to reduce the large costs of medical image collection and den...
详细信息
Recent kernel-based PPI extraction systems achieve promising performance because of their capability to capture structural syntactic information, but at the expense of computational complexity. This paper incorporates...
详细信息
Recent kernel-based PPI extraction systems achieve promising performance because of their capability to capture structural syntactic information, but at the expense of computational complexity. This paper incorporates dependency information as well as other lexical and syntactic knowledge in a feature-based framework. Our motivation is that, considering the large amount of biomedical literature being archived daily, feature-based methods with comparable performance are more suitable for practical applications. Additionally, we explore the difference of lexical characteristics between biomedical and newswire domains. Experimental evaluation on the AIMed corpus shows that our system achieves comparable performance of 54.7 in F1-Score with other state-of-the-art PPI extraction systems, yet the best performance among all the feature-based ones.
This paper proposes a dependency-driven scheme to dynamically determine the syntactic parse tree structure for tree kernel- based anaphoricity determination in coreference resolution. Given a full syntactic parse tree...
详细信息
This paper proposes a dependency-driven scheme to dynamically determine the syntactic parse tree structure for tree kernel- based anaphoricity determination in coreference resolution. Given a full syntactic parse tree, it keeps the nodes and the paths related with current mention based on constituent dependencies from both syntactic and semantic perspectives, while removing the noisy information, eventually leading to a dependency-driven dynamic syntactic parse tree (D-DSPT). Evaluation on the ACE 2003 corpus shows that the D-DSPT outperforms all previous parse tree structures on anaphoricity determination, and that applying our anaphoricity determination module in coreference resolution achieves the so far best performance.
Event anaphora resolution plays a critical role in discourse analysis. This paper proposes a tree kernel- based framework for event pronoun resolution. In particular, a new tree expansion scheme is introduced to autom...
详细信息
ISBN:
(纸本)9781577355120
Event anaphora resolution plays a critical role in discourse analysis. This paper proposes a tree kernel- based framework for event pronoun resolution. In particular, a new tree expansion scheme is introduced to automatically determine a proper parse tree structure for event pronoun resolution by considering various kinds of competitive information related with the anaphor and the antecedent candidate. Evaluation on the OntoNotes English corpus shows the appropriateness of the tree kernel-based framework and the effectiveness of competitive information for event pronoun resolution.
This paper proposes a unified framework for zero anaphora resolution, which can be divided into three sub-tasks: zero anaphor detection, anaphoricity determination and antecedent identification. In particular, all the...
详细信息
暂无评论