With the rapid development of document digitization, people have become accustomed to capturing and processing documents using electronic devices such as smartphones. However, the captured document images often suffer...
With the rapid development of document digitization, people have become accustomed to capturing and processing documents using electronic devices such as smartphones. However, the captured document images often suffer from issues like shadows and noise due to environmental factors, which can affect their readability. To improve the quality of captured document images, researchers have proposed a series of models or frameworks and applied them in distinct scenarios such as image enhancement, and document information extraction. In this paper, we primarily focus on shadow removal methods and open-source datasets. We concentrate on recent advancements in this area, first organizing and analyzing nine availab.e datasets. Then, the methods are categorized into conventional methods and neural network-based methods. Conventional methods use manually designed features and include shadow map-based approaches and illumination-based approaches. Neural network-based methods automatically generate features from data and are divided into single-stage approaches and multi-stage approaches. We detail representative algorithms and briefly describe some typical techniques. Finally, we analyze and discuss experimental results, identifying the limitations of datasets and methods. Future research directions are discussed, and nine suggestions for shadow removal from document images are proposed. To our knowledge, this is the first survey of shadow removal methods and related datasets from document images.
Applying deep learning techniques to Electroencephalogram (EEG) data has shown great potential in the field of depression detection. However, existing EEG-based depression detection models face challenges: they strugg...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Applying deep learning techniques to Electroencephalogram (EEG) data has shown great potential in the field of depression detection. However, existing EEG-based depression detection models face challenges: they struggle to capture the complex spatiotemporal dependencies and the complementary nature of spatiotemporal information in EEG data; functional connectivity between brain regions is not sufficiently considered. To address these issues, we propose a new Spatial-Temporal Graph-Enhanced Transformer, named STGE-Former. Raw EEG signals are first mapped to Spatial-Temporal Shared Embeddings, then processed by the Spatial Attention Stream and the Temporal Graph-Enhanced Attention Stream to extract spatiotemporal complementary information, and finally classified through a classification head. Experimental results on the MODMA dataset show that our model outperforms existing methods in the task of EEG-Based MDD Detection. STGE-Former provides a promising approach for automatic depression detection. The code is availab.e at https://***/RockyChen0205/STGE-Former.
Predicting the three-dimensional structure of proteins from amino acid sequences with only a few remote homologs,or de novo prediction,remains a major challenge in computational *** modeling of the protein backbone re...
详细信息
Predicting the three-dimensional structure of proteins from amino acid sequences with only a few remote homologs,or de novo prediction,remains a major challenge in computational *** modeling of the protein backbone represents the initial phase of a protein structure prediction *** a parallel ant colony optimization based on sharing one pheromone matrix,this report proposes a parallel approach to predict the structure of a protein *** parallel approach combines various sources of energy functions and generates protein backbones with the lowest energies jointly determined by the various energy *** free modeling targets in CASP8/9 are used to evaluate the performance of the *** 13 targets in CASP8,two out of the predicted model1s selected by our approach are the best of the published CASP8 results,and seven out of the model1s are ranked in the top *** 29 targets in CASP9,20 out of the best models from our predictions are ranked in the top 10,and 11 out of the model1s are ranked in the top 10.
Frequent counting is a very so often required operation in machine learning algorithms. A typical machine learning task, learning the structure of Bayesian network (BN) based on metric scoring, is introduced as an e...
详细信息
Frequent counting is a very so often required operation in machine learning algorithms. A typical machine learning task, learning the structure of Bayesian network (BN) based on metric scoring, is introduced as an example that heavily relies on frequent counting. A fast calculation method for frequent counting enhanced with two cache layers is then presented for learning BN. The main contribution of our approach is to eliminate comparison operations for frequent counting by introducing a multi-radix number system calculation. Both mathematical analysis and empirical comparison between our method and state-of-the-art solution are conducted. The results show that our method is dominantly superior to state-of-the-art solution in solving the problem of learning BN.
Event anaphora resolution plays a critical role in discourse analysis. This paper proposes a tree kernel- based framework for event pronoun resolution. In particular, a new tree expansion scheme is introduced to autom...
详细信息
ISBN:
(纸本)9781577355120
Event anaphora resolution plays a critical role in discourse analysis. This paper proposes a tree kernel- based framework for event pronoun resolution. In particular, a new tree expansion scheme is introduced to automatically determine a proper parse tree structure for event pronoun resolution by considering various kinds of competitive information related with the anaphor and the antecedent candidate. Evaluation on the OntoNotes English corpus shows the appropriateness of the tree kernel-based framework and the effectiveness of competitive information for event pronoun resolution.
This paper proposes a unified framework for zero anaphora resolution, which can be divided into three sub-tasks: zero anaphor detection, anaphoricity determination and antecedent identification. In particular, all the...
详细信息
This paper presents and implements an approach to parallel ACO algorithms. The principal idea is to make multiple ant colonies share and utilize only one pheromone matrix. We call it SHOP (SHaring One Pheromone matrix...
详细信息
Knowledge of noun phrase anaphoricity might be profitably exploited in coreference resolution to bypass the resolution of non-anaphoric noun phrases. However, it is surprising to notice that recent attempts to incorpo...
详细信息
This paper presents a topic-driven framework for generating a generic summary from multi-documents. Our approach is based on the intuition that, from the statistical point of view, the summary's probability distri...
详细信息
暂无评论