Feature selection is a complex problem used across many fields, such as computer vision and data mining. Feature selection algorithms extract a subset of features from a greater feature set which can improve algorithm...
详细信息
Document Layout Analysis (DLA) is a segmentation process that decomposes a scanned document image into its blocks of interest and classifies them. DLA is essential in a large number of applications, such as Informatio...
详细信息
Document Layout Analysis (DLA) is a segmentation process that decomposes a scanned document image into its blocks of interest and classifies them. DLA is essential in a large number of applications, such as Information Retrieval, Machine Translation, Optical Character Recognition (OCR) systems, and structured data extraction from documents. However, identification of document blocks in DLA is challenging due to variations of block locations, inter-and intra-class variability, and background noises. In this paper, we propose a novel texture-based convolutional neural network for document layout analysis, called DoT-Net. DoT-Net is a multiclass classifier that can effectively identify document component blocks such as text, image, table, mathematical expression, and line-diagram, whereas most related methods have focused on the text vs. non-text block classification problem. DoT-Net can capture textural variations among the multiclass regions of documents. Our proposed method DoT-Net achieved promising results outperforming state-of-the-art document layout classifiers on accuracy, F1 score, and AUC. The open-source code of DoT-Net is available at https://***/datax-lab/DoTNet.
Background Understanding heterogeneity of structural brain changes in aging may provide insights into susceptibility to neurodegenerative diseases. We characterize the genetics underlying brain structural heterogeneit...
Background Understanding heterogeneity of structural brain changes in aging may provide insights into susceptibility to neurodegenerative diseases. We characterize the genetics underlying brain structural heterogeneity within cognitively unimpaired (CU) individuals using data-driven machine learning applied to a diverse dataset of 27,402 individuals from 11 neuroimaging studies from the iSTAGING consortium. Method Structural brain morphologic patterns of CU individuals were independently examined in four decade-long intervals spanning ages 45 to 85. Within each interval, Smile-GAN (Yang et al., 2021) was trained on baseline anatomic and white matter hyperintensity (WMH) volumes. Smile-GAN probability scores were used as phenotypes in genome-wide association studies (GWAS). Specifically, we performed multiple linear regressions controlling for confounders (e.g., age) via Plink (Purcell et al., 2007). We observed longitudinal clustering stability across decades, so individuals from adjacent age groups were combined into broader age groups ([45,65), [65,85)) due to the large sample requirement of GWAS. Genomic loci, represented by the top leading single nucleotide polymorphisms (SNPs), were defined considering linkage disequilibrium. We investigated associations of SNPs with clinical traits and mapped them to genes using the GWAS Catalog (Buniello et al., 2019). Result Three structural brain aging patterns, relative to resilient agers (A0), consistent across decades, emerged: A1, or ‘typical’ aging with low atrophy and WMHs, and two ‘advanced’ aging patterns, one showing elevated WMHs and modest atrophy (A2) and the other displaying severe, widespread atrophy and moderate WMH load (A3) (Figure 1). GWAS discovered eight and six genomic loci in [45,65) and [65,85) age groups, respectively (Table 1, Figure 2). The lead SNPs for A1 and A2 were previously associated with several cardiometabolic risk factors, WMHs, and regional brain volumes. Interestingly, rs4843552, previo
Face recognition has recently become ubiquitous in many scenes for authentication or security purposes. Meanwhile, there are increasing concerns about the privacy of face images, which are sensitive biometric data tha...
详细信息
The performance of alignment systems on property matching lags behind that on class and instance matching. This work seeks to understand the reasons for this and consider avenues for improvement. The paper contains an...
详细信息
This paper presents a new method for medical diagnosis of neurodegenerative diseases, such as Parkinson's, by extracting and using latent information from trained Deep convolutional, or convolutional-recurrent Neu...
详细信息
Collection and analysis of large volumes of ICU data are invaluable to the advancement of clinical knowledge, and large-scale ICU databases have been effective resources to understand risk factors and perform predicti...
详细信息
This article introduces a method by which the power grid's security level can be observed in advance based on the expected initial failure. Firstly, based on the general form of cascading trip, this paper gives a ...
详细信息
An indoor semantic trajectory is a sequence of timestamped semantic positions inside a building. However, its extraction depends on the erroneous indoor positioning. The error leads to an invalid trajectory that has d...
详细信息
ISBN:
(数字)9781728108582
ISBN:
(纸本)9781728108599
An indoor semantic trajectory is a sequence of timestamped semantic positions inside a building. However, its extraction depends on the erroneous indoor positioning. The error leads to an invalid trajectory that has distant consecutive positions. This invalid trajectory may lead to an issue of the non-sensical patterns when analyzing a big semantic trajectory data. To prevent extracting invalid trajectories, we apply the movement constraints to infer only close positions to the current position. We extend the constraints to several indoor positioning techniques, such as Hidden Markov Model, K-Nearest Neighbor, or Deep Neural Network. We show that our approach can effectively extract valid indoor semantic trajectories.
暂无评论