In this paper, we aim to explore an uncharted territory, which is Chinese multimodal named entity recognition (NER) with both textual and acoustic contents. To achieve this, we construct a large-scale human-annotated ...
详细信息
ISBN:
(纸本)9781954085527
In this paper, we aim to explore an uncharted territory, which is Chinese multimodal named entity recognition (NER) with both textual and acoustic contents. To achieve this, we construct a large-scale human-annotated Chinese multimodal NER dataset, named CNERTA. Our corpus totally contains 42,987 annotated sentences accompanying by 71 hours of speech data. based on this dataset, we propose a family of strong and representative baseline models, which can leverage textual features or multimodal features. Upon these baselines, to capture the natural monotonic alignment between the textual modality and the acoustic modality, we further propose a simple multimodal multitask model by introducing a speech-to-text alignment auxiliary task. through extensive experiments, we observe that: (1) Progressive performance boosts as we move from unimodal to multimodal, verifying the necessity of integrating speech clues into Chinese NER. (2) Our proposed model yields state-of-the-art (SoTA) results on CNERTA, demonstrating its effectiveness.
In this paper, a feature-level fusion based approach is proposed for blind image steganalysis. We choose three types of typical higher-order statistics as the candidate features for fusion and make use of the Boosting...
详细信息
the proceedings contain 43 papers. the topics discussed include: a probabilistic grouping principle to go from pixels to visual structures;an arithmetic and combinatorial approach to three-dimensional discrete lines;s...
ISBN:
(纸本)9783642198663
the proceedings contain 43 papers. the topics discussed include: a probabilistic grouping principle to go from pixels to visual structures;an arithmetic and combinatorial approach to three-dimensional discrete lines;smooth 2D coordinate systems on discrete surfaces;an improved Riemannian metric approximation for graph cuts;associating cell complexes to four dimensional digital objects;hierarchic Euclidean skeletons in cubical complexes;a unified topological framework for digital imaging;isthmus-based6-directional parallel thinning algorithms;quasi-linear transformations, numeration systems and fractals;path-based distance with varying weights and neighborhood sequences;sparse object representations by digital distance functions;efficient robust digital hyperplane fitting with bounded error;circular arc reconstruction of digital contours with chosen Hausdorff error;and an error bounded tangent estimator for digitized elliptic curves.
the proceedings contain 71 papers. the topics discussed include: soft computing algorithms applied to the segmentation of nerve cell images;patternrecognitionbased on time-frequency distributions of radar micro-Dopp...
详细信息
ISBN:
(纸本)0769522947
the proceedings contain 71 papers. the topics discussed include: soft computing algorithms applied to the segmentation of nerve cell images;patternrecognitionbased on time-frequency distributions of radar micro-Doppler dynamics;a quantitative software quality evaluation model for the artifacts of component based development;a new approach to software requirements elicitation;using data mining technology to design an intelligent CIM system for IC manufacturing;data mining for imprecise temporal associations;analysis of breast cancer using data mining and statistical techniques;analyzing the conditions of coupling existence based on program slicing and some abstract information-flow;a study of model layers and reflection;a general scalable implementation of fast matrix multiplication algorithms on distributed memory computers;error prediction for multi-classification;an integer support vector machine;and layered neural networks computations.
this paper introduces a novel approach for dynamic structuring of contextual lattices. It is anticipated that the approach can be applied to improve the accuracy of word-segmentation patterns in autonomous text recogn...
详细信息
In a country like India, a single text line of most of the official documents contains two different script words. Under two-language formula, the Indian documents are written in English and the state official languag...
详细信息
the great heterogeneity of web based Learning systems storing and providing digital e-learning data requires the introduction of interoperability aspects in order to resolve integration problems in a flexible and dyna...
详细信息
Named Entity recognition (NER) is a fundamental and widely used task in natural language processing (NLP), which is generally trained on the human-annotated corpus. However, data annotation is costly and time-consumin...
详细信息
ISBN:
(纸本)9781954085541
Named Entity recognition (NER) is a fundamental and widely used task in natural language processing (NLP), which is generally trained on the human-annotated corpus. However, data annotation is costly and time-consuming, which restricts its scale and further leads to the performance bottleneck of NER models. In reality, we can conveniently collect large-scale entity dictionaries and distantly supervised data. However, the collected dictionaries are lack of semantic context and the distantly supervised training instances contain large noise, which will bring uncertain effects to NER models when directly incorporated into the high-quality training set. To address the above issue, we propose a BERT-based decoupled NER model with two-stage training to appropriately take advantage of the heterogeneous corpus, including dictionaries, distantly supervised instances, and human-annotated instances. Our decoupled model consists of a Mention-BERT and a Context-BERT to respectively learn from the context-deficient dictionaries and noised distantly supervised instances at the pre-training stage. At the unifiedtraining stage, the two BERTs are trained together on human-annotated data to predict the correct labels for candidate regions. Empirical studies on three Chinese NER datasets demonstrate that our method achieves significant improvements against several baselines, establishing the new state-of-the-art performance.
this book contains refereed and improved papers presented at the 6th IAPR workshop on graphics recognition (GREC 2005). this year is the tenth anniversary of GREC, which was started in 1995 and has been held every 2 y...
详细信息
ISBN:
(数字)9783540347125
ISBN:
(纸本)9783540347118
this book contains refereed and improved papers presented at the 6th IAPR workshop on graphics recognition (GREC 2005). this year is the tenth anniversary of GREC, which was started in 1995 and has been held every 2 years: GREC 1995 in Penn State University, USA (LNCS Volume 1072, Springer, 1996); GREC 1997 in Nancy, France (LNCS Volume 1389, Springer, 1998); GREC 1999 in Jaipur, India (LNCS Volume 1941, Springer, 2000); GREC 2001 in Kingston, Canada (LNCS Volume 2390, Springer, 2002); and GREC 2003 in Barcelona, Spain (LNCS Volume 3088, Springer, 2004). GREC is the main event of IAPR TC-10 (the Technical Committee on graphics recognition within the international Association for patternrecognition) and provides an excellent opportunity for researchers and practitioners at all levels of experience to meet colleagues and to share new ideas and knowledge about graphics recognition methods. graphics recognition is a particular field in the domain of document analysis, which combines patternrecognition and image processing techniques for the analysis of any kind of graphical information in documents from either paper or electronic formats. In its 10 year history, the graphics recognition community has extended its research topics from the analysis and understanding of graphic documents (including engineering drawings vectorization and recognition), to graphics-based information retrieval and symbol recognition, to new media analysis, and even stepped into research areas of other communities, e. g.
暂无评论