This study employed deep learning to analyze a substantial data set of 109.13 million COVID-19-related microblogs, leading to the construction of a specialized risk perception indicator dictionary. Employing this dict...
详细信息
This study employed deep learning to analyze a substantial data set of 109.13 million COVID-19-related microblogs, leading to the construction of a specialized risk perception indicator dictionary. Employing this dictionary, we were able to capture the dynamic fluctuations in risk perception within online communities across various cities in real time. This approach highlighted the varying intensities of public response to the evolving crisis during the isolation and normalization stages of the pandemic. We observed that COVID-19-related transmission threat and information uncertainty significantly influenced public risk perception at different stages of the pandemic. Innovatively, our study quantifies public psychological resilience within online communities by examining the equilibrium between public risk perception and objective COVID-19-related risks. This equilibrium is conceptualized as the alignment of public perception with the evolving reality of COVID-19 threat and information. We investigated psychological resilience in two dimensions: adaptability, indicated by the extent of deviation from this equilibrium, and agility, reflected in the rate at which equilibrium is reestablished. Our study not only unveils new insights into the intricate relationship among public risk perception, the evolving risks, and psychological resilience but also offers empirical evidence to inform risk management strategies in online communities at different stages of a crisis. This research provides essential insights into how public perception and emotional responses during health crises like COVID-19 can be monitored and analyzed through social media data. By utilizing advanced analytical methods, including naturallanguageprocessing (NLP) and panel vector error correction (PVEC) modeling, the study successfully quantified the psychological resilience of online communities. These methods allow for the real-time assessment of how communities adapt and respond to evolving risks
Although most current large multimodal models (LMMs) can already understand photos of natural scenes and portraits, their understanding of abstract images, e.g., charts, maps, or layouts, and visual reasoning capabili...
详细信息
Single-cell RNA sequencing (scRNA-seq) has emerged as a pivotal tool for exploring cellular landscapes across diverse species and tissues. Precise annotation of cell types is essential for understanding these landscap...
详细信息
Single-cell RNA sequencing (scRNA-seq) has emerged as a pivotal tool for exploring cellular landscapes across diverse species and tissues. Precise annotation of cell types is essential for understanding these landscapes, relying heavily on empirical knowledge and curated cell marker databases. In this study, we introduce MarkerGeneBERT, a naturallanguageprocessing (NLP) system designed to extract critical information from the literature regarding species, tissues, cell types, and cell marker genes in the context of single-cell sequencing studies. Leveraging MarkerGeneBERT, we systematically parsed full-text articles from 3702 single-cell sequencing-related studies, yielding a comprehensive collection of 7901 cell markers representing 1606 cell types across 425 human tissues/subtissues, and 8223 cell markers representing 1674 cell types across 482 mouse tissues/subtissues. Comparative analysis against manually curated databases demonstrated that our approach achieved 76% completeness and 75% accuracy, while also unveiling 89 cell types and 183 marker genes absent from existing databases. Furthermore, we successfully applied the compiled brain tissue marker gene list from MarkerGeneBERT to annotate scRNA-seq data, yielding results consistent with original studies. Conclusions: Our findings underscore the efficacy of NLP-based methods in expediting and augmenting the annotation and interpretation of scRNA-seq data, providing a systematic demonstration of the transformative potential of this approach. The 27323 manual reviewed sentences for training MarkerGeneBERT and the source code are hosted at https://***/chengpeng1116/MarkerGeneBERT.
text summarization is a task in naturallanguageprocessing that automatically generates the summary from the source document in a human-written form with minimal loss of information. Research in text summarization ha...
详细信息
text summarization is a task in naturallanguageprocessing that automatically generates the summary from the source document in a human-written form with minimal loss of information. Research in text summarization has shifted towards abstractive text summarization due to its challenging aspects. This study provides a broad systematic literature review of abstractive text summarization on single-document summarization to gain insights into the challenges, widely used datasets, evaluation metrics, approaches, and methods. This study reviews research articles published between 2011 and 2023 from popular electronic databases. In total, 226 journal and conference publications were included in this review. The in-depth analysis of these papers helps researchers understand the challenges, widely used datasets, evaluation metrics, approaches, and methods. This article identifies and discusses potential opportunities and directions along with a generic conceptual framework and guidelines on abstractive summarization models and techniques for research in abstractive text summarization.
Contact centers handle both chat and voice calls for the same domain. As part of their workflow, it is a standard practice to summarize the conversations once they conclude. A significant distinction between chat and ...
详细信息
Up until now, in the field of naturallanguageprocessing and Computational Text Analysis methods (CTAM) most studies focused on logical-grammatical analysis or, more recently, on content and sentiment analysis. Howev...
详细信息
ISBN:
(纸本)9783031559167;9783031559174
Up until now, in the field of naturallanguageprocessing and Computational Text Analysis methods (CTAM) most studies focused on logical-grammatical analysis or, more recently, on content and sentiment analysis. However, there is still limited reference to the role of the discursive process: that is, how language's use shapes the reality of sense in which we live in. But how can we gain a deep knowledge and understanding of the sense of what is conveyed by a text? In order to investigate the process of sense's reality configuration, we introduce Dialogic Process Analysis. Starting from the formalization of 24 rules of naturallanguage's use of transversal to every idiom, called Discursive Repertories, Dialogic Process Analysis allows to describe how discursive processes unravel and to trace precisely the elements that generate each specific sense's reality, which may be different even when contents and meanings are the same. Although researchers are able to denominate the Discursive Repertories, performing such a task requires specific and complex analysis expertise: that is why the application of Machine Learning models can lighten these problems. Thus, in this work we present the Dialogic Process Analysis research programme, its experimentations and results in the definition of its own Machine Learning model for textual data analysis and its future lines of development.
Philology, the study of ancient manuscripts, demands years of professional training in extensive knowledge memorization and manual textual retrieval. Despite these requirements align closely with strengths of recent s...
详细信息
Recent research has investigated the use of generative language models to produce regular expressions with semantic-based approaches. However, these approaches have shown shortcomings in practical applications, partic...
详细信息
The proceedings contain 185 papers. The topics discussed include: ontology population reusing resources for dialogue intent detection: generic and multilingual approach;efficient multilingual text classification for I...
ISBN:
(纸本)9789544520724
The proceedings contain 185 papers. The topics discussed include: ontology population reusing resources for dialogue intent detection: generic and multilingual approach;efficient multilingual text classification for Indian languages;domain adaptation for Hindi-Telugu machine translation using domain specific back translation;towards a better understanding of noise in naturallanguageprocessing;comparing supervised machine learning techniques for genre analysis in software engineering research articles;enriching the transformer with linguistic factors for low-resource machine translation;interactive learning approach for Arabic target-based sentiment analysis;probabilistic ensembles of zero- and few-shot learning models for emotion classification;and predicting the factuality of reporting of news media using observations about user attention in their YouTube channels.
The adoption of large language models (LLMs) to assist clinicians has attracted remarkable attention. Existing works mainly adopt the close-ended question-answering (QA) task with answer options for evaluation. Howeve...
详细信息
暂无评论