In today's world, everyone, from novices to experts, seeks to understand and profit from the complex stock market. However, predicting stock price movements remains a daunting task due to its frequent fluctuations...
详细信息
Burnout syndrome and depression are prevalent mental health problems in many societies today. Most existing methods used in clinical intervention and research are based on inventories. naturallanguageprocessing (NLP...
详细信息
ISBN:
(纸本)9781643682815;9781643682808
Burnout syndrome and depression are prevalent mental health problems in many societies today. Most existing methods used in clinical intervention and research are based on inventories. naturallanguageprocessing (NLP) enables new possibilities to automatically evaluate text in the context of clinical Psychology. In this paper, we show how affective word list ratings can be used to differentiate between texts indicating depression or burnout, and a control group. In particular, we show that depression and burnout show statistically significantly higher arousal than the control group.
Current NLP datasets targeting ambiguity can be solved by a native speaker with relative ease. We present Cryptonite, a large-scale dataset based on cryptic crosswords, which is both linguistically complex and natural...
详细信息
Cyberbullying represents one of the most recent challenges stemming from the widespread adoption of social media platforms. With the surge in social media usage, the right to express oneself is being misused. Statisti...
详细信息
The amount of digital text-based consumer review data has increased dramatically and there exist many machine learning approaches for automated text-based sentiment analysis. Marketing researchers have employed variou...
详细信息
The amount of digital text-based consumer review data has increased dramatically and there exist many machine learning approaches for automated text-based sentiment analysis. Marketing researchers have employed various methods for analyzing text reviews but lack a comprehensive comparison of their performance to guide method selection in future applications. We focus on the fundamental relationship between a consumer’s overall empirical evaluation, and the text-based explanation of their evaluation. We study the empirical tradeoff between predictive and diagnostic abilities, in applying various methods to estimate this fundamental relationship. We incorporate methods previously employed in the marketing literature, and methods that are so far less common in the marketing literature. For generalizability, we analyze 25,241 products in nine product categories, and 260,489 reviews across five review platforms. We find that neural network-based machine learning methods, in particular pre-trained versions, offer the most accurate predictions, while topic models such as Latent Dirichlet Allocation offer deeper diagnostics. However, neural network models are not suited for diagnostic purposes and topic models are ill equipped for making predictions. Consequently, future selection of methods to process text reviews is likely to be based on analysts’ goals of prediction versus diagnostics.
Current methods for acquiring datasets for human motion classification are limited to controlled settings where participants are directed by a human experiment organizer. Datasets acquired in controlled settings often...
详细信息
ISBN:
(纸本)9781665436694
Current methods for acquiring datasets for human motion classification are limited to controlled settings where participants are directed by a human experiment organizer. Datasets acquired in controlled settings often cannot capture natural human behaviors and are inadequate for obtaining large amounts of real-world data in a sustainable fashion. This paper proposes a new paradigm for automated acquisition of natural human movements based on Interactive RF Gaming. The training of AI/ML models is considered over an evolution of time: pre-deployment via physics-aware batch training and post-deployment via continual learning from interactions. Algorithms needed to address real-world considerations such as the parsing of continuous streams of data, computational constraints, real-time processing, and game design considerations based on cyberphysical human system requirements are also discussed.
Objectives: The main goal of this review is to introduce the work with various conversational datasets containing data from patients suffering from Alzheimer's disease. The basic questions we deal with in the syst...
详细信息
ISBN:
(纸本)9781728186863
Objectives: The main goal of this review is to introduce the work with various conversational datasets containing data from patients suffering from Alzheimer's disease. The basic questions we deal with in the systematic review are: (1) Which datasets are most often used in studies? (2) How does the number of participants in individual datasets vary? (3) What is the representation of individual groups in datasets? methods: We used databases Scopus, Web of Science and Google Scholar to create the report. Inclusion criteria have been created from key words - voice, speech, Alzheimer detection and naturallanguageprocessing. We have focused on articles from the beginning of 2019 until now. Articles that did not contain full-text in English were excluded. Results: The review contains a total of 37 studies in which their datasets can be examined. The most commonly used datasets in the articles are ADReSS, PITT and CCC. Many datasets were created for a specific study only, and some have not yet been made public. The range of participants in individual datasets ranges from 30 to 865. The most frequently used ADReSS dataset in the studies contained records of 156 participants. Conclusion: In several cases, the size of the dataset turns out to play an important role, but the overall quality of the dataset was a more significant factor. As a result of a deeper understanding of datasets, we concluded that many factors, such as the age of the participants, gender, or the number of education years, played a significant role as an indicator for Alzheimer's disease prediction.
Aspect-based financial sentiment analysis (ABFSA) is a fine-grained task that can enrich and sharpen financial analysis by identifying sentiments towards specific entities (e.g., company, stock). As the application of...
详细信息
ISBN:
(纸本)9798350342734
Aspect-based financial sentiment analysis (ABFSA) is a fine-grained task that can enrich and sharpen financial analysis by identifying sentiments towards specific entities (e.g., company, stock). As the application of ABSA in professional language, ABFSA is a challenging task requiring extensive domain knowledge while remaining understudied. Numeral understanding is crucial for financial text analysis, but existing NLP models for ABFSA lack such ability by mainly treating numerals as plain text. In addition, most studies on knowledge incorporation disregard necessary domain-specific connotations or suffer from the low coverage issue. In this paper, we propose a novel numeral-oriented network with a multi-source affective knowledge refinement strategy (NumAKEN) for ABFSA. NumAKEN utilizes a numeral encoding method based on DigitCNN to capture critical numeric concepts such as magnitude and category. A multi-source affective knowledge fusion strategy is designed for hybrid lexicon construction and incorporation, which can guide the model to capture significant sentiment clues as well as alleviate conflicting and coverage issues. Extensive experiments on two datasets illustrate that our NumAKEN model outperforms all state-of-the-art methods and verify the effectiveness of our model.
Brain decoding, understood as the process of mapping brain activities to the stimuli that generated them, has been an active research area in the last years. In the case of language stimuli, recent studies have shown ...
详细信息
ISBN:
(纸本)9781665405409
Brain decoding, understood as the process of mapping brain activities to the stimuli that generated them, has been an active research area in the last years. In the case of language stimuli, recent studies have shown that it is possible to decode fMRI scans into an embedding of the word a subject is reading. However, such word embeddings are designed for naturallanguageprocessing tasks rather than for brain decoding. Therefore, they limit the model's ability to recover the precise stimulus. In this work, we propose to directly classify an fMRI scan, mapping it to the corresponding word within a fixed vocabulary. Unlike existing work, we evaluate on scans from previously unseen subjects. We argue that this is a more realistic setup and we present a model that can decode fMRI data from unseen subjects with 2:62% Top-1 and 9:76% Top-5 accuracy in this challenging task. Moreover our model can be fine-tuned on data from the test subject to achieve 4:22% Top-1 and 12:87% Top-5 accuracy, significantly outperforming all the considered competitive baselines.
Ethics is one of the longest standing intellectual endeavors of humanity. In recent years, the fields of AI and NLP have attempted to address ethical issues of harmful outcomes in machine learning systems that are mad...
详细信息
ISBN:
(纸本)9781955917711
Ethics is one of the longest standing intellectual endeavors of humanity. In recent years, the fields of AI and NLP have attempted to address ethical issues of harmful outcomes in machine learning systems that are made to interface with humans. One recent approach in this vein is the construction of NLP morality models that can take in arbitrary text and output a moral judgment about the situation described. In this work, we offer a critique of such NLP methods for automating ethical decision-making. Through an audit of recent work on computational approaches for predicting morality, we examine the broader issues that arise from such efforts. We conclude with a discussion of how machine ethics could usefully proceed in NLP, by focusing on current and near-future uses of technology, in a way that centers around transparency, democratic values, and allows for straightforward accountability.
暂无评论