Color words serve as the linguistic embodiment of color usage, forming a subsystem of the human vocabulary that is intimately tied to production, life, and emotions. These words carry rich cultural connotations, natio...
ISBN:
(纸本)9789819705856;9789819705863
Color words serve as the linguistic embodiment of color usage, forming a subsystem of the human vocabulary that is intimately tied to production, life, and emotions. These words carry rich cultural connotations, national characteristics, and reflect the zeitgeist and cultural tendencies. In this paper, we amalgamate Chinese and foreign color theories to analyze the usage, distribution, coloring domains, and co-occurring words of Chinese color words. This analysis is based on data from Chinese newspapers spanning from the Foreign Affairs Movement of the late Qing Dynasty to the present. We map the distribution of color words in Chinese written language over 1872-2015. Our study reveals that different eras are characterized by different theme colors, and colorwords evolve in tandem with societal development and the enhancement of people's understanding. The scope, variety, and connotation of color words continue to expand, leading to increasingly rich chromatograms. The coloring domains of color words are becoming more extensive, and their connections with objective entities are becoming more abundant. The usage and evolution of color words are closely linked to policy changes, the social and historical environment, and cultural development. They exhibit a sensitivity to the times, and the national cultural psychology and social customs underlying the color words are also changing. Color words are increasingly dissociating from religion and regime, finding widespread usage in political, economic, and cultural fields, and their importance is steadily increasing.
Text-to-speech (TTS) research has made significant progress in achieving human-like interpretability. Yet, a noticeable research gap persists regarding emotional expressiveness in synthesized speech. While some existi...
ISBN:
(纸本)9783031705656;9783031705663
Text-to-speech (TTS) research has made significant progress in achieving human-like interpretability. Yet, a noticeable research gap persists regarding emotional expressiveness in synthesized speech. While some existing studies address this through techniques such as voice conversion and textual context modeling, precise control over emotion composition in speech synthesis remains challenging. Moreover, those techniques typically operate at sentence-level, precluding fine-grained control over emotion transitions. Emotion composition in utterances is multifaceted, dictated by a multitude of linguistic and acoustic features;and among these features, pitch plays a crucial role, with distinct pitch contours often signaling specific emotions at different points in a spoken sentence. Aiming for more controllable emotional speech synthesis, we propose PiCo-VITS, an end-to-end TTS model architecture that leverages pitch contours in conjunction with latent features. Experimental results demonstrate the efficacy of the proposed model in synthesizing speech that conveys mixed emotions. Notably, the model allows both the desired emotions and the emotion transition patterns to be specified, while maintaining intelligibility comparable to state-of-the-art techniques.
Mandarin alphabetical words (MAWs) refer to the code-mixing of Romanized letters and characters such as X(sic) 'X-ray' in the Mandarin lexicon. Previous studies have mainly focused on MAWs' formation but l...
ISBN:
(纸本)9789819705825;9789819705832
Mandarin alphabetical words (MAWs) refer to the code-mixing of Romanized letters and characters such as X(sic) 'X-ray' in the Mandarin lexicon. Previous studies have mainly focused on MAWs' formation but lacked empirical evidence regarding their morpho-syntactic behaviours. Classifiers have been used to infer nominals' semantic properties and characteristics. An intriguing yet less explored issue is the classifier-selection pattern of MAWs and MAWs' morphosyntactic idiosyncrasies. We adopt a corpus-based approach to handle this issue. Assuming that a MAW's classifier is motivated by the head of that MAW, we hypothesize that when aMAWis integrated into theMandarin lexicon, its dominant classifiers will be the semantically more specific ones and not the neutral classifier. We show that MAWs share a dominant compounding structure in Mandarin and that MAW's classifier is decided by that head even when the head is represented by alphabets.
In recent studies, it has been demonstrated that incorporating diverse training datasets enhances the overall knowledge and generalization capabilities of large-scale language models, especially in cross-domain scenar...
ISBN:
(纸本)9783031705625;9783031705632
In recent studies, it has been demonstrated that incorporating diverse training datasets enhances the overall knowledge and generalization capabilities of large-scale language models, especially in cross-domain scenarios. In line with this, we introduce Bella Turca: a comprehensive Turkish text corpus, totaling 265GB, specifically curated for training language models. Bella Turca encompasses 25 distinct subsets of 4 genre, carefully chosen to ensure diversity and high quality. While Turkish is spoken widely across three continents, it suffers from a dearth of robust data resources for language modelling. Existing transformers and language models have primarily relied on repetitive corpora such as OSCAR and/or Wiki, which lack the desired diversity. Our work aims to break free from this monotony by introducing a fresh perspective to Turkish corpora resources. To the best of our knowledge, this release marks the first instance of such a vast and diverse dataset tailored for the Turkish language. Additionally, we contribute to the community by providing the code used in the dataset's construction and cleaning, fostering collaboration and knowledge sharing.
Airport terminals are complex sociotechnical systems, in which humans interact with diverse technical systems. A natural way to represent them is through agent-based modeling. However, this method has two drawbacks: i...
ISBN:
(纸本)9783031610332;9783031610349
Airport terminals are complex sociotechnical systems, in which humans interact with diverse technical systems. A natural way to represent them is through agent-based modeling. However, this method has two drawbacks: it entails a heavy computational burden and the emergent properties are often difficult to analyze. The purpose of our research is therefore to accurately abstract and explain the dynamics of airport terminal operations by means of computationally efficient and interpretable surrogate models, based on an existing agent-based simulation model. We propose a methodology consisting of two stages. Stage I involves the development of faithful surrogates. A sample is collected according to an active learning strategy, upon which Gaussian process regression, higher-order polynomials, gradient boosting, and random forests are fitted. Stage II then applies state-of-the-art techniques from the emerging field of explainable artificialintelligence to these models. Both model-agnostic and model-specific methods are considered, and their results are synthesized in order to explain the emergent properties. We prove the efficacy of this approach by conducting two case studies on AATOM, an existing Agent-based Airport Terminal Operations Model. Altogether, we clearly observed the preservation of emergent phenomena in surrogate models, and conclude that their combination with interpretable machine learning is an effective way to explain the dynamics of complex sociotechnical systems.
The advanced capabilities of large language models can also be seen in their increasing use in the automatic generation of programming code. Although models are generally getting better and better, there are very few ...
ISBN:
(纸本)9783031705625;9783031705632
The advanced capabilities of large language models can also be seen in their increasing use in the automatic generation of programming code. Although models are generally getting better and better, there are very few metrics that can be used to evaluate the quality of the generated code. In particular, this evaluation becomes challenging without dependence on good reference data in the form of tests or alternative solutions. In this paper, we explore both existing and new approaches to evaluate generated python code. These approaches can be classified into two categories: similarity-based and reference independent. The similarity-based approaches involve examining the code's syntax-tree structure and embeddings and comparing them to reference code from the dataset. On the other hand, the reference independent approaches utilize static code analysis metrics used to assess human-written code. These metrics include maintainability and adherence to style guidelines. We examine these metrics on the example of several state-of-the-art code generation models to test their validity. Based on our results, the independent metrics seem to be the most promising approaches for future research.
In accordance with Pustejovsky's proposed Semantic Generation Mechanisms, this research aims to investigate the principles governing semantic collocations related to mental verbs and the generative mechanisms of p...
ISBN:
(纸本)9789819705825;9789819705832
In accordance with Pustejovsky's proposed Semantic Generation Mechanisms, this research aims to investigate the principles governing semantic collocations related to mental verbs and the generative mechanisms of polysemy. The research compiles lexical representations of select mental verbs and conducts a comprehensive analysis of the polysemy and categorizations associated with these verbs, with the intent of establishing guidelines for semantic generation from the collocations of mental verbs. When the argument structures of patient nouns partially align with those of these mental verbs, they can be effectively matched. In situations where different telic roles and agentive roles of the patient nouns are equally significant, polysemy may manifest within the collocations of these mental verbs.
Developing questions that are pedagogically sound, relevant, and promote learning is a challenging and time-consuming task for educators. Modern-day large language models (LLMs) generate high-quality content across mu...
ISBN:
(纸本)9783031642982;9783031642999
Developing questions that are pedagogically sound, relevant, and promote learning is a challenging and time-consuming task for educators. Modern-day large language models (LLMs) generate high-quality content across multiple domains, potentially helping educators to develop high-quality questions. Automated educational question generation (AEQG) is important in scaling online education catering to a diverse student population. Past attempts at AEQG have shown limited abilities to generate questions at higher cognitive levels. In this study, we examine the ability of five state-of-the-art LLMs of different sizes to generate diverse and high-quality questions of different cognitive levels, as defined by Bloom's taxonomy. We use advanced prompting techniques with varying complexity for AEQG. We conducted expert and LLM-based evaluations to assess the linguistic and pedagogical relevance and quality of the questions. Our findings suggest that LLMs can generate relevant and high-quality educational questions of different cognitive levels when prompted with adequate information, although there is a significant variance in the performance of the five LLMs considered. We also show that automated evaluation is not on par with human evaluation.
Teamwork is widely used in higher education for its learning benefits, but students often face issues while working in teams. A peer assessment tool can detect such issues and simultaneously provide support: this coul...
ISBN:
(纸本)9783031642982;9783031642999
Teamwork is widely used in higher education for its learning benefits, but students often face issues while working in teams. A peer assessment tool can detect such issues and simultaneously provide support: this could be done by adding a virtual agent throughout the survey, but what should this agent say? This research describes two studies to inform the design of such an agent. We investigate the adaptation of support statements to a student filling in a peer assessment tool. This adaptation is based on the rater's Emotional Stability and Conscientiousness, and the score assigned to a teammate's quality of contribution. This adaptation is summarized in an algorithm for a peer assessment tool.
Voice disorders like dysphonia can significantly impact a person's quality of life, so proper diagnostic methods are crucial. Previous approaches have primarily used datasets of a single language without consideri...
ISBN:
(纸本)9783031705656;9783031705663
Voice disorders like dysphonia can significantly impact a person's quality of life, so proper diagnostic methods are crucial. Previous approaches have primarily used datasets of a single language without considering language independence. This study investigates the effectiveness of self-supervised (SS) speech representation models in both mono- and cross-lingual settings to determine their ability to perform language-independent dysphonia detection. Four recent SS models, namely Wav2vec2.0, WavLM, HuBert, and Data2vec, in their large and base variations, were examined for their ability to capture speech features related to dysphonia. The findings suggest that larger variants of SS models generally outperform smaller ones, with the HuBert and WavLM large models achieving an accuracy of 93.06% and 91.67% in mono-lingual experiments, respectively. Additionally, the study explored cross-lingual capabilities and found that, except for Wav2vec2.0, base variations of SS models exhibited higher accuracies. The highest accuracy achieved in the cross-lingual case was 88.33% by the Wav2vec2.0 model when algorithms were trained on Hungarian samples and tested in the Dutch language. These results highlight the potential of SS models for language-dependent and independent dysphonia detection.
暂无评论