The inherent semantic relatedness and closeness of near-synonyms pose difficulties to second language (L2) learners in comprehending and applying lexical knowledge in real situations. Previous studies have shown a sop...
ISBN:
(纸本)9789819705856;9789819705863
The inherent semantic relatedness and closeness of near-synonyms pose difficulties to second language (L2) learners in comprehending and applying lexical knowledge in real situations. Previous studies have shown a sophisticated corpus-based method of distinguishing Chinese near-synonyms from a more 'theoretical-based' approach, the application of corpora in learning near-synonyms in an L2 classroom, however, is underexplored. This study reports both the 'theoretical-based' and 'pedagogical-application' of using corpora in studying 'business' related near-synonyms, and a significant gap between the 'theory' and the 'application' is identified. Our findings not only affirm the 'theoretical-based' method in capturing subtle nuances of near-synonyms but suggest implications of 'pedagogical-application' in teaching and learning near-synonyms with corpora.
Different from Putonghua, the verb "huaiyi" 'suspect' in Hong Kongstyle Chinese, can be used in the passive voice without a passive marker. It generally acts as an attributive or a predicate. A compa...
ISBN:
(纸本)9789819705825;9789819705832
Different from Putonghua, the verb "huaiyi" 'suspect' in Hong Kongstyle Chinese, can be used in the passive voice without a passive marker. It generally acts as an attributive or a predicate. A comparison of the Chinese and English versions of the Hong Kong government gazettes shows that "huaiyi" in Hong Kong-style Chinese has multiple correspondences with the English word "suspect". The usage of is influenced by the linguistic and pragmatic factors of English, as well as the phenomenon of the implicit experiencer of Chinese grammar.
Student's problem behaviors are undesirable behaviors encompass actions that deviate from established school standards, potentially impacting students' overall well-being and academic success significantly. Di...
ISBN:
(纸本)9783031643019;9783031643026
Student's problem behaviors are undesirable behaviors encompass actions that deviate from established school standards, potentially impacting students' overall well-being and academic success significantly. Diagnosing these behaviors demands a multidisciplinary understanding, posing a challenge for conventional educators. Capitalizing on the advancements in Large Language Model (LLM) technology, we introduce this PBChat model, a specialized LLM designed for pinpointing problem behaviors. We articulate a theoretical framework for problem behavior diagnosis, laying the conceptual groundwork for PBChat. To train PBChat, we curate a multi-turn dialogue dataset based on annotated cases, and subsequently, fine-tune the ChatGLM2 base model using the QLoRA algorithm to build PBChat model. Experimental assessments gauge the performance of PBChat, with both automated and human evaluations revealing its efficacy in successfully diagnosing problem behaviors, surpassing the capabilities of general LLMs.
The synonym of epistemic "think" in Chinese is understudied. This study surveys the usage of five synonyms of verbs related to "think" - renwei, juede, yiwei, xiang, and kaolu - by using the Academ...
ISBN:
(纸本)9789819705856;9789819705863
The synonym of epistemic "think" in Chinese is understudied. This study surveys the usage of five synonyms of verbs related to "think" - renwei, juede, yiwei, xiang, and kaolu - by using the Academia Sinica Balanced Corpus of Modern Chinese. By studying the contexts of the target verbs, we report the distributions of three types of nouns and two types of adverbs in relation to the target verbs among three genres, and then we discuss their potential effects on epistemic evidentiality. Next, we conduct a survey study to examine native speakers' judgments of the epistemic evidentiality of sentences that contain the five target verbs with the types of nouns and adverbs extracted from the Sinia corpus. The results show that "renwei" has the highest ratings of evidentiality, followed by "juede," while "yiwei" and "xiang" have the lowest mean ratings. "Kaolu" is often regarded as irrelevant or neutral concerning epistemic evidentiality.
We present a procedure to build relatively quickly new resources with annotated named entities and their linking to Wikidata. First, we applied state-of-the-art models for named entity recognition on a sentence-aligne...
ISBN:
(纸本)9783031705625;9783031705632
We present a procedure to build relatively quickly new resources with annotated named entities and their linking to Wikidata. First, we applied state-of-the-art models for named entity recognition on a sentence-aligned parallel English-Czech corpus. We selected the most common entity classes: person, location, organization, and miscellaneous. Second, we manually checked the corpus in a suitably set annotation application. Third, we used a state-of-the-art tool for named entity linking and enhanced the ranking using sentence embeddings obtained by sentence transformers. We then checked manually whether the linking to knowledge bases was correct. As a result, we added two annotation layers to an existing parallel corpus: one with the named entities and one with links to Wikidata. The corpus contains 14,881 parallel Czech-English sentences and 3,769 links to Wikidata. The corpus can be used for training more robust named entity recognition and named entity linking models and for linguistic research of parallel news texts.
This study demonstrates the importance of corpora in the identification of near-synonyms through a comparative analysis of the near-synonyms.. meili 'beautiful' and (sic) piaoliang 'beautiful'. Chinese...
ISBN:
(纸本)9789819705856;9789819705863
This study demonstrates the importance of corpora in the identification of near-synonyms through a comparative analysis of the near-synonyms.. meili 'beautiful' and (sic) piaoliang 'beautiful'. Chinese Word Sketch (CWS) was used to analyse the frequency, common patterns and only patterns of this pair of near-synonyms. Comparison of word meanings, sentence components, and word collocations was carried out. It has been found that when females are the sentence subject, they generally tend to be described as meili. Moreover, when used as a modifier, meili is more suitable than piaoliang in describing nouns that: 1) have broader meanings;2) have a wider range of local words;and 3) can indicate places.
The term "readability" describes how simple it is for a reader to understand a written text. This can be measured with a variety of readability metrics. While some tools exist for assessing the readability o...
ISBN:
(纸本)9783031705625;9783031705632
The term "readability" describes how simple it is for a reader to understand a written text. This can be measured with a variety of readability metrics. While some tools exist for assessing the readability of Slovak texts, no free or open-source tools currently offer this functionality. This article presents an online Python library that uses Mistrik's readability metric for the Slovak language. We developed an open-source library for measuring the readability score of Slovak texts and evaluated the findings from Mistrik's initial investigation approach.
Words in natural language can be assigned to specific morphological categories. For example, the English word 'apples' can be described using morphological labels like N;PL. The conditional probabilities on su...
ISBN:
(纸本)9783031705625;9783031705632
Words in natural language can be assigned to specific morphological categories. For example, the English word 'apples' can be described using morphological labels like N;PL. The conditional probabilities on such word forms given the labels would reveal for English that the morpheme 's' is present almost always when the label N;PL appears. This indicates that the morphological properties of a word can be traced to its morphemes. We do not have any data resource that associates morphemes with morphological categories. We use UniMorph schema and datasets for universal morphological annotation as a source of morphological categories and morpheme segmentation. We align morphemes (or exponents) with the corresponding morphological categories based on the UniMorph schema for 12 languages. Given the multilingual nature of the task, we utilize unsupervised methods based on the Delta P measure and IBM Models as we test out the effectiveness of alignment methods used in statistical machine translation. Our results indicate that IBM Models accurately capture the alignment asymmetries between morphemes and morphological categories under non-trivial alignment settings.
The impact of emotion on prosody in the context of speech communication has yielded inconclusive results when it comes to the prosodic patterns associated with high-arousal emotions of different emotional valences, su...
ISBN:
(纸本)9789819705856;9789819705863
The impact of emotion on prosody in the context of speech communication has yielded inconclusive results when it comes to the prosodic patterns associated with high-arousal emotions of different emotional valences, such as "Happy" and "Anger". To clarify the existing ambiguity, this study utilized an emotional speech database to examine prosodic metrics of multi-word verbal speech. The findings suggest that tempo is more of a reliable measure than pitch in conveying emotional valance. The syllables towards the end of a sentence are the most crucial in conveying valence or size projection, while non-final syllables provide a limited indication of valence or size projection.
With the development of information technologies, ourworld currently faces such an overwhelming mass of neologisms. Therefore, the study of neologisms has become an important research topic in recent years [1]. In thi...
ISBN:
(纸本)9789819705856;9789819705863
With the development of information technologies, ourworld currently faces such an overwhelming mass of neologisms. Therefore, the study of neologisms has become an important research topic in recent years [1]. In this research, we investigate the factors that facilitate the efficient propagation of Chinese neologisms, based on Internet usage data extracted from Google Trends. We collected 342 neologisms from the published authoritative lists and annotated them with eight factors that potentially contribute to their popularity. The empirical findings shed light on the predictive potential of specific factors, such as the topic, syntactic type, length, and semantic polarity of the neologisms. Our investigation further assigns weight to each factor, revealing that syntactic type and semantic polarity of the neologisms exert more pronounced effects on their developments.
暂无评论