In this work, we employ a semi-automatic method based on back translation to generate a sentential paraphrase corpus for the Armenian language. The initial collection of sentences is translated from Armenian to Englis...
详细信息
The article presents the results of a study of the field material of the Purovsky dialect of the Forest Nenets language. His experimental phonological analysis was conducted using the LingvoDoc data processing algorit...
详细信息
The Upper Kama dialect is spoken by the Komi ethnic group residing in the upper Kama River region. The idiom developed independently from the area of distribution of Komi dialects and existed solely in oral form. In K...
详细信息
The question of accurate classification of a language or a dialect becomes particularly relevant when its speakers, due to migrations, find themselves in isolated regions. Geographically distant from their original pl...
详细信息
ISBN:
(纸本)9798350349993
The question of accurate classification of a language or a dialect becomes particularly relevant when its speakers, due to migrations, find themselves in isolated regions. Geographically distant from their original places of residence. This prompts the inquiry: do these dialects more often succumb to the influence of nearby languages or do they, conversely, 'preserve' themselves with minimal changes? Within the scope of the current research we have aimed to answer this question for a unique Stavropol dialect of the Estonian language, that has been isolated near northern Caucasus for almost 200 years. In 2020 we have recorded a full dictionary of the supposed dialect, containing 2583 entries. Through the examination of this material from three dictionaries using the 'Cognate analysis' (It calculates the phonetic disparities in etymologically related words on LingvoDoc) and the 'Glottochronological Analysis of Dialects' programs on LingvoDoc we've established that Thus, the language of the Podgornoye village exhibits both innovative reflexes (loss of the initial h) and archaic reflexes in consonants. In addition, with regard to vowels, it displays both more archaic features (retention of diphthongs uo and ie) and innovative reflexes (for Estonian diphthongs ea and ó ó). This corroborates with our the hypothesis regarding the 'conservation' of the Stavropol dialect of the Estonian language when compared to standard Estonian. Consequently, on the etymological-phonetic proximity graph, the language of the Podgornoye village is closer to Finnish than Estonian. Thus the Estonian dialect of the village of Podgornoye serves as an example of almost complete 'conservation' after the migration of its speakers to the North Caucasus, where the unique features of Northern Estonian dialects have been preserved through active language use by its speakers. In conclusion, we can see that the lists of basic vocabulary of the dialect of the village of Podgornoye and of the literary Esto
In this work, we tackle the problem of Armenian named entity recognition, providing silver- and gold-standard datasets as well as establishing baseline results on popular models. We present a 163000-token named entity...
详细信息
A numerical and experimental study has been conducted to explore the influence of vertical wall on the wake of the non/ducted propeller. The numerical simulation is achieved using a structured and transient sliding me...
详细信息
In this work, we employ a semi-automatic method based on back translation to generate a sentential paraphrase corpus for the Armenian language. The initial collection of sentences is translated from Armenian to Englis...
详细信息
ISBN:
(数字)9781728190884
ISBN:
(纸本)9781728190891
In this work, we employ a semi-automatic method based on back translation to generate a sentential paraphrase corpus for the Armenian language. The initial collection of sentences is translated from Armenian to English and back twice, resulting in pairs of lexically distant but semantically similar sentences. The generated paraphrases are then manually reviewed and annotated. Using the method train and test datasets are created, containing 2360 paraphrases in total. In addition, the datasets are used to train and evaluate BERT-based models for detecting paraphrase in Armenian, achieving results comparable to the state-of-the-art of other languages.
In this work, we employ a semi-automatic method based on back translation to generate a sentential paraphrase corpus for the Armenian language. The initial collection of sentences is translated from Armenian to Englis...
详细信息
In this work, we intrinsically and extrinsically evaluate and compare existing word embedding models for the Armenian language. Alongside, new embeddings are presented, trained using GloVe, fastText, CBOW, SkipGram al...
详细信息
The article presents the results of a study of the field material of the Purovsky dialect of the Forest Nenets language. His experimental phonological analysis was conducted using the LingvoDoc data processing algorit...
The article presents the results of a study of the field material of the Purovsky dialect of the Forest Nenets language. His experimental phonological analysis was conducted using the LingvoDoc data processing algorithms. A comparison of the vowel sound system obtained as a result of this analysis with other phonetic systems of the Forest Nenets language proposed by linguists of the XX-XXI centuries, and its four separate native speakers, allowed us to conclude that this dialect is unique and further research is necessary, taking into account data on contact languages for the Forest Nenets.
暂无评论