In this work, we employ a semi-automatic method based on back translation to generate a sentential paraphrase corpus for the Armenian language. The initial collection of sentences is translated from Armenian to Englis...
详细信息
ISBN:
(数字)9781728190884
ISBN:
(纸本)9781728190891
In this work, we employ a semi-automatic method based on back translation to generate a sentential paraphrase corpus for the Armenian language. The initial collection of sentences is translated from Armenian to English and back twice, resulting in pairs of lexically distant but semantically similar sentences. The generated paraphrases are then manually reviewed and annotated. Using the method train and test datasets are created, containing 2360 paraphrases in total. In addition, the datasets are used to train and evaluate BERT-based models for detecting paraphrase in Armenian, achieving results comparable to the state-of-the-art of other languages.
In this work, we employ a semi-automatic method based on back translation to generate a sentential paraphrase corpus for the Armenian language. The initial collection of sentences is translated from Armenian to Englis...
详细信息
In this work, we intrinsically and extrinsically evaluate and compare existing word embedding models for the Armenian language. Alongside, new embeddings are presented, trained using GloVe, fastText, CBOW, SkipGram al...
详细信息
We study the adaptation of an optimistic time warp kernel to cross-cluster computing on the grid. Wide area communication, the primary source of overhead, is off-loaded onto dedicated routing processes. This allows th...
详细信息
ISBN:
(纸本)9780769521114
We study the adaptation of an optimistic time warp kernel to cross-cluster computing on the grid. Wide area communication, the primary source of overhead, is off-loaded onto dedicated routing processes. This allows the simulation processes to run at full speed and it thus significantly decreases the performance gap caused by the wide area distribution. Further improvements are obtained by employing message aggregation on the wide area links. We achieve many of our objectives for lazy cancellation and moderate communication, but high communication rates with aggressive cancellation remains a challenge.
The article presents the results of a study of the field material of the Purovsky dialect of the Forest Nenets language. His experimental phonological analysis was conducted using the LingvoDoc data processing algorit...
The article presents the results of a study of the field material of the Purovsky dialect of the Forest Nenets language. His experimental phonological analysis was conducted using the LingvoDoc data processing algorithms. A comparison of the vowel sound system obtained as a result of this analysis with other phonetic systems of the Forest Nenets language proposed by linguists of the XX-XXI centuries, and its four separate native speakers, allowed us to conclude that this dialect is unique and further research is necessary, taking into account data on contact languages for the Forest Nenets.
Explorative research is a vital part of biological sciences. Biologists frequently have to examine and compare multiple (large) sets of biological data in an interactive and explorative manner. Exploring alternative w...
详细信息
We simulate time harmonic flows by the lattice Boltzmann method. We propose a general scheme to choose simulation parameters, under the constraints of fixed Reynolds and Womersley numbers, and with a specified simulat...
详细信息
We simulate time harmonic flows by the lattice Boltzmann method. We propose a general scheme to choose simulation parameters, under the constraints of fixed Reynolds and Womersley numbers, and with a specified simulation error. Under these constraints parameters are chosen to minimize the execution time. Numerical stability is studied in a range of Reynolds and Womersley numbers. As an example of time harmonic flow simulations, results of blood flow in a human abdominal aorta are presented.
This paper introduced the optimization and deoptimization technologies for Escape analysis in open world. These technologies are used in a novel Escape analysis framework that has been implemented in Open runtime plat...
详细信息
This paper introduced the optimization and deoptimization technologies for Escape analysis in open world. These technologies are used in a novel Escape analysis framework that has been implemented in Open runtime platform, Intel's open-source Java virtual machine. We introduced the optimization technologies for synchronization removal and object stack allocation, as well as the runtime deoptimization and compensation work. The deoptimization and compensation technologies are crucial for a practical Escape analysis in open world. We evaluated the runtime efficiency of the deoptimization and compensation work on benchmarks like SPECjbb2000 and SPECjvm98.
The Upper Kama dialect is spoken by the Komi ethnic group residing in the upper Kama River region. The idiom developed independently from the area of distribution of Komi dialects and existed solely in oral form. In K...
The Upper Kama dialect is spoken by the Komi ethnic group residing in the upper Kama River region. The idiom developed independently from the area of distribution of Komi dialects and existed solely in oral form. In Komi linguistics, there is debate regarding the position of the Upper Kama idiom within the dialectal area of the Komi language. The paper presents the main stages of data processing for four Komi-Permyak dictionaries, including two archival dictionaries from the turn of the 19th-20th centuries, two audio dictionaries of the southern Komi-Zyryan dialects, and an audio dictionary of the Upper Kama dialect uploaded on the LingvoDoc platform. The study found that, based on phonetic-etymological criteria, the Upper Kama dialect is the most similar to the modern Upper Sysola dialect of the Komi-Zyryan language. However, when analyzing the similarity of the basic vocabulary, the dialect in question is most similar to the Kudymkar dialect of the southern Komi-Permyak language. This contradiction may explain the numerous discussions about the position of the Upper Kama dialect in the Komi language dialectal area.
In object-oriented programming languages, objects with polymorphic attributes can negatively impact performance and hinder static analysis. These attributes require dynamic dispatch, which is slower than static bindin...
详细信息
ISBN:
(纸本)9798400708688
In object-oriented programming languages, objects with polymorphic attributes can negatively impact performance and hinder static analysis. These attributes require dynamic dispatch, which is slower than static binding, and complicate the analysis process. We propose a novel algorithm for object specialization that addresses this issue by replacing polymorphic attributes with monomorphic ones, resulting in improved performance and simplified static analysis. Our algorithm is a new approach compared to existing function specialization algorithms. We provide a proof of the algorithm’s soundness and correctness, and present an implementation of the algorithm as a software tool. Empirical evaluation shows that our approach achieves significant improvements in performance and simplifies the static analysis process. Our algorithm can be applied to a variety of object-oriented languages such as Java and Python.
暂无评论