Natural Language Processing (NLP) has entered a new era with the advent of pre-trained language models, which pave the way for constructing robust language models. Pretrained transformer-based models such as GPT-2 have become prevalent due to their cutting-edge efficiency. However, these approaches rely heavily on high-resource languages, forcing other languages to adopt multilingual frameworks such as mGPT. The mGPT model might be expected to perform well for low-resource languages such as Bangla because it has been trained on a diverse dataset spanning multiple languages. However, recent studies show that language-specific GPT models outperform the multilingual mGPT model. In this research, we propose a pretrained monolingual GPT model called BanglaGPT, trained with the causal language modeling (CLM) objective. Because large datasets for NLP tasks in Bangla are scarce, we have created a Bangla language model dataset called BanglaCLM from a 26.24 GB Bangla corpus scraped from several public websites. We use a subword-based tokenization algorithm, Byte-Pair Encoding (BPE), for Bangla and train the Bangla-GPT2 model from scratch on the BanglaCLM dataset. Our pretrained BanglaGPT provides state-of-the-art performance for Bangla text generation, with a perplexity score of 2.86 and a loss score of 0.45 on the test set.
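The subword tokenization step mentioned in the abstract above can be illustrated with a toy sketch of how BPE learns merge rules. This is not the authors' tokenizer: it is a minimal character-level version that assumes a small in-memory word list, and the function names `merge_pair`, `learn_bpe`, and `encode` are hypothetical. A production tokenizer for Bangla would typically work on bytes or normalized Unicode and on a far larger corpus.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out, i = [], 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merge rules from a list of words (toy sketch)."""
    # Each distinct word becomes a tuple of single-character symbols with its frequency.
    vocab = Counter(tuple(w) for w in words)
    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        # Pick the most frequent pair; ties are broken by the pair itself
        # so that the result is deterministic.
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        # Apply the new merge to every word in the vocabulary.
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[merge_pair(symbols, best)] += freq
        vocab = new_vocab
    return merges


def encode(word, merges):
    """Tokenize a word by applying the learned merges in the order they were learned."""
    symbols = tuple(word)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)
```

Calling `learn_bpe` on a word list and passing the resulting merges to `encode` splits unseen words into the learned subword units, which is what lets a fixed vocabulary cover rare or novel words.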
In this study, we demonstrate a graphene-based portable gas sensor with an integrated fused silica micro chamber for direct on-chip sample injection to detect a low concentration (~10 ppm) of acetone from a soil sampl...
ISBN: (digital) 9798350306446
ISBN: (print) 9798350306453
Smartphones are increasingly vital to people's daily lives. Owing to technological advancements, phones are used in all aspects of life, from personal to professional. A phone serves functions beyond making calls: it provides internet connectivity and lets users read email when they are away from a computer. The characteristics of a mobile phone are a crucial consideration when buying one. The overall objective of this research is to find the best way to apply machine learning to estimate the retail price of smartphones from their individual specifications. People who use their phones frequently pay closer attention to feature selection. When purchasing a cell phone, buyers compare models by price-performance ratio, where the phone's features are regarded as its performance. This research aims to forecast whether mobile phones with certain features are considered economical or ***. This work can be used in various marketing and business contexts to support informed purchasing decisions that maximize features while minimizing cost.
The primary purpose of the software industry is to provide high-quality software. Software system failure is caused by faulty software components. The goal of reliable software is to reduce the amount of software prog...
The recognition of handwritten digits has been among the most enduring fundamental problems explored in the field of machine learning and computer vision. The objective of this work is to design a state-of-the-art Convolu...
ISBN: (digital) 9798350366570
ISBN: (print) 9798350366587
Judging the nature of a text from its sentences requires many parameters to be considered. The data must be organized into datasets according to specific constraints. In Natural Language Processing (NLP), text readability is treated as an important measure for the wide circulation of any newspaper. The data related to these documents must be collected and stored in datasets. Measuring text readability at the granular level is a cumbersome task, and it raises problems of both machine readability and explainability for text taken from such documents. In this context, human readability plays an important role in predicting readability at the granular level, where text-based sentiment is predicted by a machine learning model. Readability is evaluated so that it can be improved through an explainable system, which captures and explains the readability predictions. This work introduces an explainable system, Text Filtering Insignificant Word (TFIW), which improves readability at the granular level based on sentiment by predicting the words that need to be filtered. This improves the readability and explainability of the text and makes the document faster to read and easier to explain.
Agricultural productivity is a factor on which many countries' economies heavily rely. Identifying plant diseases is extremely crucial in the agricultural sector as they can hamper the plant's robustness ...
Text extraction from scene images has gained a lot of traction in the computer vision field in recent years, as its applications are manifold. One of its sub-categories is scene text detection. Factors like com...
This paper presents a novel RF-panel to monitor multiple human physiological parameters non-invasively and accurately without requiring any complex arrangements. The proposed panel consists of three resonators coupled...
In order to segregate liver tumours in medical imaging applications, a novel architecture called Selective Attention UNet is proposed in this article. The suggested architecture, which is based on the well-known UNet ...