Topic modeling has become essential in a variety of text mining applications, such as document clustering and recommendation systems. This study investigates the potential of BERTopic, a transformer-based method that ...
The penetration of renewable energy with high intermittencies, such as solar power, presents a challenge for the North Sulawesi and Gorontalo power grid. Currently, the largest photovoltaic plant in the region is Liku...
Migrating from Monolithic architecture to Microservices architecture is a major change in how applications are designed, developed, and managed. This paper introduces an innovative approach for Microservices identific...
We consider random simple temporal graphs in which every edge of the complete graph Kn appears once within the time interval [0, 1] independently and uniformly at random. Our main result is a sharp threshold on the si...
In Federated Learning (FL), devices that participate in the training usually have heterogeneous resources, i.e., energy availability. In current deployments of FL, devices that do not fulfill certain hardware requirem...
The advancements in sensing technologies and AI algorithms have opened up a wide range of possibilities for developing applications to meet the needs of individuals who are deaf or hard of hearing. Sign language plays...
The Greek School Network (GSN) provides support to students, teachers, and school units in secondary education across Greece. Handling numerous user queries manually can be challenging, necessitating the development o...
This paper describes the development of a transformer-based text generation model for Nigerian Pidgin, also known as Naijá, a popular language in West Africa. Despite its wide use, Nigerian Pidgin remains under-resourced, particularly in text generation and natural language processing. These difficulties are primarily due to technological constraints rather than the language’s fundamental attributes. Because Nigerian Pidgin is used in everyday communication and has a unique linguistic blend, there is a demand for solutions specific to it. This paper aims to close this gap by applying state-of-the-art transformer technology to develop a text generation model for Nigerian Pidgin. This work uses the sizeable public Afriberta-corpus dataset to optimize the Generative Pre-trained Transformer (GPT-2) model. The evaluation metrics, BLEU and perplexity, provide a detailed breakdown of the model’s text quality and predictive accuracy. Despite the difficulties caused by a limited amount of training data, preliminary evaluations show that the model can generate coherent Nigerian Pidgin text. The performance evaluation yielded perplexity scores of 43.56 for variable target reference length and 43.26 for fixed text length, and BLEU scores of 0.15 for fixed max length and 0.56 for variable reference target length. This highlights the quality of the generated text and the significant improvement when the generated text length is aligned with the reference target. Our work was benchmarked against African American Vernacular English (AAVE), for which BLEU scores are significantly lower than those for Standard American English, with a reported BLEU of 0.26. Our Nigerian Pidgin model, with a BLEU score of 0.56, performs better. However, the two results suggest that both dialects are challenging for language models. Leveraging the pre-trained transformer-based language model and evaluation metrics, we showcase the mo...
Over the past few years, blockchain technology has gained significant attention. This surge in popularity can be attributed to the emergence of cryptocurrencies and the development of smart contracts. Cryptocurrency i...
A Reconfigurable Intelligent Surface (RIS) can significantly enhance network positioning and mapping, acting as an additional anchor point in the reference system and improving signal strength and measurement diversit...