Large-scale unlabeled data has spurred recent progress in self-supervised learning methods for learning rich visual representations. Masked autoencoders (MAE), a recently proposed self-supervised method, has exhibited...
详细信息
Dialect identification presents a challenging task within the realm of naturallanguageprocessing. The growing utilization of Arabic dialects in written form, particularly on social media platforms, has created new d...
详细信息
Machine translation (MT) is a maj or naturallanguageprocessing subfield that is designed to automatically translate human spoken languages. Neural machine translation (NMT) has gained significant achievement in rece...
详细信息
The study of Tibetan function words is an indispensable basic work in Tibetan naturallanguageprocessing and has a wide range of practical application value. It is the core of Tibetan information processing and the b...
ISBN:
(纸本)9798400709241
The study of Tibetan function words is an indispensable basic work in Tibetan naturallanguageprocessing and has a wide range of practical application value. It is the core of Tibetan information processing and the basis of Tibetan naturallanguage understanding, and has wide application prospects in Tibetan proofreading, information retrieval, bilingual translation, automatic classification and other technologies. On the basis of previous studies, this paper discusses the grammatical functions of Tibetan function words la dhon from the perspective of information processing, classification system and formal description of unfree function words, focuses on the grammatical functions of Tibetan function words la dhon, and puts forward the preliminary conception and processingmethods of Tibetan function words.
Organizations are increasingly adopting chatbots to deliver enterprise-specific search services, providing conversational and contextually appropriate naturallanguage responses to user queries. To meet complex demand...
详细信息
ISBN:
(数字)9798350354973
ISBN:
(纸本)9798350354980
Organizations are increasingly adopting chatbots to deliver enterprise-specific search services, providing conversational and contextually appropriate naturallanguage responses to user queries. To meet complex demands and enhance operational efficiency, while overcoming the limitations of traditional retrieval methods, we present an advanced approach for developing a Large language Model (LLM)-Powered Enterprise Conversational Search Framework (LECS). This framework employs a sophisticated mixedgranularity chunking strategy for processing both short and long texts, an advanced question-rewriting mechanism, and a comprehensive multi-source ranking system. Extensive experiments have demonstrated the superior performance of LECS, particularly in terms of recall@5 and answer accuracy.
Sentiment analysis is one of the most widely studied applications in NLP, but most work focuses on languages with large amounts of data. We introduce the first large-scale human-annotated Twitter sentiment dataset for...
详细信息
ISBN:
(纸本)9791095546726
Sentiment analysis is one of the most widely studied applications in NLP, but most work focuses on languages with large amounts of data. We introduce the first large-scale human-annotated Twitter sentiment dataset for the four most widely spoken languages in Nigeria-Hausa, Igbo, Nigerian-Pidgin, and Yoruba-consisting of around 30,000 annotated tweets per language, including a significant fraction of code-mixed tweets. We propose text collection, filtering, processing, and labeling methods that enable us to create datasets for these low-resource languages. We evaluate a range of pre-trained models and transfer strategies on the dataset. We find that language-specific models and language-adaptive fine-tuning generally perform best. We release the datasets, trained models, sentiment lexicons, and code to incentivize research on sentiment analysis in under-represented languages.
In this world, communication plays a major role to interact with each other. It acts like a bridge to convey message to other people. Human beings are able to manage their tasks by making use of their five senses, in ...
详细信息
Blind image super-resolution is gaining increasing attention due to its significant practical implications. Poor model generalization results from the assumption that low-resolution (LR) images are the consequence of ...
详细信息
ISBN:
(数字)9798350349115
ISBN:
(纸本)9798350349122
Blind image super-resolution is gaining increasing attention due to its significant practical implications. Poor model generalization results from the assumption that low-resolution (LR) images are the consequence of bicubic downsampling of high-resolution (HR) images, which is made by a number of existing super-resolution methods. We employ a degradation model similar to BSRGAN to synthesize LR images, making them appear more realistic, and then treat the blind image super-resolution problem as a domain translation issue, utilizing diffusion models for resolution enhancement.
Identifying synergistic drug combinations is paramount significance in addressing complex diseases while reducing the risks of toxicities and other adverse effects. Although a plethora of computational methods have be...
详细信息
The increasing complexity of medical documentation necessitates effective summarization tools that enable healthcare professionals to swiftly access critical patient information. This study investigates the efficacy o...
详细信息
暂无评论