In recent years, the demand for AI-driven conversational agents has increased significantly across various industries. Traditional generative models, while capable of producing human-like responses, often struggle wit...
详细信息
The impressive performance of Large language Model (LLM) has prompted researchers to develop Multi-modal LLM (MLLM), which has shown great potential for various multi-modal tasks. However, current MLLM often struggles...
详细信息
Legal judgment is a decision formally issued by a court as a conclusion to legal proceedings, analyzing the court's findings, reasoning, and rulings on the matters brought before it. This study explores the analys...
详细信息
Dataset pruning aims to select a subset of a dataset for efficient model training. While data efficiency in naturallanguageprocessing has primarily focused on within-corpus scenarios during model pre-training, effic...
Agriculture plays a vital role in the economy of many nations, especially in regions where a significant portion of the population depends on it for their livelihood. Despite its importance, linguistic and technologic...
详细信息
Kolmogorov-Arnold Networks (KAN) is an emerging neural network architecture in machine learning. It has greatly interested the research community about whether KAN can be a promising alternative to the commonly used M...
详细信息
In this work, we present LLM Gesticulator, an LLM-based audio-driven co-speech gesture generation framework that synthesizes full-body animations that are rhythmically aligned with the input audio while exhibiting nat...
详细信息
In practical applications, large images are processed in patches, many of which are simple and smooth, making them suitable for lighter network processing. This paper proposes a hybrid path selection mechanism that en...
详细信息
Obstructive sleep apnea-hypopnea syndrome (OSAHS) is a common sleep disorder caused by upper airway blockage, leading to oxygen deprivation and disrupted sleep. Traditional diagnosis using polysomnography (PSG) is exp...
详细信息
Creating descriptive captions for images is now becoming a mission-critical application area in the intersection of naturallanguageprocessing and computer vision. This work provides the hybrid model VisionGPT2, comb...
详细信息
暂无评论