Coreference resolution aims to identify expressions in a text that refer to the same entity and establish connections between them. This paper presents an improved method for Thai coreference resolution, extending the...
详细信息
ISBN:
(纸本)9798350349122;9798350349115
Coreference resolution aims to identify expressions in a text that refer to the same entity and establish connections between them. This paper presents an improved method for Thai coreference resolution, extending the F-coref architecture with two key enhancements. First, to handle the absence of explicit word boundaries in Thai, a pre-tokenization step is implemented before applying the model tokenizer. This ensures accurate alignment between gold coreference labels and resulting tokens. Second, an improved loss function is proposed to overcome a challenge encountered by F-coref during training. This modification prevents the model from solely optimizing coreference to null spans, ensuring a more balanced training trajectory. empirical evaluations demonstrate the effectiveness of these modifications in boosting the robustness of Thai coreference resolution.
Scientific information extraction (SciIE) is critical for converting unstructured knowledge from scholarly articles into structured data (entities and relations). Several datasets have been proposed for training and v...
详细信息
This paper investigates controllable generation for large language models (LLMs) with prompt-based control, focusing on Lexically Constrained Generation (LCG). We systematically evaluate the performance of LLMs on sat...
详细信息
Despite the remarkable abilities of Large language Models (LLMs) to answer questions, they often display a considerable level of overconfidence even when the question does not have a definitive answer. To avoid provid...
详细信息
The tool-use Large language Models (LLMs) that integrate with external Python interpreters have significantly enhanced mathematical reasoning capabilities for open-source LLMs, while tool-free methods chose another tr...
详细信息
In dialogue, the addressee may initially misunderstand the speaker and respond erroneously, often prompting the speaker to correct the misunderstanding in the next turn with a Third Position Repair (TPR). The ability ...
详细信息
Guiding users through complex procedural plans is an inherently multimodal task in which having visually illustrated plan steps is crucial to deliver an effective plan guidance. However, existing works on plan-followi...
详细信息
Causal reasoning is fundamental to human intelligence and crucial for effective decision-making in real-world environments. Despite recent advancements in large vision-language models (LVLMs), their ability to compreh...
详细信息
Recent large vision-language multimodal models pre-trained with huge amount of image-text pairs show remarkable performances in downstream tasks. However, the multimodal pre-training has limitations in terms of resour...
详细信息
Large language models (LLMs) are typically fine-tuned on diverse and extensive datasets sourced from various origins to develop a comprehensive range of skills, such as writing, reasoning, chatting, coding, and more. ...
详细信息
暂无评论