In the domain of question-answering in NLP, the retrieval of Frequently Asked Questions (FAQ) is an important sub-area which is well researched and has been worked upon for many languages. Here, in response to a user ...
ISBN:
(纸本)9783031333798;9783031333804
In the domain of question-answering in NLP, the retrieval of Frequently Asked Questions (FAQ) is an important sub-area which is well researched and has been worked upon for many languages. Here, in response to a user query, a retrieval system typically returns the relevant FAQs from a knowledge-base. The efficacy of such a system depends on its ability to establish semantic match between the query and the FAQs in real-time. The task becomes challenging due to the inherent lexical gap between queries and FAQs, lack of sufficient context in FAQ titles, scarcity of labeled data and high retrieval latency. In this work, we propose a bi-encoder-based query-FAQ matching model that leverages multiple combinations of FAQ fields (like, question, answer, and category) both during model training and inference. Our proposed Multi-Field Bi-Encoder (MFBE) model benefits from the additional context resulting from multiple FAQ fields and performs well even with minimal labeled data. We empirically support this claim through experiments on proprietary as well as open-source public datasets in both unsupervised and supervised settings. Our model achieves around 27% and 23% better top-1 accuracy for the FAQ retrieval task on internal and open datasets, respectively over the best performing baseline.
Text generation models are widespread nowadays, there are more and more use cases for them. The majority of the most prominent and efficient, in terms of output quality, ones require vast computational resources for t...
ISBN:
(纸本)9783031429347;9783031429354
Text generation models are widespread nowadays, there are more and more use cases for them. The majority of the most prominent and efficient, in terms of output quality, ones require vast computational resources for training and inference, which becomes an obstacle for their practical usage. However, even for big and powerful architectures, the texts they generate often lack coherence, logical links can be broken, which makes texts less readable and less useful. This paper presents a method for increasing coherence of text generated by neural network models and we emphasize the importance of languages' nature analysis prior to building new generation methods. We have analysed the existing decoding methods and have built the mechanism for maximizing coherence of the output sequences into some of them.
The recently developed transformer has been largely explored in the research field of computer vision and especially improve the performance of single object tracking. However, the majority of current efforts concentr...
ISBN:
(纸本)9783031509582;9783031509599
The recently developed transformer has been largely explored in the research field of computer vision and especially improve the performance of single object tracking. However, the majority of current efforts concentrate on combining and enhancing convolutional neural network (CNN)-generated features and cannot fully excavating the potential of transformer. Motivated by this, we introduce multi-granularity theory into the pure transformer-based single object tracker and design a multi-granularity feature fusion module. With a view to fuse the feature of different granularity and enhance the feature representation, we design the double-branch transformer feature extractor and utilize cross-attention mechanism to fuse the feature. In our extensive experiments on multiple tracking benchmarks, including OTB2015, VOT2020, TrackingNet, GOT-10k, LaSOT, our proposed method named MGTT, the results could demonstrate that the proposed tracker achieves better performance than multiple state-of-the-art trackers.
Class-conditional label noise characterizes classification tasks in which the training set labels are randomly flipped versions of the actual ground-truth. The analysis of telescope data in astroparticle physics poses...
ISBN:
(数字)9783031434273
ISBN:
(纸本)9783031434266;9783031434273
Class-conditional label noise characterizes classification tasks in which the training set labels are randomly flipped versions of the actual ground-truth. The analysis of telescope data in astroparticle physics poses this problem with a novel condition: one of the class-wise label flip probabilities is known while the other is not. We address this condition with an objective function for optimizing the decision thresholds of existing classifiers. Our experiments on several imbalanced data sets demonstrate that accounting for the known label flip probability substantially improves the learning outcome over existing methods for learning under class-conditional label noise. In astroparticle physics, our proposal achieves an improvement in predictive performance and a considerable reduction in computational requirements. These achievements are a direct result of our proposal's ability to learn from real telescope data, instead of relying on simulated data as is common practice in the field.
The accurate prediction of legal case outcomes is crucial for effective legal advocacy, which relies on a deep understanding of past cases. Our research aims to develop an automated chatbot for predicting the outcomes...
ISBN:
(纸本)9789819958337;9789819958344
The accurate prediction of legal case outcomes is crucial for effective legal advocacy, which relies on a deep understanding of past cases. Our research aims to develop an automated chatbot for predicting the outcomes of employment-related legal cases using deep learning techniques. We compare and significantly improve on mining the New Zealand Employment Relations Authority (NZERA) dataset, using various deep learning models such as Latent Dirichlet Allocation (LDA) with different activation functions of Recurrent Neural Network (RNN) to determine their predictive performance. Our study's findings show that SoftSign-based RNN-LDA models have the highest accuracy and consistency in predicting outcomes.
Selective thinning is a crucial operation to reduce forest ignitable material, to control the eucalyptus species and maximise its profitability. The selection and removal of less vigorous stems allows the remaining st...
ISBN:
(纸本)9783031490101;9783031490118
Selective thinning is a crucial operation to reduce forest ignitable material, to control the eucalyptus species and maximise its profitability. The selection and removal of less vigorous stems allows the remaining stems to grow healthier and without competition for water, sunlight and nutrients. This operation is traditionally performed by a human operator and is time-intensive. This work simplifies selective thinning by removing the stem selection part from the human operator's side using a computer vision algorithm. For this, two distinct datasets of eucalyptus stems (with and without foliage) were built and manually annotated, and three Deep Learning object detectors (YOLOv5, YOLOv7 and YOLOv8) were tested on real context images to perform instance segmentation. YOLOv8 was the best at this task, achieving an Average Precision of 74% and 66% on non-leafy and leafy test datasets, respectively. A computer vision algorithm for automatic stem selection was developed based on the YOLOv8 segmentation output. The algorithm managed to get a Precision above 97% and a 81% Recall. The findings of this work can have a positive impact in future developments for automatising selective thinning in forested contexts.
In this paper, we propose two-sorted modal logics for the representation and reasoning of concepts arising from rough set theory (RST) and formal concept analysis (FCA). These logics are interpreted in two-sorted bidi...
ISBN:
(纸本)9783031509582;9783031509599
In this paper, we propose two-sorted modal logics for the representation and reasoning of concepts arising from rough set theory (RST) and formal concept analysis (FCA). These logics are interpreted in two-sorted bidirectional frames, which are essentially formal contexts with converse relations. The logic KB contains ordinary necessity and possibility modalities and can represent rough set-based concepts. On the other hand, the logic KF has window modality that can represent formal concepts. We study the relationship between KB and KF by proving a correspondence theorem. It is then shown that, using the formulae with modal operators in KB and KF, we can capture formal concepts based on RST and FCA and their lattice structures.
Counterfactual explanations have become a mainstay of the XAI field. This particularly intuitive statement allows the user to understand what small but necessary changes would have to be made to a given situation in o...
ISBN:
(纸本)9783031434174;9783031434181
Counterfactual explanations have become a mainstay of the XAI field. This particularly intuitive statement allows the user to understand what small but necessary changes would have to be made to a given situation in order to change a model prediction. The quality of a counterfactual depends on several criteria: realism, actionability, validity, robustness, etc. In this paper, we are interested in the notion of robustness of a counterfactual. More precisely, we focus on robustness to counterfactual input changes. This form of robustness is particularly challenging as it involves a trade-off between the robustness of the counterfactual and the proximity with the example to explain. We propose a new framework, CROCO, that generates robust counterfactuals while managing effectively this trade-off, and guarantees the user a minimal robustness. An empirical evaluation on tabular datasets confirms the relevance and effectiveness of our approach.
Intelligent Transport Systems (ITS) is a fast evolving domain with an increasingly important role in shaping the future of transport and a significant impact on a wide range of issues, many of which have ethical impli...
ISBN:
(纸本)9783031490101;9783031490118
Intelligent Transport Systems (ITS) is a fast evolving domain with an increasingly important role in shaping the future of transport and a significant impact on a wide range of issues, many of which have ethical implications. On the other hand, Ethics is essential to ensure that ITS are safe, fair, accountable, trustworthy, and respectful of privacy. This study reflects on the ethical concerns around transport system and its impact on economic, social and environmental dimensions, from the spirit of the foundational concepts of Ethics to the specific issues raised by intelligent transport, including those enhanced by artificialintelligence (AI) and Machine Learning (ML) systems. The primordial ethical concerns of transport have, in some extent, been mitigated with the introduction of the ITS paradigm, but others have arisen as a result of emerging technologies. Ethics is therefore critical in intelligent transport because of its potential to significantly impact individuals, communities, and society as a whole, and is an important tool to design more sustainable, equitable, and fair transport systems.
The use of Shap scores has become widespread in Explainable AI. However, their computation is in general intractable, in particular when done with a black-box classifier, such as neural network. Recent research has un...
ISBN:
(纸本)9783031436185;9783031436192
The use of Shap scores has become widespread in Explainable AI. However, their computation is in general intractable, in particular when done with a black-box classifier, such as neural network. Recent research has unveiled classes of open-box Boolean Circuit classifiers for which Shap can be computed efficiently. We show how to transform binary neural networks into those circuits for efficient Shap computation. We use logic-based knowledge compilation techniques. The performance gain is huge, as we show in the light of our experiments.
暂无评论