Research and development are gradually becoming data-driven and the implementation of the FAIR Guidelines(that data should be Findable, Accessible, Interoperable, and Reusable) for scientific data administration and s...
详细信息
Research and development are gradually becoming data-driven and the implementation of the FAIR Guidelines(that data should be Findable, Accessible, Interoperable, and Reusable) for scientific data administration and stewardship has the potential to remarkably enhance the framework for the reuse of research data. In this way, FAIR is aiding digital transformation. The ‘FAIRification’ of data increases the interoperability and(re)usability of data, so that new and robust analytical tools, such as machine learning(ML) models, can access the data to deduce meaningful insights, extract actionable information, and identify hidden patterns. This article aims to build a FAIR ML model pipeline using the generic FAIRification workflow to make the whole ML analytics process FAIR. Accordingly, FAIR input data was modelled using a FAIR ML model. The output data from the FAIR ML model was also made FAIR. For this, a hybrid hierarchical k-means (HHK) clustering ML algorithm was applied to group the data into homogeneous subgroups and ascertain the underlying structure of the data using a Nigerian-based FAIR dataset that contains data on economic factors, healthcare facilities, and coronavirus occurrences in all the 36 states of Nigeria. The model showed that research data and the ML pipeline can be FAIRified, shared, and reused by following the proposed FAIRification workflow and implementing technical architecture.
We propose a novel algorithm for data augmentation in nonlinear over-parametrized regression. Our data augmentation algorithm borrows from the literature on causality and extends the recently proposed Anchor regressio...
We propose a novel algorithm for data augmentation in nonlinear over-parametrized regression. Our data augmentation algorithm borrows from the literature on causality and extends the recently proposed Anchor regression (AR) method for data augmentation, which is in contrast to the current state-of-the-art domain-agnostic solutions that rely on the Mixup literature. Our Anchor data Augmentation (ADA) uses several replicas of the modified samples in AR to provide more training examples, leading to more robust regression predictions. We apply ADA to linear and nonlinear regression problems using neural networks. ADA is competitive with state-of-the-art C-Mixup solutions. Our Python implementation of ADA is available at: https://***/noraschneider/anchordataaugmentation/
We propose a novel algorithm for data augmentation in nonlinear over-parametrized regression. Our data augmentation algorithm borrows from the literature on causality and extends the recently proposed Anchor regressio...
Mental workload (MWL) identification is vital to know human cognitive functioning, performance, and well-being. In this work, we develop models for identifying low vs. high MWL using different genres of machine learni...
详细信息
Hypergraph neural networks can model more flexible connectivity relationships, are used to model higher-order interactions, and have produced strong results in many real-world applications. However, the currently exis...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Hypergraph neural networks can model more flexible connectivity relationships, are used to model higher-order interactions, and have produced strong results in many real-world applications. However, the currently existing hypergraph neural networks need more exploration in capturing the global positional information of nodes in hypergraphs. Although there have been many explorations of the problem in graph neural networks, extending these approaches to hypergraphs is fraught with challenges. The major challenge is that hyperedges in hypergraphs are the other dimensional element of the incidence structure, have more flexible definitions than edges in graphs, and require more attention when learning global positional information. We propose a novel position-aware hypergraph message-passing neural network framework to address the above challenges. Specifically, we propose a global positional embedding learning approach that can separately model global positional information for nodes and hyperedges. At the same time, we also optimize the learning of local structures with hyperedges. Experiments on several publicly available benchmark datasets find that our proposed method outperforms many state-of-the-art methods.
Stock market is a dynamic and ever-changing environment that can be both exciting and challenging for investors. Equities, primarily referred to as stocks, are traded on stock exchanges around the world and reflect ow...
详细信息
With the popularity of GPS-equipped smart devices, spatial crowdsourcing (SC) techniques have attracted growing attention in both academia and industry. In existing trajectory-aware task assignment approaches, tasks a...
详细信息
ISBN:
(数字)9798350317152
ISBN:
(纸本)9798350317169
With the popularity of GPS-equipped smart devices, spatial crowdsourcing (SC) techniques have attracted growing attention in both academia and industry. In existing trajectory-aware task assignment approaches, tasks assigned to a worker may be far apart from each other, resulting in a higher detour cost as the worker needs to deviate from the original trajectory more often than necessary. Motivated by the above observations, we investigate a trajectory-aware task coalition assignment (TCA) problem and prove it to be NP-hard. The goal is to maximize the number of assigned tasks by assigning task coalitions to workers based on their preferred trajectories. To tackle the TCA problem, we develop a batch-based three-stage framework consisting of task grouping, planning, and assignment. Extensive experiments on real and synthetic datasets demonstrate the effectiveness and efficiency of the proposed algorithms.
Football is a very famous sport worldwide and continues gaining traction to this day. As the interest in the game rises, so do the methods of interacting with the game. One such avenue is the FPL (Football Premier Lea...
Football is a very famous sport worldwide and continues gaining traction to this day. As the interest in the game rises, so do the methods of interacting with the game. One such avenue is the FPL (Football Premier League). Recent years have seen a huge spike in interest for virtual, fantasy playing applications like the FPL. A person participating is usually heavily biased towards certain players or teams. This inherent bias causes them to make certain irrational decisions which may not provide the best results. There exists a huge gap in the market for an algorithm-based approach to team selection and the aim of this research is to make an optimal application capable of filling that void. The premise of this research is to create a team recommender system that optimizes one's team selection strategy for the FPL. This research works by introducing objectivity and eliminating biases in the team selection process. It incorporates the usage of TOPSIS (Technique for Order of Preference by Similarity to Ideal Solution) and criteria-based metrics along with an XG-Boost regression predictor, with the aim of filling the gaps in the current systems and to maximize the returns while minimizing the cost associated with it. The approach implemented involved the usage of XGBoost, yielding a standard deviation of 0.041188.
Events refer to specific occurrences, incidents, or happenings that take place under a particular background. Event reasoning aims to infer events according to certain relations and predict future events. The cutting-...
详细信息
Automatic test generation can help verify and develop the behavior of mobile applications. Test reuse based on semantic similarities between applications of the same category has been utilized to reduce the manual eff...
详细信息
暂无评论