Data processing is an important step in various natural language processing tasks. As the commonly used datasets in named entity recognition contain only a limited number of samples, it is important to obtain addition...
详细信息
Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, such as helping to commit crimes or producing racist text. One approach to fine-tuning, called reinforcement learning f...
详细信息
Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, such as helping to commit crimes or producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level principles. But how do we deal with potentially diverging input from humans? How can we aggregate the input into consistent data about "collective" preferences or otherwise use it to make collective choices about model behavior? In this paper, we argue that the field of social choice is well positioned to address these questions, and we discuss ways forward for this agenda, drawing on discussions in a recent workshop on Social Choice for AI Ethics and Safety held in Berkeley, CA, USA in December 2023. Copyright 2024 by the author(s)
With the increasing demand for robust security in the rapidly expanding metaverse within 6 G networks, advanced intrusion detection systems (IDS) are becoming essential. This paper introduces a novel hybrid intrusion ...
详细信息
ISBN:
(数字)9798350391725
ISBN:
(纸本)9798350391732
With the increasing demand for robust security in the rapidly expanding metaverse within 6 G networks, advanced intrusion detection systems (IDS) are becoming essential. This paper introduces a novel hybrid intrusion detection framework that combines Gaussian Mixture Clustering with Random Forest classifiers and KMeans Clustering with Random Forest classifiers. These models are evaluated for their effectiveness in identifying complex intrusion patterns. The hybrid approaches demonstrated high detection rates, with the Gaussian Mixture Clustering ensemble achieving a detection rate of 99.5-99.6%, while the KMeans clustering model achieved a detection rate of $\mathbf{9 8 - 9 9 \%}$. Comparative analysis shows that these models enhance detection performance while maintaining robustness in a dynamic and diverse metaverse landscape.
High-quality training datasets are critical for building successful Machine Learning (ML) based NLP systems. However, these datasets are not always available in low-resource contexts such as the biomedical domain. Her...
详细信息
ISBN:
(纸本)9781450387194
High-quality training datasets are critical for building successful Machine Learning (ML) based NLP systems. However, these datasets are not always available in low-resource contexts such as the biomedical domain. Here, selecting relevant training data is as important as the choice of the ML model. In this study we propose UDON: Unsupervised Data selectiON for biomedical entity recognition using domain-specific pretrained Language Models (LMs). We first show that pretrained LMs succeed at implicitly learning the differences between datasets without any supervision, and then use these models to select relevant data instances. Next, we evaluate the proposed methods for entity recognition on seven biomedical datasets and one news domain dataset using four LMs and three selection methods. Our results show that using pretrained domain-specific LMs for data selection outperforms all other approaches. Finally, we use domain classification as an auxiliary task for pretraining the neural network on the in-domain dataset and show this yields further improvements.
Checkpoint averaging is a simple and effective method to boost the performance of converged neural machine translation models. The calculation is cheap to perform and the fact that the translation improvement almost c...
详细信息
As the pet industry develops, fine-grained breed recognition and individual recognition have emerged as essential components in biometric measurement systems for intelligent pet monitoring, aiming to identify the spec...
详细信息
As the pet industry develops, fine-grained breed recognition and individual recognition have emerged as essential components in biometric measurement systems for intelligent pet monitoring, aiming to identify the specific breed of a pet in an image and to recognize the same individual across multiple images. These capabilities lay the foundation for downstream tasks such as posture estimation and emotion analysis, supporting a wide range of real-world applications. Despite the substantial advancements achieved in existing research, two critical issues remain to be solved: the diversity of object poses affects representation in complex scenarios, and the conflict between model complexity and performance hinders application in resource-constrained conditions. To address the above issues, we propose an integrated Face-and-Body Information for Lightweight Breed and Individual Recognition (iFBI) scheme that integrates multiple pose information by a lightweight model. Specifically, a Face Alignment (FA) module and a Body Posture-Guided (BPG) module are proposed to separate face and body information from the input images, fully capturing the posture details while suppressing background areas. To further maximize the discrimination between individual samples, we design a Multi-level Representation Recognition (MRR) module that dynamically integrates complementary semantic features of face and body, consequently generating more discriminative features. In addition, a Dynamic Convolutional Model Compression (DCMC) method is implemented with an improved dual-branch backbone that significantly reduces computational costs while enhancing model performance. Extensive experiments on two self-built datasets, Pet with Fine-grained Breed Dataset (Pet-FB) and Pet with Diverse Posture Dataset (Pet-DP), and four public datasets indicate that iFBI yields superior performance in both fine-grained breed recognition and individual recognition tasks. The source code and self-built datasets—P
Correction To: Neural Computing and Applications (2024) 36:16001–16021 https://***/10.1007/s00521-024-09914-5 In this article, the sentence ‘The models conditioned with the concepts are highlighted with the dove gra...
This paper presents an approach to enhance classification accuracy of human Activity Recognition (HAR) datasets using a shallow hierarchical method, comprised of 2 Convolutional layers and 1 LSTM Layer, to identify tr...
This paper presents an approach to enhance classification accuracy of human Activity Recognition (HAR) datasets using a shallow hierarchical method, comprised of 2 Convolutional layers and 1 LSTM Layer, to identify transition tasks. The designed model was tested on the human Activity and Postural Transition (HAPT) dataset from the University of California, Irvine, achieving an accuracy of 93 %, demonstrating it's efficacy in correctly identifying a variety of human activities, including the postural transition states. An F1 score of 85 % was attained, highlighting a reliable method of reducing false positives on an unbalanced dataset. The performance on these metrics showcases the model's reliability and proficiency in classifying activities within unbalanced datasets.
Due to the possibilities provided by such technologies to provide people with live immersive virtual worlds, Extended Reality (XR) technologies such as virtual (VR), augmented (AR), and mixed reality (MR) have grown. ...
详细信息
Small form factor limits physical input space in earable (i.e., ear-mounted wearable) devices. Off-device earable inputs in alternate mid-air and on-skin around-ear interaction spaces using uni-manual gestures can add...
详细信息
暂无评论