The primary challenge in tracing the participants in sports and marathon video or images is to detect and localize the jersey/Bib number that may present in different regions of their outfit captured in cluttered envi...
详细信息
Sample selection is crucial in classification tasks with noisy labels, yet most existing sample selection methods rely on a single criterion. These approaches often face challenges, including low purity of selected cl...
Sample selection is crucial in classification tasks with noisy labels, yet most existing sample selection methods rely on a single criterion. These approaches often face challenges, including low purity of selected clean samples, and underfitting due to an insufficient number of selected clean training samples. To address these challenges, this paper proposes GNet-SSLC, a novel multi-granularity network framework that integrates multiple criteria ensemble sample selection (SS) and multiple views label correction (LC). In the SS phase, this paper proposes a metric learning-based dual k-Nearest Neighbor (k-NN) sample selection method. This method first uses corrected soft labels from the initial k-NN round to guide the selection of clean samples in the subsequent k-NN round. To further enhance selection accuracy, we combine this dual k-NN approach with a small loss sample selection technique through a voting mechanism. This multiple criteria ensemble method addresses the issues of low purity and instability inherent in single criterion approaches. In the LC phase, this paper designs a multiple views label correction framework that generates high-quality pseudo-labels for selected noisy samples. A key innovation of the framework is the design of a regularized contrastive learning loss, which optimizes the semi-supervised learning process by leveraging multiple views of training samples. The additional inclusion of training samples with high-quality pseudo-labels can effectively mitigate underfitting caused by a limited number of clean training samples. Experimental results on both synthetic and real-world noisy datasets indicate that GNet-SSLC enhances the purity and stability of the selected clean samples, and significantly improves classification performance. The enhancement is particularly notable with high noise rate dataset, such as CIFAR-100 dataset with 80% noise rate, achieving a 19.3% increase in classification accuracy compared to the baseline method.
A rough set,first described by Polish computer scientist Zdzis?aw Pawlak,is a formal approximation of a crisp set,and it is now known as a new mathematical tool to process vague *** are used for machine learning,knowl...
详细信息
A rough set,first described by Polish computer scientist Zdzis?aw Pawlak,is a formal approximation of a crisp set,and it is now known as a new mathematical tool to process vague *** are used for machine learning,knowledge discovery,feature selection,etc.,and are applied to artificial intelligence,medical informatics,civil engineering,Kansei engineering,decision science,business administration,and so ***,research on data mining using rough sets is widely spreading,and the obtained association rules are applied to the characterisation of data and decision support.
In this paper, we compare 4 learning projection models between sensor domain and text domain for Zero-shot learning (ZSL). In traditional activity recognition with sensor data, the task of collecting training dataset ...
详细信息
In this paper, we compare 4 learning projection models between sensor domain and text domain for Zero-shot learning (ZSL). In traditional activity recognition with sensor data, the task of collecting training dataset is too tough and costly to apply for social. Our challenge is making the task efficient. The Zero-shot learning's purpose is to recognize the unknown activity which is activity class out of training dataset. In our previous research, we propose the Zero-shot learning method using the word vectors made from Wikipedia corpus for recognizing the human living activities like breakfast, watching TV, etc. We found that this method success to recognize unknown activities and need to improve the projection function for performance. In this paper, we construct 4 learning models for projection and evaluate them with accelerometer sensor data annotated simple activities. As a result, we realize that (1) the learning method with twice projection is useful for performance. (2) It is difficult to identify the two unknown activities whose distance from known activities is closer than that between other combination of two unknown activities and the known ones.
This paper describes the compact and low power AI DNN discriminator which is mounted on the small drone like a hobby type. By adopting this AI module, the effective plant monitoring system using the drone has been ach...
详细信息
This study proposes large force generation by a robotic manipulator by means of providing series elastic actuators exploiting mechanical resonance to it instead of powerful actuators. To this end, the authors designed...
This study proposes large force generation by a robotic manipulator by means of providing series elastic actuators exploiting mechanical resonance to it instead of powerful actuators. To this end, the authors designed and assembled a prototype of series elastic actuator that resonates at around 1.0 Hz on the basis of its model and identified parameters, and carried out preliminary experiments with it and a load. The prototype actuator consists of a geared DC motor with an encoder and an elastic element. The elastic element is made of two torsion springs so that it can generate torque in both directions. It was confirmed in the preliminary experiments that the prototype actuator resonated at around 0.7 Hz. In addition, in order to prove efficacy of mechanical resonance for large force generation, output torque of the prototype actuator with the elastic element and without it was estimated using their identified physical parameters and dynamics, and compared them. The estimation results showed that the prototype actuator exploiting mechanical resonance generated 2.24 times larger torque than the one without the elastic element.
LoRa is an ISM-band based LPWAN communication protocol. Despite their wide network penetration of approximately 20 kilometers or higher using lower than 14 decibels transmitting power, it has been extensively document...
LoRa is an ISM-band based LPWAN communication protocol. Despite their wide network penetration of approximately 20 kilometers or higher using lower than 14 decibels transmitting power, it has been extensively documented and used in academia and industry. Although LoRa connectivity defines a public platform and enables users to create independent low-power wireless connections while relying on external architecture, it has gained considerable interest from scholars and the market. The two fundamental components of this platform are LoRaWAN and LoRa PHY. The consumer LoRaWAN component of the technology describes the network model, connectivity procedures, ability to operate the frequency range, and the types of interlinked gadgets. In contrast, the LoRa PHY component is patentable and provides information on the modulation strategy which is being utilized and its attributes. There are now several LoRa platforms available. To create usable LoRa systems, there are presently several technical difficulties to be overcome, such as connection management, allocation of resources, consistent communications, and security. This study presents a thorough overview of LoRa networking, covering the technological difficulties in setting up LoRa infrastructures and current solutions. Several outstanding challenges of LoRa communication are presented depending on our thorough research of the available solutions. The research report aims to stimulate additional research toward enhancing the LoRa Network capacity and allowing more realistic installations.
The flow field between two tandem buildings will be investigated using PIV technique and the effect of various angles of incoming flow will be described. The 3D model is composed from two blocks of different size arra...
The flow field between two tandem buildings will be investigated using PIV technique and the effect of various angles of incoming flow will be described. The 3D model is composed from two blocks of different size arrangement (ratio is 0.6) and it is subjugated to well-developed boundary layer. Experiments are conducted for many vertical and horizontal planes to study statistical features of the flow.
This paper is about the installation of trust in the Semantic Web content of the Digital Reference (DR). The DR was developed in H2020/ECSEL/Productive4.0, which contains ontologies to model production and Cyber Physi...
详细信息
暂无评论