Finding the mode of a high dimensional probability distribution D is a fundamental algorithmic problem in statistics and data analysis. There has been particular interest in efficient methods for solving the problem w...
详细信息
In this paper, we propose a novel representation for grasping using contacts between multi-finger robotic hands and objects to be manipulated. This representation significantly reduces the prediction dimensions and ac...
详细信息
Recently, using machine learning technology to realize abnormal behavior recognition in video surveillance to replace human monitoring has become a hot academic topic. In that case, constructing an efficient and unifi...
详细信息
ISBN:
(纸本)9781665480468
Recently, using machine learning technology to realize abnormal behavior recognition in video surveillance to replace human monitoring has become a hot academic topic. In that case, constructing an efficient and unified framework for multi-type abnormal behavior recognition is a worthy topic in machine learning research. This research aims to design a lightweight recognition framework that can recognize various abnormal behaviors in real-time. We propose a Novel framewOrk for the Multi-type Abnormal BEhavior Recognition (NOMABER), which consists of three parts. Firstly, the improved image pre-processing module annotates the abnormal behaviors of image data sets. Secondly, the improved YOLOv5 module is used to identify the multi-type abnormal behaviors, and then the abnormal behaviors are classified by the output module. Finally, experiments on real data sets show that NOMABER is superior to the current methods of real-time performance, identification accuracy, and types of abnormal behaviors.
Precise stock market prediction is crucial for investors, but the volatility of the stock market is influenced by multiple factors such as public sentiments, business news, and related product volatility. While severa...
详细信息
Real-world data often contain incomplete views with varying degrees of missing information. While there are existing methods for learning representations from such data, effectively utilizing all incomplete view data ...
Real-world data often contain incomplete views with varying degrees of missing information. While there are existing methods for learning representations from such data, effectively utilizing all incomplete view data and ensuring robustness to different levels of completeness remains a challenging task. To address this problem, we propose a novel framework named IMRL-AGI. IMRL-AGI combines the anchor graph-based Graph Convolutional Network (GCN) and information bottleneck. Specifically, the framework starts by constructing an anchor graph to effectively captures the nonlinear information between instances. Next, an anchor graph-based GCN is designed to extract feature information from various views. IMRL-AGI maximizes the mutual information between the views obtained by the common representation and the anchor-graph-based GCN, ensuring the accurate extraction of view information. Furthermore, the minimization of mutual information is applied to promote diversity and reduce redundancy in the multi-view representation. Extensive experiments are conducted on several real-world datasets, and the results demonstrate the superiority of IMRL-AGI.
Visual Question Answering (VQA) models fail catastrophically on questions related to the reading of text-carrying images. However, TextVQA aims to answer questions by understanding the scene texts in an image-question...
详细信息
It has been recognized that the data generated by the denoising diffusion probabilistic model (DDPM) improves adversarial training. After two years of rapid development in diffusion models, a question naturally arises...
详细信息
With the rapid development of Web3 D technologies, sketch-based model retrieval has become an increasingly important challenge, while the application of Virtual Reality and 3 D technologies has made shape retrieval of...
详细信息
With the rapid development of Web3 D technologies, sketch-based model retrieval has become an increasingly important challenge, while the application of Virtual Reality and 3 D technologies has made shape retrieval of furniture over a web browser feasible. In this paper, we propose a learning framework for shape retrieval based on two Siamese VGG-16 Convolutional Neural Networks(CNNs), and a CNN-based hybrid learning algorithm to select the best view for a shape. In this algorithm, the AlexNet and VGG-16 CNN architectures are used to perform classification tasks and to extract features, respectively. In addition, a feature fusion method is used to measure the similarity relation of the output features from the two Siamese networks. The proposed framework can provide new alternatives for furniture retrieval in the Web3 D environment. The primary innovation is in the employment of deep learning methods to solve the challenge of obtaining the best view of 3 D furniture,and to address cross-domain feature learning problems. We conduct an experiment to verify the feasibility of the framework and the results show our approach to be superior in comparison to many mainstream state-of-the-art approaches.
The broad learning system (BLS) has been attracting more and more attention due to its excellent property in the field of machine learning. A great deal of variants and hybrid structures of BLS have also been designed...
详细信息
In the industrial environment, machines often need to reflect the anomaly detection results to the total control center in time, and the general industrial network can not achieve high real-time. In order to solve suc...
In the industrial environment, machines often need to reflect the anomaly detection results to the total control center in time, and the general industrial network can not achieve high real-time. In order to solve such challenges, a set of protocol standards developed by IEEE802.1 working group, namely Time-sensitive Networking (TSN), has been introduced into industrial networks. TSN can provide high real-time and reliability for data transmission, where the reliability is achieved by Frame duplication and Frame Elimination (FRER). In the realization process of FRER, it is necessary to determine the source node, destination node, and multiple disjoint paths to transmit redundant data. However, the transmission of these redundant traffic may result in the delay of other flows, and then affects the user experience. Therefore, it is very important to choose excellent redundant traffic paths to ensure reliability and reduce the impact on other flows. In the existing research, there are many dynamic scheduling and routing heuristics to determine the path, but they do not consider the influence of the location of the source node on the whole route scheduling. This paper proposes an improved dynamic scheduling and routing heuristic method, which takes the source node into account in the routing selection. In the flow test experiments of different magnitudes, it is found that the total delay of all flows is reduced by 1.4%-4.5% under the same magnitude of schedulability compared with Ant Colony Optimization.
暂无评论