Multi-view clustering has been shown to boost clustering performance by effectively mining the complementary information from multiple views. However, we observe that learning from data with more views is not guarante...
详细信息
ISBN:
(数字)9781665469463
ISBN:
(纸本)9781665469463
Multi-view clustering has been shown to boost clustering performance by effectively mining the complementary information from multiple views. However, we observe that learning from data with more views is not guaranteed to achieve better clustering performance than from data with fewer views. To address this issue, we propose a general deep learning based framework that is guaranteed to reduce the risk of performance degradation caused by view increase. Concretely, the model is trained to simultaneously extract complementary information and discard the meaningless noise by automatically selecting features. These two learning procedures are incorporated into one unified framework by the proposed optimization objective. In theory, the empirical clustering risk of the model is no higher than learning from data before the view increase and data of the new increased single view. Also, the expected clustering risk of the model under divergence-based loss is no higher than that with high probability. Comprehensive experiments on benchmark datasets demonstrate the effectiveness and superiority of the proposed framework in achieving safe multi-view clustering.
The educational sector faces a new dimension that is dominated by lifelong learning and is affected by the technical, social, and cultural changes. This pattern represents the need to improve the teaching methods for ...
详细信息
The educational sector faces a new dimension that is dominated by lifelong learning and is affected by the technical, social, and cultural changes. This pattern represents the need to improve the teaching methods for physical education and sports science. The use of computers and other information technology to increase the effectiveness of the teaching process is a modern method. This paper aims to illustrate the use of information and communication technologies (ICT) in physical education and sports. In our field, the gradual computerization results can be summed up in the following aspects: education software, design, and planning activities, recording outcomes, motion monitoring, video analysis, comparison of performance and synchronizing, measurements at distance and time and the evaluation of the activity. Although physical education and sports are practical activities, specialists can make use of modern teaching technologies. In this paper, the system of curriculum assessment for physical education has been analyzed and researched in computer assessment. The first section introduced the method of assessment of the physical education program. The second phase of the paper represents a teaching model of the physical education mathematical model utilizing the Comprehensive Adaptive Fuzzy Evaluation Theory has been proposed. A new level is the modernization of physics education with the artificial intelligence computer education system built in this paper. The experimental results have highperformance in detecting the physical activity of college students.
Video analytics systems designed for computer vision tasks use deep learning models that rely on high-quality input data to maximize performance. However, in a real-world system, these inputs are often compressed usin...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
Video analytics systems designed for computer vision tasks use deep learning models that rely on high-quality input data to maximize performance. However, in a real-world system, these inputs are often compressed using video codecs such as HEVC. Video compression degrades the quality of the inputs, thereby degrading the performance of these models. Region-of-interest (ROI) coding enables bits to be allocated to improve performance;however, the method to select regions should be computationally simple since it must occur during or before the video is compressed and transmitted for further processing. In this paper, we propose a task-aware quad-tree (TA-QT) partitioning and quantization method to achieve ROI coding for HEVC and other video coding standards. TAQT uses a lightweight edge-based model to guide task-aware video encoding to improve end-stage video analytics (ESVA) performance while reducing both bit-rate and encoding time. We demonstrate the effectiveness of our approach in terms of (a) the performance of the ESVA on compressed inputs, (b) transmission bit-rates, and (c) encoding time.
In recent years, remote work has become a more popular option for companies across various industries. While remote work provides numerous benefits, such as flexibility and increased work-life balance, it also present...
详细信息
As 5G networks rapidly expand to support higher data rates, lower latency, and increased device density, the associated security risks are also growing. That is why the deployment of 5G networks introduces significant...
详细信息
Stray light is an important indicator that affects the performance of a high-resolution spectrometer system. In order to improve the spectral resolution of the spectrometer system, this paper proposes a computer model...
详细信息
The task of named entity recognition (NER) is crucial in the creation of knowledge graphs. With the advancement of deep learning, the pre-training model BERT has become the mainstream solution for NER. However, lack o...
详细信息
Approximate nearest neighbor search (ANNS) has emerged as a crucial component of database and AI infrastructure. Ever-increasing vector datasets pose significant challenges in terms of performance, cost, and accuracy ...
详细信息
In recent years, spiking neural networks (SNNs) received significant attention as the third generation of networks and have successfully been employed in energy-efficient image classification tasks. However, typical S...
详细信息
ISBN:
(纸本)9783031301070;9783031301087
In recent years, spiking neural networks (SNNs) received significant attention as the third generation of networks and have successfully been employed in energy-efficient image classification tasks. However, typical SNN construction methods still suffer from problems such as high inference latency or incompatibility with complicated models. Thus, applications of SNNs are limited to relatively simple tasks. In this paper, we establish an SNN-based action recognition model which aims at a more challenging video classification task. Specifically, the action recognition SNN model with a deep two-stream architecture is constructed with a hybrid conversion method combining channel-wise normalization and tandem learning. A skipping-step rate decoder is applied to decrease the conversion errors and improve the transmission accuracy. A new conversion and inference method for recurrent spiking neural network (RSNN) is introduced into the framework. The tandem learning method with bounded ReLU (bReLU) function is employed to fine-tune the normalized SNN parameters, decreasing the inference latency while still preserving high accuracy. Experiments on UCF-101 show that our proposed model obtains an accuracy of 88.46% with only 200 time steps, which achieves a high-accuracy and energy-efficient performance in SNN.
Learning-based Neural Video Codecs (NVCs) have emerged as a compelling alternative to standard video codecs, demonstrating promising performance, and simple and easily maintainable pipelines. However, NVCs often fall ...
详细信息
暂无评论