Image analysis tasks use salient object detection because it not only identifies important elements of a visual scene but also lessens computational complexity by removing unimportant elements. In this research, we pr...
详细信息
Nonnegative Matrix Factorization(NMF)is one of the most popular feature learning technologies in the field of machine learning and pattern *** has been widely used and studied in the multi-view clustering tasks becaus...
详细信息
Nonnegative Matrix Factorization(NMF)is one of the most popular feature learning technologies in the field of machine learning and pattern *** has been widely used and studied in the multi-view clustering tasks because of its *** study proposes a general semi-supervised multi-view nonnegative matrix factorization *** algorithm incorporates discriminative and geometric information on data to learn a better-fused representation,and adopts a feature normalizing strategy to align the different *** specific implementations of this algorithm are developed to validate the effectiveness of the proposed framework:Graph regularization based Discriminatively Constrained Multi-View Nonnegative Matrix Factorization(GDCMVNMF)and Extended Multi-View Constrained Nonnegative Matrix Factorization(ExMVCNMF).The intrinsic connection between these two specific implementations is discussed,and the optimization based on multiply update rules is *** on six datasets show that the effectiveness of GDCMVNMF and ExMVCNMF outperforms several representative unsupervised and semi-supervised multi-view NMF approaches.
In the beginning of 2020, the world witnessed the rapid spread of the new coronavirus, COVID-19, affecting millions of people globally. However, at the outset, the availability of corona test kits was scarce, leading ...
详细信息
Owing to the high complexity of chip architecture and assay protocol, considerable effort has been directed toward the design automation of continuous-flow microfluidics over the past decade. Existing methods, however...
详细信息
We study deep neural networks for the multi-label classification (M-lab) task through the lens of neural collapse (NC). Previous works have been restricted to the multi-class classification setting and discovered a pr...
详细信息
The increasing relevance of panoptic segmentation is tied to the advancements in autonomous driving and AR/VR applications. However, the deployment of such models has been limited due to the expensive nature of dense ...
详细信息
Vision-language models (VLMs) have emerged as formidable tools, showing their strong capability in handling various open-vocabulary tasks in image recognition, text-driven visual content generation, and visual chatbot...
详细信息
In smart classrooms, conducting multi-face expression recognition based on existing hardware devices to assessstudents’ group emotions can provide educators with more comprehensive and intuitive classroom effect anal...
详细信息
In smart classrooms, conducting multi-face expression recognition based on existing hardware devices to assessstudents’ group emotions can provide educators with more comprehensive and intuitive classroom effect analysis,thereby continuouslypromotingthe improvementof teaching ***,most existingmulti-face expressionrecognition methods adopt a multi-stage approach, with an overall complex process, poor real-time performance,and insufficient generalization ability. In addition, the existing facial expression datasets are mostly single faceimages, which are of low quality and lack specificity, also restricting the development of this research. This paperaims to propose an end-to-end high-performance multi-face expression recognition algorithm model suitable forsmart classrooms, construct a high-quality multi-face expression dataset to support algorithm research, and applythe model to group emotion assessment to expand its application value. To this end, we propose an end-to-endmulti-face expression recognition algorithm model for smart classrooms (E2E-MFERC). In order to provide highqualityand highly targeted data support for model research, we constructed a multi-face expression dataset inreal classrooms (MFED), containing 2,385 images and a total of 18,712 expression labels, collected from smartclassrooms. In constructing E2E-MFERC, by introducing Re-parameterization visual geometry group (RepVGG)block and symmetric positive definite convolution (SPD-Conv) modules to enhance representational capability;combined with the cross stage partial network fusion module optimized by attention mechanism (C2f_Attention),it strengthens the ability to extract key information;adopts asymptotic feature pyramid network (AFPN) featurefusion tailored to classroomscenes and optimizes the head prediction output size;achieves high-performance endto-end multi-face expression detection. Finally, we apply the model to smart classroom group emotion assessmentand provide design refe
Dynamic Adaptive Streaming over HTTP (DASH) is a widely adopted video streaming protocol. Adaptive Bitrate Streaming (ABR) algorithm is utilized to dynamically switch between different bitrates. However, traditional A...
Dynamic Adaptive Streaming over HTTP (DASH) is a widely adopted video streaming protocol. Adaptive Bitrate Streaming (ABR) algorithm is utilized to dynamically switch between different bitrates. However, traditional ABR algorithms have gradually failed to meet users’ demands for high-quality video transmission, especially in complex network environments. Thus, the current research focus has shifted towards enhancing the accuracy of algorithm. The advent of edge computing has brought new possibilities for DASH transmission. Edge computing-based algorithms can make decisions from a more macroscopic perspective, which can enhance algorithm efficiency. In this paper, we propose an edge computing-based strategy that enables the edge server to obtain the actual file size of the next segment at every bitrate. As a result, edge servers can obtain more informations. We further propose an edge-based system model that assists the ABR algorithm in achieving better operational efficiency. With additional information, the algorithm located at the edge node possesses the capability to make more precise decisions, which is advantageous for enhancing the quality of experience (QoE) for users. Our experiments demonstrate that the proposed strategy can significantly enhance QoE and network resource utilization.
Real-time filtering and derivative signals with as small phase lag as possible are of great significance for control performances. In this work, an enhanced time optimal control (referred as f sa) based tracking diffe...
详细信息
暂无评论