Accurate interpolation algorithms are highly desired in various theoretical and engineering scenarios. Unlike the traditional numerical algorithms that have exact zero-residual constraints on observed points, the neur...
Accurate interpolation algorithms are highly desired in various theoretical and engineering scenarios. Unlike the traditional numerical algorithms that have exact zero-residual constraints on observed points, the neural network-based interpolation methods exhibit non-zero residuals at these points. These residuals, which provide observations of an underlying residual function, can guide predicting interpolation functions, but have not been exploited by the existing approaches. To fill this gap, we propose Hierarchical INTerpolation Network (HINT), which utilizes the residuals on observed points to guide target function estimation in a hierarchical fashion. HINT consists of several sequentially arranged lightweight interpolation blocks. The first interpolation block estimates the main component of the target function, while subsequent blocks predict the residual components using observed points residuals of the preceding blocks. The main component and residual components are accumulated to form the final interpolation results. Furthermore, under the assumption that finer residual prediction requires a more focused attention range on observed points, we utilize hierarchical local constraints in correlation modeling between observed and target points. Extensive experiments demonstrate that HINT outperforms existing interpolation algorithms significantly in terms of interpolation accuracy across a wide variety of datasets, which underscores its potential for practical scenarios.
This paper proposes an effective low-rank alternating direction doubling algorithm (R-ADDA) for computing numerical low-rank solutions to large-scale sparse continuous-time algebraic Riccati matrix equations. The meth...
详细信息
This paper presents an effective low-rank generalized alternating direction implicit iteration (R-GADI) method for solving large-scale sparse and stable Lyapunov matrix equations and continuous-time algebraic Riccati ...
详细信息
In this paper, we focus on using optimization methods to solve matrix equations by transforming the problem of solving the Sylvester matrix equation or continuous algebraic Riccati equation into an optimization proble...
详细信息
Flexible capacitive pressure sensors are optimal for wearable electronic equipment owing to their advantages of simplicity in architecture and low power consumption. However, fabricating a bionic flexible capacitive p...
Flexible capacitive pressure sensors are optimal for wearable electronic equipment owing to their advantages of simplicity in architecture and low power consumption. However, fabricating a bionic flexible capacitive pressure sensor with high sensitivity and a wide dynamic range of detection is still a great challenge. In this article, we propose a flexible pressure sensor based on femtosecond laser ablation with a treefrog suckers bionic microstructure. The treefrog suckers capacitive pressure sensor (TSCPS) has high sensitivity (0-10 kPa, 0.0762 kPa -1 ), wide detection range (500 kPa), high repeatability (>12000 s at 10 kPa), and fast response time (62.5 ms). For practical applications, the TSCPS was positioned on the wrist, palm, and sole of the hand to detect the movements of each part accordingly. The experimental findings indicate that the TSCPS has broad promising applications in bionic wearable products such as human locomotion monitoring.
We have introduced the generalized alternating direction implicit iteration (GADI) method for solving large sparse complex symmetric linear systems and proved its convergence properties. Additionally, some numerical r...
详细信息
Video Question Answering (VideoQA) has emerged as a vital tool to evaluate agents' ability to understand human daily behaviors. Despite the recent success of large vision language models in many multi-modal tasks,...
Video Question Answering (VideoQA) has emerged as a vital tool to evaluate agents' ability to understand human daily behaviors. Despite the recent success of large vision language models in many multi-modal tasks, complex situation reasoning over videos involving multiple human-object interaction events still remains challenging. In contrast, humans can easily tackle it by using a series of episode memories as anchors to quickly locate question-related key moments for reasoning. To mimic this effective reasoning strategy, we propose the Glance-Focus model. One simple way is to apply an action detection model to predict a set of actions as key memories. However, these actions within a closed set vocabulary are hard to generalize to various video domains. Instead of that, we train an Encoder-Decoder to generate a set of dynamic event memories at the glancing stage. Apart from using supervised bipartite matching to obtain the event memories, we further design an unsupervised memory generation method to get rid of dependence on event annotations. Next, at the focusing stage, these event memories act as a bridge to establish the correlation between the questions with high-level event concepts and low-level lengthy video content. Given the question, the model first focuses on the generated key event memory, then focuses on the most relevant moment for reasoning through our designed multi-level cross-attention mechanism. We conduct extensive experiments on four Multi-Event VideoQA benchmarks including STAR, EgoTaskQA, AGQA, and NExT-QA. Our proposed model achieves state-of-the-art results, surpassing current large models in various challenging reasoning tasks. The code and models are available at https://***/ByZ0e/Glance-Focus.
The prevalence of short video platforms has spawned a lot of fake news videos, which have stronger propagation ability than textual fake news. Thus, automatically detecting fake news videos has been an important count...
详细信息
Performance degradation or system resource exhaustion can be attributed to inadequate computing resources as a result of software *** the real world,the workload of a web server varies with time,which will cause a non...
详细信息
Performance degradation or system resource exhaustion can be attributed to inadequate computing resources as a result of software *** the real world,the workload of a web server varies with time,which will cause a nonlinear aging *** nonlinear property often makes analysis and modelling *** is one of the important factors influencing the speed of *** paper quantitatively analyzes the workload-aging relation and proposes a framework for aging control under varying *** addition,this paper proposes an approach that employs prior information of workloads to accurately forecast incoming system *** workload data are used as a threshold to divide the system resource usage data into multiple sections,while in each section the workload data can be treated as a *** section is described by an individual autoregression(AR)*** with other AR models,the proposed approach can forecast the aging process with a higher accuracy.
Aiming at the problem that reprojection error (RE) cannot reflect the camera calibration accuracy intuitively, this paper proposes a method to verify the camera calibration results in world coordinate system. Firstly,...
Aiming at the problem that reprojection error (RE) cannot reflect the camera calibration accuracy intuitively, this paper proposes a method to verify the camera calibration results in world coordinate system. Firstly, the checkerboard calibration target is photographed with the camera under test, and the internal parameters and distortion parameters of the camera are calculated by using the pinhole model and the Kannala-Brandt (KB) fisheye model, respectively. Then, the same camera is used to shoot two targets in the field of view (FOV), and the pixel coordinates of the two target points are detected from the image. Thirdly, the inverse projection model is used to calculate the distance between the two objects and the angle between the camera and the two objects, and the real distance and angle are compared. The experimental results show that the average distance errors of the pinhole model and the KB fisheye model are 0.34% and 0.35%, respectively, and the angle errors are 0.40% and 0.12%, respectively, which proves that the verification method can effectively characterize the calibration accuracy of the camera.
暂无评论