Hierarchical multi-granularity image classification is a challenging task that aims to tag each given image with multiple granularity labels *** methods tend to overlook that different image regions contribute differently to label prediction at different granularities, and also insufficiently consider relationships between the hierarchical multi-granularity *** introduce a sequence-to-sequence mechanism to overcome these two problems and propose a multi-granularity sequence generation (MGSG) approach for the hierarchical multi-granularity image classification ***, we introduce a transformer architecture to encode the image into visual representation ***, we traverse the taxonomic tree and organize the multi-granularity labels into sequences, and vectorize them and add positional *** proposed multi-granularity sequence generation method builds a decoder that takes visual representation sequences and semantic label embedding as inputs, and outputs the predicted multi-granularity label *** decoder models dependencies and correlations between multi-granularity labels through a masked multi-head self-attention mechanism, and relates visual information to the semantic label information through a cross-modality attention *** this way, the proposed method preserves the relationships between labels at different granularity levels and takes into account the influence of different image regions on labels with different *** on six public benchmarks qualitatively and quantitatively demonstrate the advantages of the proposed *** project is available at https://***/liuxindazz/mgs.
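The step of organizing multi-granularity labels into sequences can be illustrated with a small sketch. It is not the authors' code: the toy taxonomy `PARENT`, the function names, and the integer vocabulary are all hypothetical, and the sketch only shows how a root-to-leaf walk of a taxonomic tree might yield a coarse-to-fine label sequence with position indices.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical toy taxonomy: each label maps to its parent (None = coarsest level).
PARENT: Dict[str, Optional[str]] = {
    "bird": None,
    "albatross": "bird",
    "laysan albatross": "albatross",
    "sooty albatross": "albatross",
}


def label_sequence(leaf: str, parent: Dict[str, Optional[str]]) -> List[str]:
    """Walk from a leaf label up to the root, then return labels coarse-to-fine."""
    path: List[str] = []
    node: Optional[str] = leaf
    while node is not None:
        path.append(node)
        node = parent[node]
    return path[::-1]


def vectorize(seq: List[str], vocab: Dict[str, int]) -> List[Tuple[int, int]]:
    """Pair each label id with its position in the sequence (0 = coarsest)."""
    return [(vocab[label], pos) for pos, label in enumerate(seq)]


# Assumed vocabulary: labels numbered in sorted order.
VOCAB = {label: idx for idx, label in enumerate(sorted(PARENT))}

seq = label_sequence("laysan albatross", PARENT)
pairs = vectorize(seq, VOCAB)
```

In a full model, the label ids and positions would presumably feed learned embeddings before entering the decoder; the sketch stops at the index level.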
This paper proposes a comprehensive design scheme for the extremum seeking control (ESC) of the unmanned aerial vehicle (UAV) close formation *** proposed design scheme combines a Newton-Raphson method with an extended Kalman filter (EKF) to dynamically estimate the optimal position of the following UAV relative to the leading *** reflect the wake vortex effects reliably, the drag coefficient induced by the wake vortex is considered as a performance ***, the performance function is parameterized by the first-order and second-order terms of its Taylor series *** the excellent performance of nonlinear estimation, the EKF is used to estimate the gradient and the Hessian matrix of the parameterized performance *** output feedback of the proposed scheme is determined by iterative calculation of the Newton-Raphson *** with the traditional ESC and the classic ESC, the proposed design scheme avoids the slow continuous-time integration of the *** allows a faster convergence of relative position ***, the proposed method can provide a smoother command during the seeking process as the second-order term of the performance function is taken into *** convergence analysis of the proposed design scheme is accomplished by showing that the output feedback is a supermartingale *** improve estimation performance of the EKF, an improved pigeon-inspired optimization (IPIO) is proposed to automatically tune the noise covariance *** Carlo simulations for a three-UAV close formation show that the proposed design scheme is robust to the initial position of the following UAV.
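The Newton-Raphson update mentioned above can be sketched in its textbook form, x_new = x - H^{-1} g, for a two-dimensional relative position. This is only an illustration under stated assumptions: the gradient `grad` and Hessian `hess` are computed analytically from a made-up quadratic performance function, whereas the abstract describes estimating them with an EKF, and the names `newton_step` and `toy_grad` are hypothetical.

```python
from typing import Tuple

Vec = Tuple[float, float]
Mat = Tuple[Tuple[float, float], Tuple[float, float]]


def newton_step(x: Vec, grad: Vec, hess: Mat) -> Vec:
    """One Newton-Raphson update x - H^{-1} g for a 2x2 Hessian."""
    (a, b), (c, d) = hess
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("Hessian is (near) singular; Newton step undefined")
    # Explicit 2x2 inverse applied to the gradient: H^{-1} g.
    step_x = (d * grad[0] - b * grad[1]) / det
    step_y = (-c * grad[0] + a * grad[1]) / det
    return (x[0] - step_x, x[1] - step_y)


def toy_grad(x: Vec) -> Vec:
    """Gradient of the assumed toy function J(x) = (x0 - 1)^2 + 2 (x1 + 0.5)^2."""
    return (2.0 * (x[0] - 1.0), 4.0 * (x[1] + 0.5))


# Constant Hessian of the toy quadratic J.
TOY_HESS: Mat = ((2.0, 0.0), (0.0, 4.0))

start: Vec = (3.0, 2.0)
updated = newton_step(start, toy_grad(start), TOY_HESS)
```

Because the toy function is exactly quadratic, a single step lands on its minimizer at (1, -0.5). With noisy estimates of the gradient and Hessian, the update would instead be iterated.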
When searching for a dynamic target in an unknown real-world scene, search efficiency is greatly reduced if users lack information about the spatial structure of the *** target search studies, especially in robotics, focus on determining either the shortest path when the target’s position is known, or a strategy to find the target as quickly as possible when the target’s position is ***, the target’s position is often known intermittently in the real world, e.g., in the case of using surveillance *** goal is to help users find a dynamic target efficiently in the real world when the target’s position is intermittently *** order to achieve this purpose, we have designed an AR guidance assistance system to provide optimal current directional guidance to users, based on searching a prediction *** assume that a certain number of depth cameras are fixed in a real scene to obtain the dynamic target’s *** system automatically analyzes all possible meetings between the user and the target, and generates optimal directional guidance to help the user catch up with the target. A user study was used to evaluate our method, and its results showed that compared to free search and a top-view method, our method significantly improves target search efficiency.
Background Three-dimensional (3D) shape representation using mesh data is essential in various applications, such as virtual reality and simulation *** methods for extracting features from mesh edges or faces struggle with complex 3D models because edge-based approaches miss global contexts and face-based methods overlook variations in adjacent areas, which affects the overall *** address these issues, we propose the Feature Discrimination and Context Propagation Network (FDCPNet), which is a novel approach that synergistically integrates local and global features in mesh *** FDCPNet is composed of two modules: (1) the Feature Discrimination Module, which employs an attention mechanism to enhance the identification of key local features, and (2) the Context Propagation Module, which enriches key local features by integrating global contextual information, thereby facilitating a more detailed and comprehensive representation of crucial areas within the mesh *** Experiments on popular datasets validated the effectiveness of FDCPNet, showing an improvement in the classification accuracy over the baseline ***, even with reduced mesh face numbers and limited training data, FDCPNet achieved promising results, demonstrating its robustness in scenarios of variable complexity.
The collective behaviors of animals, from schooling fish to wolves hunting in packs and flocking birds, display plenty of fascinating phenomena that result from simple interaction rules among *** emergent intelligent properties of animal collective behaviors, such as self-organization, robustness, adaptability and expansibility, have inspired the design of autonomous unmanned swarm *** article reviews several typical natural collective behaviors, introduces the origin and connotation of swarm intelligence, and presents application cases of animal collective *** this basis, the article focuses on the forefront of progress and bionic achievements of aerial, ground and marine robotic swarms, illustrating the mapping relationship from biological cooperative mechanisms to cooperative unmanned cluster ***, considering the significance of the coexisting-cooperative-cognitive human-machine system, the key technologies to be solved are given as reference directions for subsequent exploration.
Background Mixed reality (MR) video fusion systems merge video imagery with 3D scenes to make the scene more realistic and help users understand the video content and temporal-spatial correlation between them, reducing the user's cognitive *** video fusion are used in various applications; however, video fusion systems require powerful client machines because video streaming delivery, stitching, and rendering are computationally ***, huge bandwidth usage is another critical factor that affects the scalability of video fusion *** Our framework proposes a fusion method for dynamically projecting video images into 3D models as *** Several experiments on different metrics demonstrate the effectiveness of the proposed *** The framework proposed in this study can overcome client limitations by utilizing remote ***, the framework we built is based on ***, the user can test the MR video fusion system with a laptop or tablet without installing any additional plug-ins or application programs.
Drogue detection is one of the challenging tasks in autonomous aerial refueling due to the requirement for accuracy and *** detection based on image intrinsic cues can achieve fast detection, but with poor *** studies reveal that optimization-based methods provide accurate and quick solutions for saliency *** paper presents a hybrid pigeon-inspired optimization method, the optimized color opponent, that aims to adjust the weight of color opponent channels to detect the drogue *** can optimize the weights in the selected aerial refueling scene offline, and the results are applied for drogue detection in the scene. A novel algorithm aggregated by the optimized color opponent and robust background detection is presented to provide better precision and *** results on benchmark datasets and aerial refueling images show that the proposed method successfully extracts the saliency region or drogue and exhibits superior performance against the other saliency detection methods with intrinsic *** algorithm designed in this paper is competent for the drogue detection task of autonomous aerial refueling.
Group selection in virtual reality is an important means of multi-object selection, which allows users to quickly group multiple objects and can significantly improve the operation efficiency of multiple types of obje...
Image and video stitching have made tremendous progress in the construction of wide field-of-view (FOV) imagery. However, some long-term challenges still exist, including wide baselines between cameras, large parallaxes, and low texture in overlapping areas. The augmented virtual environment (AVE) captures videos as live textures of 3D models in a virtual environment, and provides another 3D solution to overcome the aforementioned challenges. Existing AVE methods primarily follow from video projection, and cannot produce satisfactory stitching results compared with image stitching. In this paper, we propose a novel model-guided 3D stitching algorithm for AVE. The algorithm recovers an approximate 3D model for each video stream and optimizes the warping of the models to meet the requirements of feature point matching of the 3D models from adjacent videos. Experimental results show that, compared with previous state-of-the-art methods, our method significantly improves the stitching quality.
This paper investigates a multiplayer reach-avoid differential game in 3-dimensional (3D) space, which involves multiple pursuers, multiple evaders, and a designated target *** evaders aim to reach the target region, while the pursuers attempt to guard the target region by capturing the *** class of research holds significant practical ***, the complexity of the problem escalates substantially with the growing number of players, rendering its solution extremely *** this paper, the multiplayer game is divided into many subgames considering the cooperation among pursuers, reducing the computational burden, and obtaining numerically tractable strategies for ***, the Apollonius sphere, a fundamental geometric tool for analyzing the 3D differential game, is formulated, and its properties are *** on this, the optimal interception point for the pursuer to capture the evader is derived and the winning conditions for the pursuer and evader are ***, based on the Apollonius sphere, the optimal state feedback strategies of players are designed, and simultaneously, the optimal one-to-one pairings are ***, the value function of the multiplayer reach-avoid differential game is explicitly given and is proved to satisfy Hamilton-Jacobi-Isaacs (HJI) ***, the matching algorithm for the case where pursuers outnumber evaders is provided through constructing a weighted bipartite graph, and the cooperative tactics for multiple pursuers are proposed, inspired by the Harris' Hawks intelligent cooperative hunting ***, numerical simulations are conducted to illustrate the effectiveness of the theoretical results for both cases where the number of adversary players is equal and unequal between the two groups.
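For readers unfamiliar with the Apollonius sphere, the standard geometric definition can be sketched as follows. The sketch assumes constant speeds and an evader slower than the pursuer (speed ratio `gamma` = evader speed / pursuer speed, with 0 < gamma < 1). The sphere is then the set of points X with |X - E| = gamma * |X - P|, that is, points both players can reach at the same time. The function name and the example positions are hypothetical, and the paper's own formulation may differ.

```python
import math
from typing import Tuple

Point = Tuple[float, float, float]


def apollonius_sphere(pursuer: Point, evader: Point, gamma: float) -> Tuple[Point, float]:
    """Return (center, radius) of the set {X : |X - E| = gamma * |X - P|}.

    gamma is the assumed evader-to-pursuer speed ratio, required to satisfy
    0 < gamma < 1 so that the locus is a bounded sphere around the evader side.
    """
    if not 0.0 < gamma < 1.0:
        raise ValueError("gamma must lie strictly between 0 and 1")
    k = 1.0 - gamma ** 2
    # Center: (E - gamma^2 * P) / (1 - gamma^2), computed per coordinate.
    center = tuple((e - gamma ** 2 * p) / k for p, e in zip(pursuer, evader))
    # Radius: gamma * |E - P| / (1 - gamma^2).
    radius = gamma * math.dist(pursuer, evader) / k
    return center, radius


# Example values chosen for illustration only.
P: Point = (0.0, 0.0, 0.0)
E: Point = (3.0, 0.0, 0.0)
center, radius = apollonius_sphere(P, E, 0.5)
```

For these example values the center is (4, 0, 0) and the radius is 2. The point (2, 0, 0) lies on the sphere, at distance 1 from the evader and 2 from the pursuer, which matches the ratio of 0.5.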