Recently, there have been several attempts to apply Transformers to 3D point cloud classification. To reduce computation, most existing methods focus on local spatial attention, but they ignore point content and fail to establish relationships between distant but relevant points. To overcome this limitation of local spatial attention, we propose a point content-based Transformer architecture, called PointConT for short. It exploits the locality of points in the feature space (content-based): it clusters the sampled points with similar features into the same class and computes self-attention within each class, thus enabling an effective trade-off between capturing long-range dependencies and computational complexity. We further introduce an inception feature aggregator for point cloud classification, which uses parallel structures to aggregate high-frequency and low-frequency information in each branch separately. Extensive experiments show that our PointConT model achieves remarkable performance on point cloud shape classification. In particular, our method achieves 90.3% Top-1 accuracy on the hardest setting of ScanObjectNN. The source code of this paper is available at https://***/yahuiliu99/PointConT.
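The content-based attention described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes plain k-means as the clustering step and omits learned query/key/value projections (features serve as all three), and the function names `kmeans_labels` and `clustered_self_attention` are hypothetical. It only shows the idea of restricting self-attention to points that are close in feature space, so the cost scales with the sum of squared cluster sizes rather than with the square of the total point count.

```python
import numpy as np


def kmeans_labels(feats, k, iters=10, seed=0):
    """Assign each point to one of k clusters in feature space (plain k-means, an assumption)."""
    rng = np.random.default_rng(seed)
    # Initialise centres from k distinct points (fancy indexing returns a copy).
    centers = feats[rng.choice(len(feats), size=k, replace=False)]
    for _ in range(iters):
        # Squared Euclidean distance from every point to every centre.
        dists = ((feats[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        labels = dists.argmin(axis=1)
        for c in range(k):
            members = feats[labels == c]
            if len(members) > 0:  # keep the old centre if a cluster is empty
                centers[c] = members.mean(axis=0)
    return labels


def clustered_self_attention(feats, labels):
    """Scaled dot-product self-attention computed separately inside each cluster."""
    out = np.empty_like(feats)
    scale = 1.0 / np.sqrt(feats.shape[1])
    for c in np.unique(labels):
        idx = np.where(labels == c)[0]
        x = feats[idx]                                # (n_c, d) points of this cluster
        scores = (x @ x.T) * scale                    # (n_c, n_c) similarities
        scores -= scores.max(axis=1, keepdims=True)   # numerical stability for softmax
        weights = np.exp(scores)
        weights /= weights.sum(axis=1, keepdims=True)
        out[idx] = weights @ x                        # weighted average of cluster members
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    point_feats = rng.normal(size=(64, 8))            # 64 sampled points, 8-dim features
    labels = kmeans_labels(point_feats, k=4)
    attended = clustered_self_attention(point_feats, labels)
    print(attended.shape)
```

Two points that are far apart spatially but similar in feature space can fall into the same cluster and attend to each other, which is the long-range behaviour the abstract refers to.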
In this paper, we propose a pylon reconstruction method based on Neural Radiance Fields (NeRF) technology. The advantage of this method lies in its ability to reconstruct pylons from images, with a multi-view model ma...
Dear Editor, This letter is concerned with visual perception closely related to heterogeneous *** the huge challenge brought by different image modalities, we propose a visual perception framework based on heterogeneous image knowledge, i.e., the domain knowledge associated with specific vision tasks, to better address the corresponding visual perception problems.
Fall events have unique dynamic features, which are not fully utilized by existing fall detection methods. Based on video understanding, we propose Fall-LSTM to learn such features pertinently without additional input...
In this paper, we propose a dual-polarization, dual-mode 3 dB beam splitter. By utilizing a shallow-etched multimode interference (MMI) coupler, the proposed device can handle TE0, TE1, TM0 and TM1 modes simultaneousl...
Reconstructing interacting hands from monocular color images plays an important role in promoting the understanding of human behavior and the application of existing AR/VR technology. With the emergence of interacting...
We propose and demonstrate a polarization-independent dual-mode spot size converter (SSC) on a silicon integrated platform. By utilizing gradual index distributed subwavelength gratings (GRIN-SWG), the proposed device c...
This paper investigates the cooperative output regulation problem of heterogeneous linear multi-agent systems over directed graphs with the constraint of communication *** that there exists an exosystem whose state information is not available to all agents, the authors develop distributed adaptive event-triggered observers for the followers based on relative information between neighboring *** should be pointed out that two kinds of time-varying gains are introduced to avoid relying on any global information associated with the network, and dynamic triggering conditions are designed to get rid of continuous *** the basis of the designed observers, the authors devise a local controller for each *** with the existing related works, the main contribution of the current paper is that the cooperative output regulation problem for general directed graphs is solved while requiring neither global information nor continuous communications.
Coherent imaging systems have been applied to the detection of targets of interest, natural resource exploration, ailment diagnosis, etc. However, they easily produce speckle-degraded images due to the coherent inte...