Recently, Transformer has achieved great success in computer vision. However, it is constrained because the spatial and temporal complexity grows quadratically with the number of large points in 3D object detection ...
Recently, Transformer has achieved great success in computer vision. However, it is constrained because the spatial and temporal complexity grows quadratically with the number of large points in 3D object detection applications from point clouds. Previous point-wise methods are suffering from time consumption and limited receptive fields to capture information among points. To address these limitations, we propose the cosh-attention, which reduces the computation complexity of space and time from the quadratic order to linear order with respect to the number of points. In the cosh-attention, the traditional softmax operator is replaced by non-negative ReLU activation and hyperbolic-cosine-based operator with re-weighting mechanism. Then based on the key component, cosh-attention, we present a two-stage hyperbolic cosine transformer (ChTR3D) for 3D object detection from point clouds. It refines proposals by applying cosh-attention in linear computation complexity to encode rich contextual relationships among points. Extensive experiments on the widely used KITTI dataset and Waymo Open Dataset demonstrate that, compared with vanilla attention, the cosh-attention significantly improves the inference speed with competitive performance. Experiment results show that, among two-stage state-of-the-art methods using point-level features to refine proposals, the proposed ChTR3D is the fastest one.
In this paper, a novel method combining orthogonal polarization laser self-mixing interference is proposed. The method utilizes a rotating cuvette to measure the refractive index of liquids at different concentration...
In this paper, a novel method combining orthogonal polarization laser self-mixing interference is proposed. The method utilizes a rotating cuvette to measure the refractive index of liquids at different concentrations. The cuvette is filled with the liquid to be measured and rotated by a certain angle. The change in the number of interference fringes, caused by comparing an empty cuvette with a liquid-filled cuvette, is used to calculate the refractive index of the liquid. A four-fold logic subdivision algorithm is then used to improve measurement resolution. The experimental results show that for pure water and different NaCl and glucose solutions concentrations, the average relative errors are 0.47%, 0.59%, and 2.17%, respectively, with the maximum relative error within ±2.54%. The standard deviation of all solutions is less than 3.4%.
In order to identify the tilt direction of the self-mixing interference (SMI) signals under weak feedback regime interfered by noise, a deep learning method is proposed. The one-dimensional U-Net (1D U-Net) neural ne...
In order to identify the tilt direction of the self-mixing interference (SMI) signals under weak feedback regime interfered by noise, a deep learning method is proposed. The one-dimensional U-Net (1D U-Net) neural network can discriminate the direction of self-mixing fringes accurately and quickly. For experimental SMI signals, the 1D U-Net can be used for discriminant direction after a one-step normalization. Simulation and experimental results show that the proposed method is suitable for SMI signals with noise within the whole weak feedback regime, and can maintain a high discrimination accuracy for signals interfered by 5dB noise. Combined with fringe counting method, accurate and rapid SMI signal displacement reconstruction can be realized.
Target tracking is a fundamental task and an important part in the field of computer vision. The image information contained in the visible (RGB) modality and thermal infrared (T) modality has differences. RGBT tracki...
详细信息
Target detection is a significant research direction in the realm of computer vision, aimed at analyzing and localizing targets in input images to obtain their categories and locations. However, this task is faced wit...
详细信息
The purpose of point cloud completion is designed to high-precision complete the complete shape from local observations. However, the previous approaches have tended to focus on a good perception of local details and ...
详细信息
Combing with the generalized Hamiltonian system theory,by introducing a special form of sinusoidal function,a class of n-dimensional(n=1,2,3)controllab.e multi-scroll conservative chaos with complicated dynamics is **...
详细信息
Combing with the generalized Hamiltonian system theory,by introducing a special form of sinusoidal function,a class of n-dimensional(n=1,2,3)controllab.e multi-scroll conservative chaos with complicated dynamics is *** dynamics characteristics including bifurcation behavior and coexistence of the system are analyzed in detail,the latter reveals abundant coexisting ***,the proposed system passes the NIST tests and has been implemented physically by *** to the multi-scroll dissipative chaos,the experimental portraits of the proposed system show better ergodicity,which have potential application value in secure communication and image encryption.
Skeleton-based human action recognition has become a popular topic among researchers. This is because using skeletal data provides a robust solution to problems encountered in complex environments, such as changes in ...
详细信息
Face anti-spoofing is to prevent face images with attack properties from entering face recognition and causing confusion or spoofing of the face recognition function. Since most face presentation attacks do not posses...
详细信息
In this paper, facing the aerial transportation task with a large volume cargo, the system of quadrotor with a cable-suspended rigid payload is investigated. The dynamic model of proposed system is built with Lagrange...
详细信息
暂无评论