To address nonlinearity, strong coupling, and disturbances in fighter aircraft attitude control, this paper proposes an intelligent control method based on multi-agent deep deterministic policy gradient (MADDPG) and l...
详细信息
Recently, Transformer has achieved great success in computer vision. However, it is constrained because the spatial and temporal complexity grows quadratically with the number of large points in 3D object detection ...
Recently, Transformer has achieved great success in computer vision. However, it is constrained because the spatial and temporal complexity grows quadratically with the number of large points in 3D object detection applications from point clouds. Previous point-wise methods are suffering from time consumption and limited receptive fields to capture information among points. To address these limitations, we propose the cosh-attention, which reduces the computation complexity of space and time from the quadratic order to linear order with respect to the number of points. In the cosh-attention, the traditional softmax operator is replaced by non-negative ReLU activation and hyperbolic-cosine-based operator with re-weighting mechanism. Then based on the key component, cosh-attention, we present a two-stage hyperbolic cosine transformer (ChTR3D) for 3D object detection from point clouds. It refines proposals by applying cosh-attention in linear computation complexity to encode rich contextual relationships among points. Extensive experiments on the widely used KITTI dataset and Waymo Open Dataset demonstrate that, compared with vanilla attention, the cosh-attention significantly improves the inference speed with competitive performance. Experiment results show that, among two-stage state-of-the-art methods using point-level features to refine proposals, the proposed ChTR3D is the fastest one.
In this paper, a novel method combining orthogonal polarization laser self-mixing interference is proposed. The method utilizes a rotating cuvette to measure the refractive index of liquids at different concentration...
In this paper, a novel method combining orthogonal polarization laser self-mixing interference is proposed. The method utilizes a rotating cuvette to measure the refractive index of liquids at different concentrations. The cuvette is filled with the liquid to be measured and rotated by a certain angle. The change in the number of interference fringes, caused by comparing an empty cuvette with a liquid-filled cuvette, is used to calculate the refractive index of the liquid. A four-fold logic subdivision algorithm is then used to improve measurement resolution. The experimental results show that for pure water and different NaCl and glucose solutions concentrations, the average relative errors are 0.47%, 0.59%, and 2.17%, respectively, with the maximum relative error within ±2.54%. The standard deviation of all solutions is less than 3.4%.
In order to identify the tilt direction of the self-mixing interference (SMI) signals under weak feedback regime interfered by noise, a deep learning method is proposed. The one-dimensional U-Net (1D U-Net) neural ne...
In order to identify the tilt direction of the self-mixing interference (SMI) signals under weak feedback regime interfered by noise, a deep learning method is proposed. The one-dimensional U-Net (1D U-Net) neural network can discriminate the direction of self-mixing fringes accurately and quickly. For experimental SMI signals, the 1D U-Net can be used for discriminant direction after a one-step normalization. Simulation and experimental results show that the proposed method is suitable for SMI signals with noise within the whole weak feedback regime, and can maintain a high discrimination accuracy for signals interfered by 5dB noise. Combined with fringe counting method, accurate and rapid SMI signal displacement reconstruction can be realized.
Image coloring is an inherently uncertain and multimodal problem. By inputting a grayscale image into a coloring network, visually plausible colored photos can be generated. Conventional methods primarily rely on sem...
Image coloring is an inherently uncertain and multimodal problem. By inputting a grayscale image into a coloring network, visually plausible colored photos can be generated. Conventional methods primarily rely on semantic information for image colorization. Although effective in coloring images with clear semantic information, these methods still suffer from color contamination and semantic confusion. This is largely due to the limited capacity of convolutional neural networks to effectively learn deep semantic information inherent in *** this paper, we propose a network structure that addresses these limitations by leveraging multi-level semantic information classification and fusion. Additionally, we introduce a global semantic fusion network to combat the issues of color contamination. The proposed coloring encoder accurately extracts object-level semantic information from *** further enhance visual plausibility, we employ a self-supervised adversarial training method. We train the network structure on various datasets with varying amounts of data and evaluate its performance using the ImageNet validation set and COCO validation set. Experimental results demonstrate that our proposed RepColor can generate more realistic images compared to previous approaches, showcasing its high generalization ability.
A lightweight road-assistant detection algorithm, EBD-YOLO, based on YOLOv5s is proposed to address the problems of high model complexity, computation cost, and difficulty in deployment on resource-limited embedde...
A lightweight road-assistant detection algorithm, EBD-YOLO, based on YOLOv5s is proposed to address the problems of high model complexity, computation cost, and difficulty in deployment on resource-limited embedded terminals in existing assisted driving detection algorithms. First, lightweight Transformer model EfficientViT was used as the backbone feature extraction network of YOLOv5s model to reduce network parameters and calculation costs. Secondly, a Focal-GIoU Loss function is proposed for bounding box regression to accelerate model convergence and reduce loss. Thirdly, the feature pyramid structure is improved to a weighted bi-directional feature pyramid network (BiFPN) to enhance localization and semantic features. Then, a dynamic head framework is added to unify the attention mechanism with the object detection head to improve its performance. Finally, a Soft-CIoU_NMS algorithm is proposed in the post-processing stage to enhance occluded targets' localization and detection ability and reduce the missed detection rate. We conducted experiments on the KITTI and BDD100K datasets for autonomous driving, and the results showed that the EBD-YOLO model reduced in size by 38.4% and 37.2%, respectively. In comparison, the computational cost was reduced by 48.1%. As measured by mAP@0.5, the detection accuracy improved by 0.5% and 5.8%, respectively, and mAP@0.5:0.95 improved by 2.8% and 7%, respectively. These improvements satisfied the requirements for deployment on embedded terminals in cars.
Deep learning for point clouds faces the challenge of their inherent unordered nature, which makes traditional CNN-like methods not directly applicable. However, due to the inherent permutation invariance, transfor...
Deep learning for point clouds faces the challenge of their inherent unordered nature, which makes traditional CNN-like methods not directly applicable. However, due to the inherent permutation invariance, transformers provide solutions to unordered points problems faced in LiDAR-based object detection. In this paper, a two-stage LiDAR 3D object detection framework is presented, namely Point-Voxel Dual Transformer (PV-DT3D), which is a transformer-based method. In the proposed PV-DT3D, point-voxel fusion features are used for proposal refinement. Specifically, in the PV-DT3D, keypoints are sampled from entire point cloud scene and used to encode representative scene features via a proposal-aware voxel set abstraction module. Subsequently, following the generation of proposals by the region proposal networks (RPN), the internal encoded keypoints are fed into dual transformer encoder-decoder architecture. In 3D object detection, the proposed PV-DT3D is the first to take advantage of both pointwise transformer and channel-wise architecture to capture contextual information from the perspective of spatial and channel dimensions. Experiments conducted on the highly competitive KITTI 3D car detection leaderboard show that, the PV-DT3D achieves superior detection accuracy among state-of-the-art point-voxel-based methods.
In the field of neuroeconomics, the assessment of cognitive workload is a crucial issue with significant implications for real-world applications. Previous research has made progress in task-based germane cognitive lo...
详细信息
In this paper, a dual-beam photothermal self-mixing substance trace detection system is proposed. The crystal violet (CV) solution of the sample undergoes a thermal lens effect under the action of pump photoperiod exc...
详细信息
In this paper, a dual-beam photothermal self-mixing substance trace detection system is proposed. The crystal violet (CV) solution of the sample undergoes a thermal lens effect under the action of pump photoperiod excitation, and the refractive index changes cause the deviation of the probe beam, and the self-mixing interference (SMI) effect occurs when the modulated light returns to the cavity. By combining theoretical and experimental results, the power fluctuation of the SMI signal caused by the thermo-optic effect is linearly related to CV concentration and other parameters. At an excitation power of 24 mW, the detection limit can reach $8.0 \times {10^{- 8}}\;{\rm mol/L}$ . This method does not require labeling or complex CV pretreatment and has high sensitivity and flexibility, providing a guide for CV characterization in biological, environmental, and pharmaceutical research.
作者:
Yang, JianxiongRan, LiChe, YanboDu, MingxingTianjin University of Technology
Tianjin Key Laboratory of Control Theory and Applications in Complicated System Tianjin300384 China Chongqing University
State Key Laboratory of Power Transmission Equipment and System Security and New Technology School of Electrical Engineering Chongqing400044 China University of Warwick
School of Engineering CoventryCV4 7AL United Kingdom Tianjin University
Key Laboratory of Smart Grid of Ministry of Education School of Electrical and Information Engineering Tianjin300072 China
Parallel configurations of power chips within a multichip IGBT power module (M-IGBT-PM) are employed to meet the required current and power ratings. When the internal chips of an M-IGBT-PM experience uneven temperatur...
详细信息
暂无评论