Modeling cellular dynamics from single-cell RNA sequencing (scRNA-seq) data is critical for understanding cell development and underlying gene regulatory relationships. Many current methods rely on single-cell velocit...
详细信息
Few-shot learning (FSL) as a data-scarce method, aims to recognize instances of unseen classes solely based on very few examples. However, the model can easily become overfitted due to the biased distribution formed w...
详细信息
Monocular depth estimation is a fundamental task in computer vision and has drawn increasing attention. Recently, attention-based models and encoder-decoder architectures have led to great improvements in monocular de...
Monocular depth estimation is a fundamental task in computer vision and has drawn increasing attention. Recently, attention-based models and encoder-decoder architectures have led to great improvements in monocular depth estimation. Typically, most of the previous methods used repeated simple up-sampling operations during decoding, which may not make full use of the potential properties of the features extracted by the encoder, and there are problems of inaccurate prediction of the edge and depth maximum region. We propose an attention-based feature fusion module for encoder and decoder. We treat the monocular depth estimation as a pixel-level optimization problem, where the coarsest encoder feature is used to initialize the pixel-level optimization, which is then refined to higher resolution by the proposed attentional feature fusion (AFF). We formulate the prediction problem as ordinal regression over the bin centers that discretize the continuous depth range. It predicts a correspondingly different distribution of bins based on different pictures and we predict bins at the coarsest level using global pooling and MLP layers. In the NYUV2 dataset, the proposed architecture improving original model by 2.5.% and 1.1%, in terms of Log10 and Absolute relative error, respectively.
作者:
Guo, KuoLi, YifanChen, HaoShen, Hong-BinYang, YangShanghai Jiao Tong University
Key Lab. of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering Department of Computer Science and Engineering Shanghai200240 China Shanghai Jiao Tong University
Key Laboratory of System Control and Information Processing Ministry of Education of China Institute of Image Processing and Pattern Recognition Shanghai200240 China Carnegie Mellon University
School of Computer Science Computational Biology Department PittsburghPA15213 United States
Isoforms refer to different mRNA molecules transcribed from the same gene, which can be translated into proteins with varying structures and functions. Predicting the functions of isoforms is an essential topic in bio...
详细信息
Fine-grained 3D shape classification (FGSC) has garnered significant attention recently and has made notable advancements. However, due to high inter-class similarity and intra-class diversity, it is still a challenge...
详细信息
Interactive medical image segmentation methods have become increasingly popular in recent years. These methods combine manual labeling and automatic segmentation, reducing the workload of annotation while maintaining ...
详细信息
Generalized eigenvalue problem (GEP) plays a significant role in signal processing and machine learning. This paper proposes a consensus-based distributed algorithm for GEP in multi-agent systems, where data samples a...
详细信息
Passenger flow prediction is vitally significant for intelligent transportation systems (ITS). Most of the studies typically focus on the passenger flow prediction for an individual station, and only capture the tempo...
Passenger flow prediction is vitally significant for intelligent transportation systems (ITS). Most of the studies typically focus on the passenger flow prediction for an individual station, and only capture the temporal features without considering any spatial features. Constructing a passenger flow prediction model for multiple stations, or even a whole network, is more valuable for practical applications. Therefore, we develop a dynamic spatio-temporal network (DSTNet) with a self-attention (SA) mechanism for multi-station passenger flow prediction. A dynamic graph convolutional network (DGCN) is applied for the spatial feature extraction, and gated recurrent unit (GRU) is combined to learn the temporal features. SA is applied to further assign the weights for the extracted spatio-temporal features. The Experiment has been conducted on the passenger flow in the Xiamen bus rapid transit (BRT). The results demonstrate that the proposed DSTNet with SA (SA-DSTNet) outperforms the baselines in the multi-station passenger flow prediction task.
The wide application of deep neural networks (DNNs) demands an increasing amount of attention to their real-world robustness, i.e., whether a DNN resists black-box adversarial attacks, among which score-based query at...
详细信息
This paper studies a strike path strategy for quadcopter drones targeting ground maneuvering targets. The strategy sets the strike path to two different strike speeds, which improves the stability and robustness of qu...
详细信息
ISBN:
(数字)9798331506100
ISBN:
(纸本)9798331506117
This paper studies a strike path strategy for quadcopter drones targeting ground maneuvering targets. The strategy sets the strike path to two different strike speeds, which improves the stability and robustness of quadcopter drones while shortening strike time and increasing hit rates. Consider the attitude control of quadcopter unmanned aerial vehicles during motion, and verify the flight reliability of the mechanism through simulation experiments. Set different slope strike paths and obtain the optimal strike path through experiments, while proving the effectiveness of this strike strategy in engineering applications.
暂无评论