Remote sensing imagery,due to its high altitude,presents inherent challenges characterized by multiple scales,limited target areas,and intricate *** inherent traits often lead to increased miss and false detection rat...
详细信息
Remote sensing imagery,due to its high altitude,presents inherent challenges characterized by multiple scales,limited target areas,and intricate *** inherent traits often lead to increased miss and false detection rates when applying object recognition algorithms tailored for remote sensing ***,these complexities contribute to inaccuracies in target localization and hinder precise target *** paper addresses these challenges by proposing a solution:The YOLO-MFD model(YOLO-MFD:Remote Sensing Image Object Detection withMulti-scale Fusion Dynamic Head).Before presenting our method,we delve into the prevalent issues faced in remote sensing imagery ***,we emphasize the struggles of existing object recognition algorithms in comprehensively capturing critical image features amidst varying scales and complex *** resolve these issues,we introduce a novel ***,we propose the implementation of a lightweight multi-scale module called *** module significantly improves the model’s ability to comprehensively capture important image features by merging multi-scale feature *** effectively addresses the issues of missed detection and mistaken alarms that are common in remote sensing ***,an additional layer of small target detection heads is added,and a residual link is established with the higher-level feature extraction module in the backbone *** allows the model to incorporate shallower information,significantly improving the accuracy of target localization in remotely sensed ***,a dynamic head attentionmechanism is *** allows themodel to exhibit greater flexibility and accuracy in recognizing shapes and targets of different ***,the precision of object detection is significantly *** trial results show that the YOLO-MFD model shows improvements of 6.3%,3.5%,and 2.5%over the original YOLOv8 model in Precision,map@0.5 a
Lithium-sulfur batteries(LSBs)are considered promising candidates for next-generation battery technologies owing to their outstanding theoretical energy density and ***,the low conductivity and polysulfide shuttling e...
详细信息
Lithium-sulfur batteries(LSBs)are considered promising candidates for next-generation battery technologies owing to their outstanding theoretical energy density and ***,the low conductivity and polysulfide shuttling effect of S cathodes severely hamper the practical performance of ***,in situ-generated single layer MXene nanosheet/hierarchical porous carbonized wood fiber(MX/PCWF)composites are prepared via a nonhazardous eutectic activation strategy coupled with pyrolysis-induced gas *** unique architecture,wherein single layer MXene nanosheets are constructed on carbonized wood fiber walls,ensures rapid polysulfide conversion and continuous electron transfer for redox *** C-Ti-C bonds formed between MXene and PCWF can considerably expedite the conversion of polysulfides,effectively suppressing the shuttle *** impressive capacity of 1301.1 m A h g^(-1)at 0.5 C accompanied by remarkable stability is attained with the MX/PCWF host,as evidenced by the capacity maintenance of 722.6 m A h g^(-1)after 500 ***,the MX/PCWF/S cathode can still deliver a high capacity of 886.8 m A h g^(-1)at a high S loading of 5.6 mg cm^(-2).The construction of two-dimensional MXenes on natural wood fiber walls offers a competitive edge over S-based cathode materials and demonstrates a novel strategy for developing high-performance batteries.
In recent years,live streaming has become a popular application,which uses TCP as its primary transport *** UDP Internet Connections(QUIC)protocol opens up new opportunities for live ***,how to leverage QUIC to transm...
详细信息
In recent years,live streaming has become a popular application,which uses TCP as its primary transport *** UDP Internet Connections(QUIC)protocol opens up new opportunities for live ***,how to leverage QUIC to transmit live videos has not been studied *** paper first investigates the achievable quality of experience(QoE)of streaming live videos over TCP,QUIC,and their multipath extensions Multipath TCP(MPTCP)and Multipath QUIC(MPQUIC).We observe that MPQUIC achieves the best performance with bandwidth aggregation and transmission ***,network fluctuations may cause heterogeneous paths,high path loss,and band-width degradation,resulting in significant QoE *** by the above observations,we investigate the multipath packet scheduling problem in live streaming and design 4D-MAP,a multipath adaptive packet scheduling scheme over ***,a linear upper confidence bound(LinUCB)-based online learning algorithm,along with four novel scheduling mechanisms,i.e.,Dispatch,Duplicate,Discard,and Decompensate,is proposed to conquer the above problems.4D-MAP has been evaluated in both controlled emulation and real-world networks to make comparison with the state-of-the-art multipath transmission *** results reveal that 4D-MAP outperforms others in terms of improving the QoE of live streaming.
Drug-target interactions(DTIs) prediction plays an important role in the process of drug *** computational methods treat it as a binary prediction problem, determining whether there are connections between drugs and t...
详细信息
Drug-target interactions(DTIs) prediction plays an important role in the process of drug *** computational methods treat it as a binary prediction problem, determining whether there are connections between drugs and targets while ignoring relational types information. Considering the positive or negative effects of DTIs will facilitate the study on comprehensive mechanisms of multiple drugs on a common target, in this work, we model DTIs on signed heterogeneous networks, through categorizing interaction patterns of DTIs and additionally extracting interactions within drug pairs and target protein pairs. We propose signed heterogeneous graph neural networks(SHGNNs), further put forward an end-to-end framework for signed DTIs prediction, called SHGNN-DTI,which not only adapts to signed bipartite networks, but also could naturally incorporate auxiliary information from drug-drug interactions(DDIs) and protein-protein interactions(PPIs). For the framework, we solve the message passing and aggregation problem on signed DTI networks, and consider different training modes on the whole networks consisting of DTIs, DDIs and PPIs. Experiments are conducted on two datasets extracted from Drug Bank and related databases, under different settings of initial inputs, embedding dimensions and training modes. The prediction results show excellent performance in terms of metric indicators, and the feasibility is further verified by the case study with two drugs on breast cancer.
The Internet of Medical Things(IoMT)is an application of the Internet of Things(IoT)in the medical *** is a cutting-edge technique that connects medical sensors and their applications to healthcare systems,which is es...
详细信息
The Internet of Medical Things(IoMT)is an application of the Internet of Things(IoT)in the medical *** is a cutting-edge technique that connects medical sensors and their applications to healthcare systems,which is essential in smart ***,Personal Health Records(PHRs)are normally kept in public cloud servers controlled by IoMT service providers,so privacy and security incidents may be ***,Searchable Encryption(SE),which can be used to execute queries on encrypted data,can address the issue ***,most existing SE schemes cannot solve the vector dominance threshold *** response to this,we present a SE scheme called Vector Dominance with Threshold Searchable Encryption(VDTSE)in this *** use a Lagrangian polynomial technique and convert the vector dominance threshold problem into a constraint that the number of two equal-length vectors’corresponding bits excluding wildcards is not less than a threshold ***,we solve the problem using the proposed technique modified in Hidden Vector Encryption(HVE).This technique makes the trapdoor size linear to the number of attributes and thus much smaller than that of other similar SE schemes.A rigorous experimental analysis of a specific application for privacy-preserving diabetes demonstrates the feasibility of the proposed VDTSE scheme.
Unsupervised learning methods such as graph contrastive learning have been used for dynamic graph represen-tation learning to eliminate the dependence of ***,existing studies neglect positional information when learni...
详细信息
Unsupervised learning methods such as graph contrastive learning have been used for dynamic graph represen-tation learning to eliminate the dependence of ***,existing studies neglect positional information when learning discrete snapshots,resulting in insufficient network topology *** the same time,due to the lack of appropriate data augmentation methods,it is difficult to capture the evolving patterns of the network *** address the above problems,a position-aware and subgraph enhanced dynamic graph contrastive learning method is proposed for discrete-time dynamic ***,the global snapshot is built based on the historical snapshots to express the stable pattern of the dynamic graph,and the random walk is used to obtain the position representation by learning the positional information of the ***,a new data augmentation method is carried out from the perspectives of short-term changes and long-term stable structures of dynamic ***,subgraph sampling based on snapshots and global snapshots is used to obtain two structural augmentation views,and node structures and evolving patterns are learned by combining graph neural network,gated recurrent unit,and attention ***,the quality of node representation is improved by combining the contrastive learning between different structural augmentation views and between the two representations of structure and *** results on four real datasets show that the performance of the proposed method is better than the existing unsupervised methods,and it is more competitive than the supervised learning method under a semi-supervised setting.
The use of generative adversarial network(GAN)-based models for the conditional generation of image semantic segmentation has shown promising results in recent ***,there are still some limitations,including limited di...
详细信息
The use of generative adversarial network(GAN)-based models for the conditional generation of image semantic segmentation has shown promising results in recent ***,there are still some limitations,including limited diversity of image style,distortion of detailed texture,unbalanced color tone,and lengthy training *** address these issues,we propose an asymmetric pre-training and fine-tuning(APF)-GAN model.
Advancements in maritime satellite technology have significantly impacted the maritime industry, enhancing both communication and safety at sea. These technological improvements have enabled Automatic Identification S...
详细信息
The effectiveness of modeling contextual information has been empirically shown in numerous computer vision tasks. In this paper, we propose a simple yet efficient augmented fully convolutional network(AugFCN) by aggr...
详细信息
The effectiveness of modeling contextual information has been empirically shown in numerous computer vision tasks. In this paper, we propose a simple yet efficient augmented fully convolutional network(AugFCN) by aggregating content-and position-based object contexts for semantic ***, motivated because each deep feature map is a global, class-wise representation of the input,we first propose an augmented nonlocal interaction(AugNI) to aggregate the global content-based contexts through all feature map interactions. Compared to classical position-wise approaches, AugNI is more efficient. Moreover, to eliminate permutation equivariance and maintain translation equivariance, a learnable,relative position embedding branch is then supportably installed in AugNI to capture the global positionbased contexts. AugFCN is built on a fully convolutional network as the backbone by deploying AugNI before the segmentation head network. Experimental results on two challenging benchmarks verify that AugFCN can achieve a competitive 45.38% mIoU(standard mean intersection over union) and 81.9% mIoU on the ADE20K val set and Cityscapes test set, respectively, with little computational overhead. Additionally, the results of the joint implementation of AugNI and existing context modeling schemes show that AugFCN leads to continuous segmentation improvements in state-of-the-art context modeling. We finally achieve a top performance of 45.43% mIoU on the ADE20K val set and 83.0% mIoU on the Cityscapes test set.
Embodied visual exploration is critical for building intelligent visual agents. This paper presents the neural exploration with feature-based visual odometry and tracking-failure-reduction policy(Ne OR), a framework f...
详细信息
Embodied visual exploration is critical for building intelligent visual agents. This paper presents the neural exploration with feature-based visual odometry and tracking-failure-reduction policy(Ne OR), a framework for embodied visual exploration that possesses the efficient exploration capabilities of deep reinforcement learning(DRL)-based exploration policies and leverages feature-based visual odometry(VO) for more accurate mapping and positioning results. An improved local policy is also proposed to reduce tracking failures of feature-based VO in weakly textured scenes through a refined multi-discrete action space, keyframe fusion, and an auxiliary task. The experimental results demonstrate that Ne OR has better mapping and positioning accuracy compared to other entirely learning-based exploration frameworks and improves the robustness of feature-based VO by significantly reducing tracking failures in weakly textured scenes.
暂无评论