Human pose estimation is a fundamental yet challenging task in computervision. However, difficult scenarios such as invisible keypoints, occlusions and small-scale persons are still not well-handed. In this paper, we...
详细信息
Salient object detection(SOD)in RGB and depth images has attracted increasing research *** RGB-D SOD models usually adopt fusion strategies to learn a shared representation from RGB and depth modalities,while few meth...
详细信息
Salient object detection(SOD)in RGB and depth images has attracted increasing research *** RGB-D SOD models usually adopt fusion strategies to learn a shared representation from RGB and depth modalities,while few methods explicitly consider how to preserve modality-specific *** this study,we propose a novel framework,the specificity-preserving network(SPNet),which improves SOD performance by exploring both the shared information and modality-specific ***,we use two modality-specific networks and a shared learning network to generate individual and shared saliency prediction *** effectively fuse cross-modal features in the shared learning network,we propose a cross-enhanced integration module(CIM)and propagate the fused feature to the next layer to integrate cross-level ***,to capture rich complementary multi-modal information to boost SOD performance,we use a multi-modal feature aggregation(MFA)module to integrate the modalityspecific features from each individual decoder into the shared *** using skip connections between encoder and decoder layers,hierarchical features can be fully *** experiments demonstrate that our SPNet outperforms cutting-edge approaches on six popular RGB-D SOD and three camouflaged object detection *** project is publicly available at https://***/taozh2017/SPNet.
Broad, open source availability of large pretrained foundation models on the internet through platforms such as HuggingFace has taken the world of practical deep learning by storm. A classical pipeline for neural netw...
详细信息
Accurately identifying building distribution from remote sensing images with complex background information is challenging. The emergence of diffusion models has prompted the innovative idea of employing the reverse d...
详细信息
Accurately identifying building distribution from remote sensing images with complex background information is challenging. The emergence of diffusion models has prompted the innovative idea of employing the reverse denoising process to distill building distribution from these complex backgrounds. Building on this concept, we propose a novel framework, building extraction diffusion model(BEDiff), which meticulously refines the extraction of building footprints from remote sensing images in a stepwise fashion. Our approach begins with the design of booster guidance, a mechanism that extracts structural and semantic features from remote sensing images to serve as priors, thereby providing targeted guidance for the diffusion process. Additionally, we introduce a cross-feature fusion module(CFM) that bridges the semantic gap between different types of features, facilitating the integration of the attributes extracted by booster guidance into the diffusion process more effectively. Our proposed BEDiff marks the first application of diffusion models to the task of building extraction. Empirical evidence from extensive experiments on the Beijing building dataset demonstrates the superior performance of BEDiff, affirming its effectiveness and potential for enhancing the accuracy of building extraction in complex urban landscapes.
When designing solar systems and assessing the effectiveness of their many uses,estimating sun irradiance is a crucial first *** study examined three approaches(ANN,GA-ANN,and ANFIS)for estimating daily global solar r...
详细信息
When designing solar systems and assessing the effectiveness of their many uses,estimating sun irradiance is a crucial first *** study examined three approaches(ANN,GA-ANN,and ANFIS)for estimating daily global solar radiation(GSR)in the south of Algeria:Adrar,Ouargla,and *** proposed hybrid GA-ANN model,based on genetic algorithm-based optimization,was developed to improve the ANN *** GA-ANN and ANFIS models performed better than the standalone ANN-based model,with GA-ANN being better suited for forecasting in all sites,and it performed the best with the best values in the testing phase of Coefficient of Determination(R=0.9005),Mean Absolute Percentage Error(MAPE=8.40%),and Relative Root Mean Square Error(rRMSE=12.56%).Nevertheless,the ANFIS model outperformed the GA-ANN model in forecasting daily GSR,with the best values of indicators when testing the model being R=0.9374,MAPE=7.78%,and rRMSE=10.54%.Generally,we may conclude that the initial ANN stand-alone model performance when forecasting solar radiation has been improved,and the results obtained after injecting the genetic algorithm into the ANN to optimize its weights were *** model can be used to forecast daily GSR in dry climates and other climates and may also be helpful in selecting solar energy system installations and sizes.
Part of speech tagging is one of the most fundamental needs of intelligent text processing, which is assigning the most appropriate grammatical category to each word on the text. Hence, provision of a tagger with high...
详细信息
Prenatal depression,which can affect pregnant women’s physical and psychological health and cause postpartum depression,is increasing ***,it is essential to detect prenatal depression early and conduct an attribution...
详细信息
Prenatal depression,which can affect pregnant women’s physical and psychological health and cause postpartum depression,is increasing ***,it is essential to detect prenatal depression early and conduct an attribution *** studies have used questionnaires to screen for prenatal depression,but the existing methods lack *** diagnose the early signs of prenatal depression and identify the key factors that may lead to prenatal depression from questionnaires,we present the semantically enhanced option embedding(SEOE)model to represent questionnaire *** can quantitatively determine the relationship and patterns between options and *** first quantifies options and resorts them,gathering options with little difference,since Word2Vec is highly dependent on *** resort task is transformed into an optimization problem involving the traveling salesman ***,all questionnaire samples are used to train the options’vector using ***,an LSTM and GRU fused model incorporating the cycle learning rate is constructed to detect whether a pregnant woman is suffering from *** verify the model,we compare it with other deep learning and traditional machine learning *** experiment results show that our proposed model can accurately identify pregnant women with depression and reach an F1 score of *** most relevant factors of depression found by SEOE are also verified in the *** addition,our model is of low computational complexity and strong generalization,which can be widely applied to other questionnaire analyses of psychiatric disorders.
Aiming to solve the problems of large model size, unbalanced detection speed and detection accuracy of traditional target-detection algorithms in autonomous driving scenarios, a lightweight YOLO algorithm is proposed ...
详细信息
Modern power systems become more vulnerable to cyber threats due to their growing interconnectivity, interdependence, and complexity. Widespread deployment of distributed energy resources (DERs) further expands the th...
详细信息
Modern power systems become more vulnerable to cyber threats due to their growing interconnectivity, interdependence, and complexity. Widespread deployment of distributed energy resources (DERs) further expands the threat landscape to the grid edge, where fewer cybersecurity protections exist. In this article, a systematic cyber-physical events demonstration, enabled by an integrated transmission, distribution, and communication co-simulation framework, is performed. It analyzes cyber risks to power grid under DER-enabled automatic generation control from different angles. Unlike existing works, the simulation captures millisecond-to-minutes frequency and voltage transient dynamics at a cross-region system scale.
Online Social Networks(OSNs)are based on the sharing of different types of information and on various interactions(comments,reactions,and sharing).One of these important actions is the emotional reaction to the *** di...
详细信息
Online Social Networks(OSNs)are based on the sharing of different types of information and on various interactions(comments,reactions,and sharing).One of these important actions is the emotional reaction to the *** diversity of reaction types available on Facebook(namely FB)enables users to express their feelings,and its traceability creates and enriches the users’emotional identity in the virtual *** paper is based on the analysis of 119875012 FB reactions(Like,Love,Haha,Wow,Sad,Angry,Thankful,and Pride)made at multiple levels(publications,comments,and sub-comments)to study and classify the users’emotional behavior,visualize the distribution of different types of reactions,and analyze the gender impact on emotion *** of these can be achieved by addressing these research questions:who reacts the most?Which emotion is the most expressed?
暂无评论