Stroke-based rendering method has shown its superiority in generating stylized paintings from realistic photographs. However, the existing methods often divide the image into regular blocks for parallel painting or st...
详细信息
Stroke-based rendering method has shown its superiority in generating stylized paintings from realistic photographs. However, the existing methods often divide the image into regular blocks for parallel painting or start painting by progressively narrowing down the painting region from the entire canvas. Not only does this lead to an irrational allocation of stroke resources, but also deviates from the painting approach employed by human artists. To address this, we propose a novel painting method based on hierarchical reinforcement learning, namely HRL-Painter, which consists of a high-level agent that strategically plans the sequence of painting regions and a low-level agent that carries out specific painting tasks in the corresponding regions. In the initial stage, we consider the entire canvas as the painting region and then use a small number of strokes for a rough depiction. Next, our high-level agent plans the optimal sequence of painting regions based on the content of the target image, taking into account the error between the current canvas and the target image. Finally, the low-level agent is dedicated to executing detailed painting tasks within the painting regions proposed by the high-level agent. Extensive experiments on standard datasets including CelebA, ImageNet, CUB-200 Birds and Stanford Cars-196 demonstrate that our proposed hierarchical painting agent not only produce high-quality canvases but also exhibit a painting process that closely resembles the human painting style, showcasing excellent interpretability.
learning technology has promoted the rapid development of visual object tracking, among which algorithms based on twin networks are a hot research direction. Although this method has broad application prospects, its p...
详细信息
learning technology has promoted the rapid development of visual object tracking, among which algorithms based on twin networks are a hot research direction. Although this method has broad application prospects, its performance is often greatly reduced when encountering target occlusion or similar objects in the background. In response to this issue, a method is proposed to integrate channel and spatial dimension attention mechanisms into the backbone architecture of twin networks, to optimize the algorithm's recognition accuracy for tracking targets and its stability in changing environments. Then, a region recommendation network based on adaptive anchor box generation is adopted, combined with twin networks to enhance the network's modeling ability for complex situations. Finally, a new visual tracking algorithm is designed. Through comparative experiments, the success rate of the former increased by 0.6% and 0.9% respectively on the two datasets, and its accuracy also increased by 1.2% and 1.8% accordingly. The success rate of the latter increased by 1.5% and 1.2% respectively in the two datasets, and the accuracy also increased by 1.2% and 0.6% respectively. From this, the improved algorithm can improve the performance of target tracking and has certain application potential in visual target tracking.
Time series forecasting plays a crucial role in various real-world applications, such as finance, energy, traffic, and healthcare, providing valuable insights for decision-making processes. The aggregation of informat...
详细信息
Time series forecasting plays a crucial role in various real-world applications, such as finance, energy, traffic, and healthcare, providing valuable insights for decision-making processes. The aggregation of information windows with different resolutions has proven effective in time series forecasting tasks and provides the model diverse contextual information. As a result, the network can better capture and model the heterogeneity present in the data, thereby improving performance. However, most of the current work focuses on extracting multilevel-resolution information without considering the possibility that important information can be supplemented. Meanwhile, these methods also tend to ignore the effect of resolution on frequency. To address these challenges, we introduce the Time-Frequency Domain Multi-Resolution Expansion Network (TFMRN) for long-series forecasting using multi-resolution time-frequency data. The proposed TFMRN aims to expand the data in both the time and frequency domains, enabling the model to capture finer details that may not be evident in the original data. In addition, we also propose an Information Gating Unit (IGU) to enhance the selection and guidance of rich information from the expanded time-frequency multi-resolution data. Experimental results demonstrate that the proposed method yields better performance compared with the state-of-the-art methods in both univariate and multivariate time forecasting tasks.
Eliminating bias from data representations is crucial to ensure fairness in recommendation. Existing studies primarily focus on weakening the correlation between data representations and sensitive attributes, yet may ...
详细信息
Chirality is one of the fundamental properties of molecules traditionally con-structed from ***,we report for thefirst time the successful construction of asymmetric chiral structures utilizing highly symmetric endohe...
详细信息
Chirality is one of the fundamental properties of molecules traditionally con-structed from ***,we report for thefirst time the successful construction of asymmetric chiral structures utilizing highly symmetric endohedral metallo-fullerene superatoms based on their own bonding ***,stable mirror-symmetric sinister and rectus structures are obtained by selecting a super-atom capable of forming four chemical bonds as the chiral *** analysis shows that the chiral vibration frequency of superatomic assemblies can be as low as a few wavenumbers,which greatly expands the range of chiral spectra com-pared to atom-based *** term this type of chirality based on superatoms as“superatomic-based chirality”.It is anticipated that this work will significantly expand the variety of chiral structures at the atomic level.
Object detection in remote sensing images has gained prominence alongside advancements in sensor technology and earth observation systems. Although current detection frameworks demonstrate remarkable achievements in n...
详细信息
Object detection in remote sensing images has gained prominence alongside advancements in sensor technology and earth observation systems. Although current detection frameworks demonstrate remarkable achievements in natural imagery analysis, their performance degrades when applied to remote imaging scenarios due to two inherent limitations: (1) complex background interference, which causes object features to be easily obscured by noise, leading to reduced detection accuracy;(2) the variation in object scales leads to a decrease in the model's generalization ability. To address these issues, we propose a progressive semantic-aware fusion network (ProSAF-Net). First, we design a shallow detail aggregation module (SDAM), which adaptively integrates features across different channels and scales in the early Neck stage through dynamically adjusted fusion weights, fully exploiting shallow detail information to refine object edge and texture representation. Second, to effectively integrate shallow detail information and high-level semantic abstractions, we propose a deep semantic fusion module (DSFM), which employs a progressive feature fusion mechanism to incrementally integrate deep semantic information, strengthening the global representation of objects while effectively complementing the rich shallow details extracted by SDAM, enhancing the model's capability in distinguishing objects and refining spatial localization. Furthermore, we develop a spatial context-aware module (SCAM) to fully exploit both global and local contextual information, effectively distinguishing foreground from background and suppressing interference, thus improving detection robustness. Finally, we propose auxiliary dynamic loss (ADL), which adaptively adjusts loss weights based on object scales and utilizes supplementary anchor priors to expedite parameter convergence during coordinate regression, thereby improving the model's positioning accuracy for targets. Extensive experiments on the RSOD,
Deep predictive models have been widely employed in software engineering (SE) tasks due to their remarkable success in artificial intelligence (AI). Most of these models are trained in a supervised manner, and their p...
详细信息
Deep predictive models have been widely employed in software engineering (SE) tasks due to their remarkable success in artificial intelligence (AI). Most of these models are trained in a supervised manner, and their performance heavily relies on the quality of training data. Unfortunately, mislabeling or label noise is a common issue in SE datasets, which can significantly affect the validity of models trained on such datasets. Although learning with noise approaches based on deep learning (DL) have been proposed to address the issue of mislabeling in AI datasets, the distinct characteristics of SE datasets in terms of size and data quality raise questions about the effectiveness of these approaches within the SE context. In this paper, we conduct a comprehensive study to understand how mislabeled samples exist in SE datasets, how they impact deep predictive models, and how well existing learning with noise approaches perform on SE datasets. Through an empirical evaluation on two representative datasets for the Bug Report Classification and software Defect Prediction tasks, our study reveals that learning with noise approaches have the potential to handle mislabeled samples in SE tasks, but their effectiveness is not always consistent. Our research shows that it is crucial to address mislabeled samples in SE tasks. To achieve this, it is essential to take into account the specific properties of the dataset to develop effective solutions. We also highlight the importance of addressing potential class distribution changes caused by mislabeled samples and present the limitations of existing approaches for addressing mislabeled samples. Therefore, we urge the development of more advanced techniques to improve the effectiveness and reliability of deep predictive models in SE tasks.
The early stage and accurate diagnosis of Alzheimer's Disease (AD) in neuroimaging remains a significant challenge. We introduce an innovative deep learning framework that incorporates a Focused Linear Attention (...
详细信息
Internet of Things (IoT) applications have been increasingly developed. Authenticated key agreement (AKA) plays an essential role in secure communication in IoT applications. Without the PKI certificate and high time-...
详细信息
Internet of Things (IoT) applications have been increasingly developed. Authenticated key agreement (AKA) plays an essential role in secure communication in IoT applications. Without the PKI certificate and high time-complexity bilinear pairing operations, identity-based AKA (ID-AKA) protocols without pairings are more suitable for protecting the keys in IoT applications. In recent years, many pairing-free ID-AKA protocols have been proposed. Moreover, these protocols have some security flaws or relatively extensive computation and communication efficiency. Focusing on these problems, the security analyses of some recently proposed protocols have been provided first. We then proposed a family of eCK secure ID-AKA protocols without pairings to solve these security problems, which can be applied in IoT applications to guarantee communication security. Meanwhile, the security proofs of these proposed ID-AKA protocols are provided, which show they can hold provable eCK security. Some more efficient instantiations have been provided, which show the efficient performance of these proposed ID-AKA protocols. Moreover, comparisons with similar schemes have shown that these protocols have the least computation and communication efficiency at the same time.
3D human pose estimation is an important premise for human behavior analysis and understanding, which has a wide range of applications in intelligent transportation, human-computer interaction, and animation productio...
详细信息
暂无评论