A prominent effect of label noise on neural networks is the disruption of the consistency of predictions. While prior efforts primarily focused on predictions' consistency at the individual instance level, they of...
详细信息
A prominent effect of label noise on neural networks is the disruption of the consistency of predictions. While prior efforts primarily focused on predictions' consistency at the individual instance level, they often fell short of fully harnessing the consistency across multiple instances. This paper introduces subclass consistency regularization (SCCR) to maximize the potential of this collective consistency of predictions. SCCR mitigates the impact of label noise on neural networks by imposing constraints on the consistency of predictions within each subclass. However, constructing high-quality subclasses poses a formidable challenge, which we formulate as a special clustering problem. To efficiently establish these subclasses, we incorporate a clustering-based contrastive learning framework. Additionally, we introduce the Q-enhancing algorithm to tailor the contrastive learning framework, ensuring alignment with subclass construction. We conducted comprehensive experiments using benchmark datasets and real datasets to evaluate the effectiveness of our proposed method under various scenarios with differing noise rates. The results unequivocally demonstrate the enhancement in classification accuracy, especially in challenging high-noise settings. Moreover, the refined contrastive learning framework significantly elevates the quality of subclasses even in the presence of noise. Furthermore, we delve into the compatibility of contrastive learning and learning with noisy labels, using the projection head as an illustrative example. This investigation sheds light on an aspect that has hitherto been overlooked in prior research efforts.
This paper presents a Multiscale Temporal Correlation Learning with the Mamba-Fused Attention Model (MMA), an efficient and effective method for reconstructing a video clip from a spike stream. Spike cameras offer uni...
详细信息
This paper presents a Multiscale Temporal Correlation Learning with the Mamba-Fused Attention Model (MMA), an efficient and effective method for reconstructing a video clip from a spike stream. Spike cameras offer unique advantages for capturing rapid scene changes with high temporal resolution. A spike stream contains sufficient information for multiple image reconstructions. However, existing methods generate only a single image at a time for a given spike stream, which results in excessive redundant computations between consecutive frames when aiming at restoring a video clip, thereby increasing computational costs significantly. The proposed MMA addresses such challenges by constructing a spike-to-video model, directly producing an image sequence at a time. Specifically, we propose a U-shaped Multiscale Temporal Correlation Learning (MTCL) to fuse the features at different temporal resolutions for clear video reconstruction. At each scale, we introduce a Fine-Grained Attention (FGA) module for fine-spatial context modeling within a patch and a Mamba module for integrating features across patches. Adopting a lightweight U-shaped structure and fine-grained feature extraction at each level, our method reconstructs high-quality image sequences quickly. The experimental results show that the proposed MMA surpasses current state-of-the-art methods in image quality, computation cost, and model size.
In the era of expanding cloud-edge online service systems, predictive maintenance (PdM) based on key performance indicators (KPIs), such as CPU utilization, response rate, and network bandwidth, is essential for syste...
详细信息
In the era of expanding cloud-edge online service systems, predictive maintenance (PdM) based on key performance indicators (KPIs), such as CPU utilization, response rate, and network bandwidth, is essential for system operational reliability and security. Traditional data-driven PdM approaches for cloud-edge services often monitor individual indicators in isolation, neglecting their interrelationships. Although convolutional modules have been utilized for collaborative representation learning of localized high-dimensional KPIs, the high-computational complexity of CNNs and the limited kernel size hinder their efficacy in modeling high-dimensional, time-sensitive KPIs. To address these challenges, we propose a lightweight multichannel multilayer perceptron (MCMLP) framework for collaborative PdM. This framework enables efficient pointwise manipulation to capture inner-indicator hidden patterns within KPIs sequences without relying on grid-based convolution. Our MCMLP enhances locality through a two-stage dual-channel feature extraction process that captures spatial and temporal dependencies, integrating inner-indicator and cross-timeline features. The proposed MCMLP can also be easily plugged into existing CNN-based detectors as a substitute, offering a computationally economical alternative. Empirical results in real-world KPI data from various cloud-edge service scenarios demonstrate that MCMLP significantly outperforms traditional CNN-based methods, with a roughly 22% reduction in training time at a 95% confidence level (alpha = 0.05). This study also provides actionable insights for deploying MCMLP in cloud-edge service systems.
Urban flooding has become a pressing challenge for many countries and regions. Meanwhile, ecosystem-based disaster risk reduction approaches have been recognized as a sustainable and effective strategy for managing fl...
详细信息
Urban flooding has become a pressing challenge for many countries and regions. Meanwhile, ecosystem-based disaster risk reduction approaches have been recognized as a sustainable and effective strategy for managing flood risks. This study designed a conceptual framework for assessing the supply-demand risk of urban flood resilience (UFR) from the perspective of ecosystem services (ESs). Taking the city of Nanjing, China, as an example, the InVEST model and the multi-criteria comprehensive evaluation method were employed to quantify the supply of UFR provided by natural ecosystems and the demand for UFR from socio-economic systems. Additionally, based on UFR supply-demand evaluation indicators calculated for each subdistrict, the Self- Organizing Map (SOM) was used to cluster the subdistricts. Finally, UFR supply-demand matching was conducted on the subdistrict clusters, and different flood-risk levels were identified based on the supply-demand ratio. The results showed that high flood-risk subdistricts are mainly concentrated in central urban area, low flood-risk subdistricts are primarily in urban periphery, and subdistricts in urban-rural transitional zones exhibit medium flood risk. Statistical analysis revealed that this zonal pattern is closely related to land use types and the distribution of social resources. Therefore, this study provides a scientific basis for developing management strategies of urban flood prevention from the perspective of ESs.
As the era of large-scale highway maintenance arrives,the maintenance strategies have transitioned to a holistic approach that prioritizes safety,economic feasibility,and environmental *** research introduces a multi-...
详细信息
As the era of large-scale highway maintenance arrives,the maintenance strategies have transitioned to a holistic approach that prioritizes safety,economic feasibility,and environmental *** research introduces a multi-objective optimization model for highway maintenance that incorporates the interplay of decision-maker preferences across three key objectives:Highway safety performance,maintenance engineering cost,and carbon *** study employs a large-sample data analysis on a subset of the Lianhuo Highway network,which includes 2,842 pavement *** approach mitigates the impact of outliers,ensuring a substantial data buffer that fortifies the model’s capacity for generalization and bolsters its *** findings reveal a Pareto-optimal relationship among the three scrutinized variables.A particularly noteworthy observation is the M-shaped trajectory of carbon emissions,which initially rise,then decline,and ultimately rebound,contingent upon the selected maintenance ***,an examination of the relationship between maintenance costs and safety performance discloses a trend of diminishing marginal returns,illustrating that the incremental gains in safety performance attenuate as maintenance investment escalates.
TNL (Tracking by Natural Language) aims to locate the target described by a natural language sentence in a video. Most existing TNL methods are typically composed of three modules: object grounding, object tracking, a...
详细信息
TNL (Tracking by Natural Language) aims to locate the target described by a natural language sentence in a video. Most existing TNL methods are typically composed of three modules: object grounding, object tracking, and switching module, and their performance is limited by the poor performance of the grounding and switching modules due to the complex backgrounds and inaccurate information stored in the memory. This paper presents a global-local framework to address these issues, which includes a prompt-guided grounding module, a trained local tracking module, and a memory-based switcher module. The prompt-guided grounding module uses noun prompts to guide the CLIP model in focusing more on target regions and aligning visual features semantically with linguistic features, avoiding being misled by distractors and background. The memory-based switch module stores historical information with higher-quality memory, allowing the model to make more accurate decisions based on reliable data, thus improving the overall performance. Experiments on TNL2K, LaSOT, and OTB-Lang demonstrate the effectiveness and generalizability of the proposed framework.
The recently invented retina-inspired spike camera has shown great potential for capturing dynamic scenes. However, reconstructing high-quality images from the binary spike data remains a challenge due to the existenc...
详细信息
The recently invented retina-inspired spike camera has shown great potential for capturing dynamic scenes. However, reconstructing high-quality images from the binary spike data remains a challenge due to the existence of noises in the camera. This paper proposes SpikeODE, a novel approach to reconstructing clear images by exploring temporal-spatial correlation to depress noises. The main idea of our method is to restore the continuous dynamic process of real scenes in a latent space and learn the temporal correlations in a fine-grained manner. Furthermore, to model the dynamic process more effectively, we design a conditional ODE where the latent state of each timestamp is conditioned on the observed spike data. Subsequently, forward and backward inferences are conducted through the ODE to investigate the correlations between the representation of the target timestamp and the information from both past and future contexts. Additionally, we incorporate a Unet structure with a pixel-wise attention mechanism at each level to learn spatial correlations. Experimental results demonstrate that our method outperforms state-of-the-art methods across several metrics.
Personalized PageRank (PPR) is a traditional measure for node proximity on large graphs. For a pair of nodes s and t, the PPR value pi(s)(t)equals the probability that an alpha-discounted random walk from s terminates...
详细信息
Personalized PageRank (PPR) is a traditional measure for node proximity on large graphs. For a pair of nodes s and t, the PPR value pi(s)(t)equals the probability that an alpha-discounted random walk from s terminates at t and reflects the importance between s and tin a bidirectional way. As a generalization of Google's celebrated PageRank centrality, PPR has been extensively studied and has found multifaceted applications in many fields, such as network analysis, graph mining, and graph machine learning. Despite numerous studies devoted to PPR over the decades, efficient computation of PPR remains a challenging problem, and there is a dearth of systematic summaries and comparisons of existing algorithms. In this paper, we recap several frequently used techniques for PPR computation and conduct a comprehensive survey of various recent PPR algorithms from an algorithmic perspective. We classify these approaches based on the types of queries they address and review their methodologies and contributions. We also discuss some representative algorithms for computing PPR on dynamic graphs and in parallel or distributed environments.
With the development of graph neural networks, how to handle large-scale graph data has become an increasingly important topic. Currently, most graph neural network models which can be extended to large-scale graphs a...
详细信息
With the development of graph neural networks, how to handle large-scale graph data has become an increasingly important topic. Currently, most graph neural network models which can be extended to large-scale graphs are based on random sampling methods. However, the sampling process in these models is detached from the forward propagation of neural networks. Moreover, quite a few works design sampling based on statistical estimation methods for graph convolutional networks and the weights of message passing in GCNs nodes are fixed, making these sampling methods not scalable to message passing networks with variable weights, such as graph attention networks. Noting the end-to-end learning capability of neural networks, we propose a learnable sampling method. It solves the problem that random sampling operations cannot calculate gradients and samples nodes with an unfixed probability. In this way, the sampling process is dynamically combined with the forward propagation process of the features, allowing for better training of the networks. And it can be generalized to all message passing models. In addition, we apply the learnable sampling method to GNNs and propose two models. Our method can be flexibly combined with different graph neural network models and achieves excellent accuracy on benchmark datasets with large graphs. Meanwhile, loss function converges to smaller values at a faster rate during training than past methods. (c) 2023 Elsevier Ltd. All rights reserved.
In dynamic environments, making classification decisions based on classical intelligent decision support systems is a challenge, as the classification performance of decision -making and the time -cost of learning nee...
详细信息
In dynamic environments, making classification decisions based on classical intelligent decision support systems is a challenge, as the classification performance of decision -making and the time -cost of learning need to be considered simultaneously. Moreover, many tasks of classification decisions lack label information because annotating data is time-consuming, labor-intensive and expensive process. This means that some standard intelligent decision support systems will perform inferior performance if they cannot dynamically make full use of the information behind abundant unlabeled data. Therefore, by incorporating knowledge representation and dynamic updating mechanisms into concept learning processes, we introduce a novel dynamic concept learning approach, namely semi -supervised concept -cognitive computing system (s2C3S), for making classification decisions by jointly utilizing some labeled data and abundant unlabeled data under dynamic environments. A theoretical analysis has shown that the proposed s2C3S can achieve significantly lower computational costs and higher classification accuracies than the existing incremental K Nearest Neighbor method (IKNN) and concept -cognitive computing system (C3S). The experimental results on various datasets further demonstrated that our system is effective for dynamic classification decision -making with limited labeled data under dynamic learning processes. Additionally, s2C3S can also be applied to computer -assisted intelligent diagnosis from the given medical images (such as chest X-ray images) dynamically and accurately.
暂无评论