The current urban intelligent transportation is in a rapid development stage, and coherence control of vehicle formations has important implications in urban intelligent transportation research. This article focuses o...
详细信息
We introduce a novel method using a new generative model that automatically learns effective representations of the target and background appearance to detect,segment and track each instance in a video *** from curren...
详细信息
We introduce a novel method using a new generative model that automatically learns effective representations of the target and background appearance to detect,segment and track each instance in a video *** from current discriminative tracking-by-detection solutions,our proposed hierarchical structural embedding learning can predict more highquality masks with accurate boundary details over spatio-temporal space via the normalizing *** formulate the instance inference procedure as a hierarchical spatio-temporal embedded learning across time and *** the video clip,our method first coarsely locates pixels belonging to a particular instance with Gaussian distribution and then builds a novel mixing distribution to promote the instance boundary by fusing hierarchical appearance embedding information in a coarse-to-fine *** the mixing distribution,we utilize a factorization condition normalized flow fashion to estimate the distribution parameters to improve the segmentation *** qualitative,quantitative,and ablation experiments are performed on three representative video instance segmentation benchmarks(i.e.,YouTube-VIS19,YouTube-VIS21,and OVIS)and the effectiveness of the proposed method is *** impressively,the superior performance of our model on an unsupervised video object segmentation dataset(i.e.,DAVIS19)proves its *** algorithm implementations are publicly available at https://***/zyqin19/HEVis.
There is a growing interest in sustainable ecosystem development, which includes methods such as scientific modeling, environmental assessment, and development forecasting and planning. However, due to insufficient su...
详细信息
Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application *** the introduction of end-to-end direct regression methods,the field has ent...
详细信息
Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application *** the introduction of end-to-end direct regression methods,the field has entered a new stage of ***,the regression results of joints that are more heavily influenced by external factors are not accurate enough even for the optimal *** this paper,we propose an effective feature recalibration module based on the channel attention mechanism and a relative optimal calibration strategy,which is applied to themulti-viewmulti-person 3D human pose estimation task to achieve improved detection accuracy for joints that are more severely affected by external ***,it achieves relative optimal weight adjustment of joint feature information through the recalibration module and strategy,which enables the model to learn the dependencies between joints and the dependencies between people and their corresponding *** call this method as the Efficient Recalibration Network(ER-Net).Finally,experiments were conducted on two benchmark datasets for this task,Campus and Shelf,in which the PCP reached 97.3% and 98.3%,respectively.
Crowd counting provides an important foundation for public security and urban *** to the existence of small targets and large density variations in crowd images,crowd counting is a challenging *** methods usually appl...
详细信息
Crowd counting provides an important foundation for public security and urban *** to the existence of small targets and large density variations in crowd images,crowd counting is a challenging *** methods usually apply convolution neural networks(CNNs)to regress a density map,which requires annotations of individual persons and ***-supervised methods can avoid detailed labeling and only require counts as annotations of images,but existing methods fail to achieve satisfactory performance because a global perspective field and multi-level information are usually *** propose a weakly-supervised method,DTCC,which effectively combines multi-level dilated convolution and transformer methods to realize end-to-end crowd *** main components include a recursive swin transformer and a multi-level dilated convolution regression *** recursive swin transformer combines a pyramid visual transformer with a fine-tuned recursive pyramid structure to capture deep multi-level crowd features,including global *** multi-level dilated convolution regression head includes multi-level dilated convolution and a linear regression head for the feature extraction *** module can capture both low-and high-level features simultaneously to enhance the receptive *** addition,two regression head fusion mechanisms realize dynamic and mean fusion *** on four well-known benchmark crowd counting datasets(UCF_CC_50,ShanghaiTech,UCF_QNRF,and JHU-Crowd++)show that DTCC achieves results superior to other weakly-supervised methods and comparable to fully-supervised methods.
Time series anomaly detection is crucial in various industrial applications to identify unusual behaviors within the time series *** to the challenges associated with annotating anomaly events,time series reconstructi...
详细信息
Time series anomaly detection is crucial in various industrial applications to identify unusual behaviors within the time series *** to the challenges associated with annotating anomaly events,time series reconstruction has become a prevalent approach for unsupervised anomaly ***,effectively learning representations and achieving accurate detection results remain challenging due to the intricate temporal patterns and dependencies in real-world time *** this paper,we propose a cross-dimension attentive feature fusion network for time series anomaly detection,referred to as ***,a series and feature mixing block is introduced to learn representations in 1D ***,a fast Fourier transform is employed to convert the time series into 2D space,providing the capability for 2D feature ***,a cross-dimension attentive feature fusion mechanism is designed that adaptively integrates features across different dimensions for anomaly *** results on real-world time series datasets demonstrate that CAFFN performs better than other competing methods in time series anomaly detection.
In an ever-changing environment,software as a Service(SaaS)can rarely protect users'*** able to manage and control the privacy is therefore an important goal for *** the participant of composite service is substit...
详细信息
In an ever-changing environment,software as a Service(SaaS)can rarely protect users'*** able to manage and control the privacy is therefore an important goal for *** the participant of composite service is substituted,it is unclear whether the composite service satisfy user privacy requirement or *** this paper,we propose a privacy policies automatic update method to enhance user privacy when a service participant change in the composite ***,we model the privacy policies and service variation ***,according to the service variation rules,the privacy policies are automatically generated through the negotiation between user and service ***,we prove the feasibility and applicability of our method with the *** the service quantity is 50,ratio that the services variations are successfully checked by monitor is 81%.Moreover,ratio that the privacy policies are correctly updated is 93.6%.
In recent years,the demand for real-time data processing has been increasing,and various stream processing systems have *** the amount of data input to the stream processing system fluctuates,the computing resources r...
详细信息
In recent years,the demand for real-time data processing has been increasing,and various stream processing systems have *** the amount of data input to the stream processing system fluctuates,the computing resources required by the stream processing job will also *** resources used by stream processing jobs need to be adjusted according to load changes,avoiding the waste of computing *** present,existing works adjust stream processing jobs based on the assumption that there is a linear relationship between the operator parallelism and operator resource consumption(e.g.,throughput),which makes a significant deviation when the operator parallelism *** paper proposes a nonlinear model to represent operator *** divide the operator performance into three stages,the Non-competition stage,the Non-full competition stage,and the Full competition *** our proposed performance model,given the parallelism of the operator,we can accurately predict the CPU utilization and operator *** with actual experiments,the prediction error of our model is below 5%.We also propose a quick accurate auto-scaling(QAAS)method that uses the operator performance model to implement the auto-scaling of the operator parallelism of the Flink *** to previous work,QAAS is able to maintain stable job performance under load changes,minimizing the number of job adjustments and reducing data backlogs by 50%.
Aerospace incident reports, crucial for identifying abnormal aircraft occurrences, are submitted to regulatory authorities like FAA or EASA. These reports, often in natural language, offer clarity and context. Existin...
详细信息
PROBLEM Recent years have witnessed the rapid progress of self-supervised language models (LMs)[1],especially large language models (LLMs)[2].LLMs not only achieved state-of-the-art performance on many natural languag...
PROBLEM Recent years have witnessed the rapid progress of self-supervised language models (LMs)[1],especially large language models (LLMs)[2].LLMs not only achieved state-of-the-art performance on many natural language processing tasks,but also captured widespread attention from the public due to their great potential in a variety of real-world applications (***,search engines,writing assistants,etc.)through providing general-purpose intelligent services.A few of the LLMs are becoming foundation models,an analogy to infrastructure,that empower hundreds of downstream applications.
暂无评论