For the high-performancecomputing in a WAN environment,the geographical locations of national supercomputing centers are scattered and the network topology is complex,so it is difficult to form a unified view of *** ...
详细信息
For the high-performancecomputing in a WAN environment,the geographical locations of national supercomputing centers are scattered and the network topology is complex,so it is difficult to form a unified view of *** aggregate the widely dispersed storage resources of national supercomputing centers in China,we have previously proposed a global virtual data space named GVDS in the project of“highperformancecomputing Virtual Data Space”,a part of the National Key Research and Development Program of *** GVDS enables large-scale applications of the high-performancecomputing to run efficiently across ***,the applications running on the GVDS are often data-intensive,requiring large amounts of data from multiple supercomputing centers across *** this regard,the GVDS suffers from performance bottlenecks in data migration and access across *** solve the above-mentioned problem,this paper proposes a performance optimization framework of GVDS including the multitask-oriented data migration method and the request access-aware IO proxy resource allocation *** a WAN environment,the framework proposed in this paper can make an efficient migration decision based on the amount of migrated data and the number of multiple data sources,guaranteeing lower average migration latency when multiple data migration tasks are running in *** addition,it can ensure that the thread resource of the IO proxy node is fairly allocated among different types of requests(the IO proxy is a module of GVDS),so as to improve the application’s performance across *** experimental results show that the framework can effectively reduce the average data access delay of GVDS while improving the performance of the application greatly.
A preliminary work of our study was published at IJCAI’21 [47], which is substantially extended in the following aspects: (1) In Section 1, we analyze the necessity of introducing item attributes for detecting unreli...
详细信息
A preliminary work of our study was published at IJCAI’21 [47], which is substantially extended in the following aspects: (1) In Section 1, we analyze the necessity of introducing item attributes for detecting unreliable instances, together with the problems and challenges that attributes may bring in. (2) In Section 2, we add discussions about the limitations of existing attribute-aware recommender systems (Section 2.2) and denoising methods (Section 2.3) in the context of detecting unreliable instances. (3) In Section 4.2, we further conduct an in-depth analysis at the attribute level to demonstrate the capability of attributes for rectifying instance loss and uncertainty, as well as the disturbance caused by attributes. (4) We generalize BERD to a generic framework BERD+ in Section 5.1, equipped with novel modules, i.e., HU-GCN (Section 5.2) and EPE (Section 5.4), which properly incorporate item attributes while reducing their disturbance for rectifying instance uncer tainty (Section 5.5) and loss (Section 5.6). The generic BERD+ can be flexibly plugged into existing SRSs for performance enhanced recommendation via eliminating unreliable data. (5) In Section 6.2, we apply our BERD+ framework to seven state-of-the-art SRSs on five real-world datasets to illustrate its superiority. (6) To avoid unfair comparison caused by item attributes, we build and compare with the baseline that combines the original BERD and an advanced attribute-aware recommender system, KSR [19]. (7) For more comprehensive comparison, in Section 6.2.2, we compare BRED+ with two state-of-the-art denoising approaches;in Section 6.2.3, to examine the efficacy of HU-GCN and EPE, we compare HU-GCN with various attribute embedding techniques, i.e., variants of graph neural networks, and compare EPE with different attribute fusing methods, i.e., adding, concatenation, and weighted sum. (8) In Section 6.2.4, a detailed ablation study is conducted to verify the effectiveness of each module of BERD+. (
This paper tackles the high computational/space complexity associated with multi-head self-attention(MHSA)in vanilla vision *** this end,we propose hierarchical MHSA(H-MHSA),a novel approach that computes self-attenti...
详细信息
This paper tackles the high computational/space complexity associated with multi-head self-attention(MHSA)in vanilla vision *** this end,we propose hierarchical MHSA(H-MHSA),a novel approach that computes self-attention in a hierarchical ***,we first divide the input image into patches as commonly done,and each patch is viewed as a ***,the proposed H-MHSA learns token relationships within local patches,serving as local relationship ***,the small patches are merged into larger ones,and H-MHSA models the global dependencies for the small number of the merged *** last,the local and global attentive features are aggregated to obtain features with powerful representation *** we only calculate attention for a limited number of tokens at each step,the computational load is reduced ***,H-MHSA can efficiently model global relationships among tokens without sacrificing fine-grained *** the H-MHSA module incorporated,we build a family of hierarchical-attention-based transformer networks,namely *** demonstrate the superiority of HAT-Net in scene understanding,we conduct extensive experiments on fundamental vision tasks,including image classification,semantic segmentation,object detection and instance ***,HAT-Net provides a new perspective for vision *** and pretrained models are available at https://***/yun-liu/HAT-Net.
The demand for high-performance hardware solutions for machine learning tasks is growing as medical imaging evolves. In this paper, we will focus on the latest hardware advanced technologies: GPUs, TPUs and FPGAs that...
详细信息
Due to privacy and security concerns, recent advancements in group fairness advocate for model training regardless of demographic information. However, most methods still require prior knowledge of demographics. In th...
Breast cancer remains a leading cause of mortality among women, with millions of new cases diagnosed annually. Early detection through screening is crucial. Using neural networks to improve the accuracy of breast canc...
详细信息
Since the last few years, open-source hardware is preferred in the landscape of hardware development. Originally proprietary hardware was only developed by big multinational companies like AMD, Intel, and NVIDIA, whic...
详细信息
With the development of emerging technologies such as cloud computing and large AI models (such as LLM), many applications have placed higher demands on the intensive read and write processing of massive data. Address...
详细信息
The uncertainty in the position and size of occluding objects greatly affects the extraction of identity features in facial recognition, which is a challenge that existing methods fail to effectively address. To tackl...
详细信息
Early diagnosis of osteonecrosis of the femoral head (ONFH) can inhibit the progression and improve femoral head preservation. The radiograph difference between early ONFH and healthy ones is not apparent to the naked...
详细信息
暂无评论