Today's deep learning models face an increasing demand to handle dynamic shape tensors and computation whose shape information remains unknown at compile time and varies in a nearly infinite range at runtime. This...
详细信息
Today's deep learning models face an increasing demand to handle dynamic shape tensors and computation whose shape information remains unknown at compile time and varies in a nearly infinite range at runtime. This shape dynamism brings tremendous challenges for existing compilation pipelines designed for static models which optimize tensor programs relying on exact shape values. This paper presents TSCompiler, an end-to-end compilation framework for dynamic shape models. TSCompiler first proposes a symbolic shape propagation algorithm to recover symbolic shape information at compile time to enable subsequent optimizations. TSCompiler then partitions the shape-annotated computation graph into multiple subgraphs and fine-tunes the backbone operators from the subgraph within a hardware-aligned search space to find a collection of high-performance schedules. TSCompiler can propagate the explored backbone schedule to other fusion groups within the same subgraph to generate a set of parameterized tensor programs for fused cases based on dependence analysis. At runtime, TSCompiler utilizes an occupancy-targeted cost model to select from pre-compiled tensor programs for varied tensor shapes. Extensive evaluations show that TSCompiler can achieve state-of-the-art speedups for dynamic shape models. For example, we can improve kernel efficiency by up to 3.97× on NVIDIA RTX3090, and 10.30× on NVIDIA A100 and achieve up to five orders of magnitude speedups on end-to-end latency.
High penetration of renewable energy sources(RESs)induces sharply-fluctuating feeder power,leading to volt-age deviation in active distribution *** prevent voltage violations,multi-terminal soft open points(M-sOPs)hav...
详细信息
High penetration of renewable energy sources(RESs)induces sharply-fluctuating feeder power,leading to volt-age deviation in active distribution *** prevent voltage violations,multi-terminal soft open points(M-sOPs)have been integrated into the distribution systems to enhance voltage con-trol ***,the M-SOP voltage control recalculated in real time cannot adapt to the rapid fluctuations of photovol-taic(PV)power,fundamentally limiting the voltage controllabili-ty of *** address this issue,a full-model-free adaptive graph deep deterministic policy gradient(FAG-DDPG)model is proposed for M-SOP voltage ***,the attention-based adaptive graph convolutional network(AGCN)is lever-aged to extract the complex correlation features of nodal infor-mation to improve the policy learning ***,the AGCN-based surrogate model is trained to replace the power flow cal-culation to achieve model-free ***,the deep deterministic policy gradient(DDPG)algorithm allows FAG-DDPG model to learn an optimal control strategy of M-SOP by continuous interactions with the AGCN-based surrogate *** tests have been performed on modified IEEE 33-node,123-node,and a real 76-node distribution systems,which demonstrate the effectiveness and generalization ability of the proposed FAG-DDPGmodel.
In offshore aquaculture operations, personnel equipped with diving gear are often necessary to inspect the underwater net cages for damage, particularly on the sea floor. This manual inspection process is time-consumi...
详细信息
This paper considers link scheduling in a wireless network comprising of two types of nodes:(i)hybrid access points(HAPs)that harvest solar en-ergy,and(ii)devices that harvest radio frequency(RF)energy whenever HAPs *...
详细信息
This paper considers link scheduling in a wireless network comprising of two types of nodes:(i)hybrid access points(HAPs)that harvest solar en-ergy,and(ii)devices that harvest radio frequency(RF)energy whenever HAPs *** aim is to de-rive the shortest possible link schedule that determines the transmission time of inter-HAPs links,and uplinks from devices to *** first outline a mixed in-teger linear program(MILP),which can be run by a central node to determine the optimal schedule and transmit power of HAPs and *** then out-line a game theory based protocol called Distributed Schedule Minimization Protocol(DSMP)that is run by HAPs and ***,it does not require causal energy arrivals and channel gains *** results show that DSMP produces schedule lengths that are at most 1.99x longer than the schedule computed by MILP.
Uncertainty quantification approaches have been more critical in large language models (LLMs), particularly high-risk applications requiring reliable outputs. However, traditional methods for uncertainty quantificatio...
详细信息
Waveguide-integrated mid-infrared(MIR)photodetectors are pivotal components for the development of molecular spectroscopy applications,leveraging mature photonic integrated circuit(PIC)*** various strategies,critical ...
详细信息
Waveguide-integrated mid-infrared(MIR)photodetectors are pivotal components for the development of molecular spectroscopy applications,leveraging mature photonic integrated circuit(PIC)*** various strategies,critical challenges still remain in achieving broadband photoresponse,cooling-free operation,and large-scale complementary-metal-oxide-semiconductor(CMOS)-compatible *** leap beyond these limitations,the bolometric effect-a thermal detection mechanism-is introduced into the waveguide *** importantly,we pursue a free-carrier absorption(FCA)process in germanium(Ge)to create an efficient light-absorbing medium,providing a pragmatic solution for full coverage of the MIR spectrum without incorporating exotic materials into ***,we present an uncooled waveguide-integrated photodetector based on a Ge-on-insulator(Ge-OI)PIC architecture,which exploits the bolometric effect combined with ***,our device exhibits a broadband responsivity of 28.35%/mW across 4030-4360 nm(and potentially beyond),challenging the state of the art,while achieving a noise-equivalent power of 4.03×10-7W/Hz0.5 at 4180 *** further demonstrate label-free sensing of gaseous carbon dioxide(CO2)using our integrated photodetector and sensing waveguide on a single *** approach to room-temperature waveguide-integrated MIR photodetection,harnessing bolometry with FCA in Ge,not only facilitates the realization of fully integrated lab-on-a-chip systems with wavelength flexibility but also provides a blueprint for MIR PICs with CMOS-foundry-compatibility.
Machine learning algorithms generally assume that the data are balanced in nature. However, medical datasets suffer from the curse of dimensionality and class imbalance problems. The medical datasets are obtained from...
详细信息
Machine learning algorithms generally assume that the data are balanced in nature. However, medical datasets suffer from the curse of dimensionality and class imbalance problems. The medical datasets are obtained from the patient information which creates an imbalance in class distribution as the number of normal persons is more than the number of patients and contains a large number of features to represent a sample. It tends to the machine learning algorithms biased toward the majority class which degrades their classification performance for minority class samples and increases the computation overhead. Therefore, oversampling, feature selection and feature weighting-based four strategies are proposed to deal with the problems of class imbalance and high dimensionality. The key idea behind the proposed strategies is to generate a balanced sample space along with the optimal weighted feature space of the most relevant and discriminative features. The Synthetic Minority Oversampling Technique is utilized to generate the synthetic minority class samples and reduce the bias toward the majority class. An Improved Elephant Herding Optimization algorithm is applied to select the optimal features and weights for reducing the computation overhead and improving the interpretation ability of the learning algorithms by providing weights to relevant features. In addition, thirteen methods are developed from the proposed strategies to deal with the problems of high-dimensionality and imbalanced data. The optimized k-Nearest Neighbor (k-NN) learning algorithm is utilized to perform classification. The performance of the proposed methods is evaluated and compared for sixteen high-dimensional imbalanced medical datasets. Further, Freidman’s mean rank test is applied to show the statistical difference between the proposed methods. Experimental and statistical results show that the proposed Feature Weighting followed by the Feature Selection (FW–FS) method performed significantly b
Semantic segmentation is an important sub-task for many ***,pixel-level ground-truth labeling is costly,and there is a tendency to overfit to training data,thereby limiting the generalization *** domain adaptation can...
详细信息
Semantic segmentation is an important sub-task for many ***,pixel-level ground-truth labeling is costly,and there is a tendency to overfit to training data,thereby limiting the generalization *** domain adaptation can potentially address these problems by allowing systems trained on labelled datasets from the source domain(including less expensive synthetic domain)to be adapted to a novel target *** conventional approach involves automatic extraction and alignment of the representations of source and target domains *** limitation of this approach is that it tends to neglect the differences between classes:representations of certain classes can be more easily extracted and aligned between the source and target domains than others,limiting the adaptation over all ***,we address:this problem by introducing a Class-Conditional Domain Adaptation(CCDA)*** incorporates a class-conditional multi-scale discriminator and class-conditional losses for both segmentation and ***,they measure the segmentation,shift the domain in a classconditional manner,and equalize the loss over *** results demonstrate that the performance of our CCDA method matches,and in some cases,surpasses that of state-of-the-art methods.
Unstructured Numerical Image Dataset Separation (UNIDS) method employing an enhanced unsupervised clustering technique. The objective is to delineate an optimal number of distinct groups within the input grayscale (G-...
详细信息
Water quality prediction methods forecast the short-or long-term trends of its changes, providing proactive advice for preventing and controlling water pollution. Existing water quality prediction methods typically fa...
详细信息
暂无评论