Mobile edge computing has shown its potential in serving emerging latency-sensitive mobile applications in ultra-dense 5G networks via offloading computation workloads from the remote cloud data center to the nearby n...
详细信息
Mobile edge computing has shown its potential in serving emerging latency-sensitive mobile applications in ultra-dense 5G networks via offloading computation workloads from the remote cloud data center to the nearby network ***,current computation offloading studies in the heterogeneous edge environment face multifaceted challenges:Dependencies among computational tasks,resource competition among multiple users,and diverse long-term *** applications typically consist of several functionalities,and one huge category of the applications can be viewed as a series of sequential *** this study,we first proposed a novel multiuser computation offloading framework for long-term sequential ***,we presented a comprehensive analysis of the task offloading process in the framework and formally defined the multiuser sequential task offloading ***,we decoupled the long-term offloading problem into multiple single time slot offloading problems and proposed a novel adaptive method to solve *** further showed the substantial performance advantage of our proposed method on the basis of extensive experiments.
The synthetic Floquet lattice,generated by multiple strong drives with mutually incommensurate frequencies,provides a powerful platform for quantum simulation of topological *** this study,we propose a 4-band tight-bi...
详细信息
The synthetic Floquet lattice,generated by multiple strong drives with mutually incommensurate frequencies,provides a powerful platform for quantum simulation of topological *** this study,we propose a 4-band tight-binding model of the Chern insulator with a Chern number C=±2 by coupling two layers of the half Bernevig–Hughes–Zhang lattice and subsequently mapping it onto the Floquet lattice to simulate its topological *** determine the Chern number of our Floquet-version model,we extend the energy pumping method proposed by Martin et al.[2017 ***.X 7041008]and the topological oscillation method introduced by Boyers et al.[2020 ***.125160505],followed by numerical simulations for both *** simulation results demonstrate the successful extraction of the Chern number using either of these methods,providing an excellent prediction of the phase diagram that closely aligns with the theoretical one derived from the original bilayer half Bernevig–Hughes–Zhang ***,we briefly discuss a potential experimental implementation for our *** work demonstrates significant potential for simulating complex topological matter using quantum computing platforms,thereby paving the way for constructing a more universal simulator for non-interacting topological quantum states and advancing our understanding of these intriguing phenomena.
Non-volatile memories(NVMs)provide lower latency and higher bandwidth than block ***,NVMs are byte-addressable and provide persistence that can be used as memory-level storage devices(non-volatile main memory,NVMM).Th...
详细信息
Non-volatile memories(NVMs)provide lower latency and higher bandwidth than block ***,NVMs are byte-addressable and provide persistence that can be used as memory-level storage devices(non-volatile main memory,NVMM).These features change storage hierarchy and allow CPU to access persistent data using load/store ***,we can directly build a file system on ***,traditional file systems are designed based on slow block *** use a deep and complex software stack to optimize file system *** design results in software overhead being the dominant factor affecting NVMM file ***,scalability,crash consistency,data protection,and cross-media storage should be reconsidered in NVMM file *** survey existing work on optimizing NVMM file ***,we analyze the problems when directly using traditional file systems on NVMM,including heavy software overhead,limited scalability,inappropriate consistency guarantee techniques,***,we summarize the technique of 30 typical NVMM file systems and analyze their advantages and ***,we provide a few suggestions for designing a high-performance NVMM file system based on real hardware Optane DC persistent memory ***,we suggest applying various techniques to reduce software overheads,improving the scalability of virtual file system(VFS),adopting highly-concurrent data structures(e.g.,lock and index),using memory protection keys(MPK)for data protection,and carefully designing data placement/migration for cross-media file system.
Real-time transformation was important for the practical implementation of impedance flow *** major obstacle was the time-consuming step of translating raw data to cellular intrinsic electrical properties(e.g.,specifi...
详细信息
Real-time transformation was important for the practical implementation of impedance flow *** major obstacle was the time-consuming step of translating raw data to cellular intrinsic electrical properties(e.g.,specific membrane capacitance C_(sm) and cytoplasm conductivityσ_(cyto)).Although optimization strategies such as neural network-aided strategies were recently reported to provide an impressive boost to the translation process,simultaneously achieving high speed,accuracy,and generalization capability is still *** this end,we proposed a fast parallel physical fitting solver that could characterize single cells’C_(sm)andσ_(cyto)within 0.62 ms/cell without any data preacquisition or pretraining *** achieved the 27000-fold acceleration without loss of accuracy compared with the traditional *** on the solver,we implemented physics-informed real-time impedance flow cytometry(piRT-IFC),which was able to characterize up to 100,902 cells’C_(sm) andσ_(cyto)within 50 min in a real-time *** to the fully connected neural network(FCNN)predictor,the proposed real-time solver showed comparable processing speed but higher ***,we used a neutrophil degranulation cell model to represent tasks to test unfamiliar samples without data for *** being treated with cytochalasin B and N-Formyl-Met-Leu-Phe,HL-60 cells underwent dynamic degranulation processes,and we characterized cell’s C_(sm)andσ_(cyto)using *** to the results from our solver,accuracy loss was observed in the results predicted by the FCNN,revealing the advantages of high speed,accuracy,and generalizability of the proposed piRT-IFC.
Currently, large-scale vision and language models has significantly improved the performances of cross-modal retrieval tasks. However, large-scale models require a substantial amount of computing resources, so the exe...
详细信息
Exploring the expected quantizing scheme with suitable mixed-precision policy is the key to compress deep neural networks(DNNs)in high efficiency and *** exploration implies heavy workloads for domain experts,and an a...
详细信息
Exploring the expected quantizing scheme with suitable mixed-precision policy is the key to compress deep neural networks(DNNs)in high efficiency and *** exploration implies heavy workloads for domain experts,and an automatic compression method is ***,the huge search space of the automatic method introduces plenty of computing budgets that make the automatic process challenging to be applied in real *** this paper,we propose an end-to-end framework named AutoQNN,for automatically quantizing different layers utilizing different schemes and bitwidths without any human *** can seek desirable quantizing schemes and mixed-precision policies for mainstream DNN models efficiently by involving three techniques:quantizing scheme search(QSS),quantizing precision learning(QPL),and quantized architecture generation(QAG).QSS introduces five quantizing schemes and defines three new schemes as a candidate set for scheme search,and then uses the Differentiable Neural architecture Search(DNAS)algorithm to seek the layer-or model-desired scheme from the *** is the first method to learn mixed-precision policies by reparameterizing the bitwidths of quantizing schemes,to the best of our *** optimizes both classification loss and precision loss of DNNs efficiently and obtains the relatively optimal mixed-precision model within limited model size and memory *** is designed to convert arbitrary architectures into corresponding quantized ones without manual intervention,to facilitate end-to-end neural network *** have implemented AutoQNN and integrated it into *** experiments demonstrate that AutoQNN can consistently outperform state-of-the-art *** 2-bit weight and activation of AlexNet and ResNet18,AutoQNN can achieve the accuracy results of 59.75%and 68.86%,respectively,and obtain accuracy improvements by up to 1.65%and 1.74%,respectively,compared with state-of-the-art ***,c
Recurrent neural networks (RNNs) have been heavily used in applications relying on sequence data such as time series and natural languages. As a matter of fact, their behaviors lack rigorous quality assurance due to t...
详细信息
Recurrent neural networks (RNNs) have been heavily used in applications relying on sequence data such as time series and natural languages. As a matter of fact, their behaviors lack rigorous quality assurance due to the black-box nature of deep learning. It is an urgent and challenging task to formally reason about the behaviors of RNNs. To this end, we first present an extension of linear-time temporal logic to reason about properties with respect to RNNs, such as local robustness, reachability, and some temporal properties. Based on the proposed logic, we formalize the verification obligation as a Hoare-like triple, from both qualitative and quantitative perspectives. The former concerns whether all the outputs resulting from the inputs fulfilling the pre-condition satisfy the post-condition, whereas the latter is to compute the probability that the post-condition is satisfied on the premise that the inputs fulfill the pre-condition. To tackle these problems, we develop a systematic verification framework, mainly based on polyhedron propagation, dimension-preserving abstraction, and the Monte Carlo sampling. We also implement our algorithm with a prototype tool and conduct experiments to demonstrate its feasibility and efficiency.
Optoelectronic synapses that integrate visual perception and pre-processing hold significant potential for neuromorphic vision systems(NVSs). However, due to a lack of wavelength sensitivity, existing NVS mainly foc...
详细信息
Optoelectronic synapses that integrate visual perception and pre-processing hold significant potential for neuromorphic vision systems(NVSs). However, due to a lack of wavelength sensitivity, existing NVS mainly focuses on gray-scale image processing, making it challenging to recognize color images. Additionally, the high power consumption of optoelectronic synapses, compared to the 10 fJ energy consumption of biological synapses, limits their broader application. To address these challenges, an energy-efficient NVS capable of color target recognition in a noisy environment was developed,utilizing a MoS2optoelectronic synapse with wavelength sensitivity. Benefiting from the distinct photon capture capabilities of 450, 535, and 650 nm light, the optoelectronic synapse exhibits wavelength-dependent synaptic plasticity, including excitatory postsynaptic current(EPSC), paired-pulse facilitation(PPF), and long-term plasticity(LTP). These properties can effectively mimic the visual memory and color discrimination functions of the human vision system. Results demonstrate that the NVS, based on MoS2optoelectronic synapses, can eliminate the color noise at the sensor level, increasing color image recognition accuracy from 50% to 90%. Importantly, the optoelectronic synapse operates at a low voltage spike of0.0005 V, consuming only 0.075 fJ per spike, surpassing the energy efficiency of both existing optoelectronic and biological synapses. This ultra-low power, color-sensitive device eliminates the need for color filters and offers great promise for future deployment in filter-free NVS.
Pedestrian wind flow is a critical factor in designing livable residential environments under growing complex urban *** pedestrian wind flow during the early design stages is essential but currently suffers from ineff...
详细信息
Pedestrian wind flow is a critical factor in designing livable residential environments under growing complex urban *** pedestrian wind flow during the early design stages is essential but currently suffers from inefficiencies in numerical *** learning,particularly generative adversarial networks(GAN),has been increasingly adopted as an alternative method to provide efficient prediction of pedestrian wind ***,existing GAN-based wind flow prediction schemes have limitations due to the lack of considering the spatial and frequency characteristics of wind flow *** study proposes a novel approach termed SFGAN,which embeds spatial and frequency characteristics to enhance pedestrian wind flow *** the spatial domain,Gaussian blur is employed to decompose wind flow into components containing wind speed and distinguished flow edges,which are used as the embedded spatial *** information of wind flow is obtained through discrete wavelet transformation and used as the embedded frequency *** spatial and frequency characteristics of wind flow are jointly utilized to enforce consistency between the predicted wind flow and ground truth during the training phase,thereby leading to enhanced *** results demonstrate that SFGAN clearly improves wind flow prediction,reducing Wind_MAE,Wind_RMSE and the Fréchet Inception Distance(FID)score by 5.35%,6.52%and 12.30%,compared to the previous best method,*** also analyze the effectiveness of incorporating the spatial and frequency characteristics of wind flow in predicting pedestrian wind *** reduces errors in predicting wind flow at large error intervals and performs well in wake regions and regions surrounding *** enhanced predictions provide a better understanding of performance variability,bringing insights at the early design stage to improve pedestrian wind *** proposed spatial-frequen
Graph convolutional networks(GCNs)have received significant attention from various research fields due to the excellent performance in learning graph *** GCN performs well compared with other methods,it still faces **...
详细信息
Graph convolutional networks(GCNs)have received significant attention from various research fields due to the excellent performance in learning graph *** GCN performs well compared with other methods,it still faces *** a GCN model for large-scale graphs in a conventional way requires high computation and storage ***,motivated by an urgent need in terms of efficiency and scalability in training GCN,sampling methods have been proposed and achieved a significant *** this paper,we categorize sampling methods based on the sampling mechanisms and provide a comprehensive survey of sampling methods for efficient training of *** highlight the characteristics and differences of sampling methods,we present a detailed comparison within each category and further give an overall comparative analysis for the sampling methods in all ***,we discuss some challenges and future research directions of the sampling methods.
暂无评论