In the article, the authors propose a new optimization method inspired directly by the behavior of the Duroc pig herd, which was bred in New England. the new metaheuristics called Artificial Duroc Pigs Optimization (A...
详细信息
For delay-sensitive network services, Age of Information (AoI) is appropriate to represent the freshness of information. Meanwhile, withthe joint development of Network Function Virtualization (NFV) and Software Defi...
详细信息
For delay-sensitive network services, Age of Information (AoI) is appropriate to represent the freshness of information. Meanwhile, withthe joint development of Network Function Virtualization (NFV) and Software Defined Network (SDN) technology, these network services can be managed as Service Function Chains (SFCs), composed of a collection of Virtual Network Functions (VNFs). Because the standard SFC architectures have been challenging to accommodate the low AoI demand for information in new scenarios like the Industrial Internet of things (IIoT) and telemedicine, new SFC structures are constantly being proposed. In this paper, AoI is minimized by utilizing parallel paths to forward data between VNF instances. We formulate it as an Integer Linear Programming (ILP) problem. Moreover, a heuristic algorithm based on Particle Swarm Optimization (PSO) is proposed to solve it in large-scale networks. According to the simulation results analysis, our strategy can reduce the overall AoI of flows by at least 15% when compared to the conventional SFC architectures.
Due to the high mobility and limited transmission range of vehicles, the data download capacity of single vehicle is greatly limited, which brings poor performance to users. In this paper, we divide the data into bloc...
详细信息
ISBN:
(纸本)9789811681745;9789811681738
Due to the high mobility and limited transmission range of vehicles, the data download capacity of single vehicle is greatly limited, which brings poor performance to users. In this paper, we divide the data into blocks. We want to design a data block broadcasting scheme so that all vehicles can receive data blocks as many as possible in a base station(BS) range. We first give the mathematical model and find it is difficult to be solved directly. then we design a heuristic algorithm for solving the problem. the main idea of our algorithm is to give each data block a weight. the data block withthe largest weight is broadcast by BS, and several vehicles are selected to broadcast the remaining data blocks. We call our algorithm as the Iterative Strategy for Data Allocation(ISDA) algorithm. then considering the actual situation, we subsequently propose the online algorithm. through experiments and simulations, we prove that our scheme can effectively improve the data download rate and reduce the download delay.
Recently, OpenCL has been emerging as a programming model for energy-efficient FPGA accelerators. However, the state-of-the-art OpenCL frameworks for FPGAs suffer from poor performance and usability. this paper propos...
详细信息
ISBN:
(纸本)9781728146614
Recently, OpenCL has been emerging as a programming model for energy-efficient FPGA accelerators. However, the state-of-the-art OpenCL frameworks for FPGAs suffer from poor performance and usability. this paper proposes a high-level synthesis framework of OpenCL for FPGAs, called SOFF. It automatically synthesizes a datapath to execute many OpenCL kernel threads in a pipelined manner. It also synthesizes an efficient memory subsystem for the datapath based on the characteristics of OpenCL kernels. Unlike previous high-level synthesis techniques, we propose a formal way to handle variable-latency instructions, complex control flows, OpenCL barriers, and atomic operations that appear in real-world OpenCL kernels. SOFF is the first OpenCL framework that correctly compiles and executes all applications in the SPEC ACCEL benchmark suite except three applications that require more FPGA resources than are available. In addition, SOFF achieves the speedup of 1.33 over Intel FPGA SDK for OpenCL without any explicit user annotation or source code modification.
Cloud computing has been around us for a while now, and everywhere we look around, data surrounds us. Data is transferred over a network and computed over some cloud platform. the concept of the fog layer has also bee...
详细信息
ISBN:
(纸本)9781665461986
Cloud computing has been around us for a while now, and everywhere we look around, data surrounds us. Data is transferred over a network and computed over some cloud platform. the concept of the fog layer has also been around, and there have been multiple implementations of how these two sets of layers can work together and implement an excellent and optimized structure. Numerous factors are involved in creating an architecture like that, including time, energy, network bandwidth, etc. Every solution presented tries to cater to most of these factors and gives the best possible outcome. But there has been a trade-off always. Such architecture has been here for many years. these solutions and many new ones are implemented to find a common optimal point. Such solutions can be divided into categories like bio-inspired algorithms, evolution-based algorithms, etc. there can also be a division based on these algorithms' heuristic and metaheuristic nature. this paper focuses on all these techniques and compares the objectives and limitations of previous implementations. this paper also pointed out majorly used algorithms repeatedly seen during this research and simulated on the same set of parameters to determine which solution presents a more efficient performance.
In quantum computing, research and development of devices, architecture, optimization techniques, algorithms, and applications are evolving with eagerness in parallel. To make these work mutually beneficial, practical...
详细信息
ISBN:
(数字)9781728157757
ISBN:
(纸本)9781728157757
In quantum computing, research and development of devices, architecture, optimization techniques, algorithms, and applications are evolving with eagerness in parallel. To make these work mutually beneficial, practical and accurate quantum computer simulators as system-wide design frameworks are necessary. In this paper, we focus on measurement and initialization errors of qubits. these errors are inevitable because these procedures are interfaces between classical and quantum information processing. We model the quantum non-demolition detection technique as measurement and initialization methods and implement them on Intel-QS, a high-performance quantum simulator. Our case study with quantum Fourier transform on 8 qubits configuration demonstrates the importance of taking these errors into account. that is, post-selection, which improves the initialization fidelity, can enlarge the gap between the theoretical result and incorrect outputs by 3.35 times.
this work establishes a client-cloud collaborative computing framework, realizing the augmented reality synthesis of photos on the mobile platform. this framework solves the bottleneck of unrealistic image rendering c...
详细信息
this work establishes a client-cloud collaborative computing framework, realizing the augmented reality synthesis of photos on the mobile platform. this framework solves the bottleneck of unrealistic image rendering caused by the insufficient computing power of mobile hardware in augmented reality synthesis. On the client side, a real-time preview function is provided for users to perceive the final result. On the cloud side, a complete rendering pipeline withhighcomputing is used to achieve highly realistic picture rendering and composition. On the basis of augmenting reality, this framework increases the interaction between humans and virtual models. We propose a method to integrate user interaction input and image semantic information to solve the limitation of inaccurate estimation of depth information in distant area. Finally, this work takes the augmented reality display of virtual cherry tree as an example to build a sample program. the result shows that the framework is of high practicability and can automatically compose highly realistic photos.
Today's high-performancecomputing (HPC) systems often use GPUs as dedicated hardware accelerators to meet the computation requirements of applications such as neural networks, genetic decoding, and hydrodynamic s...
详细信息
ISBN:
(纸本)9783030445331;9783030445348
Today's high-performancecomputing (HPC) systems often use GPUs as dedicated hardware accelerators to meet the computation requirements of applications such as neural networks, genetic decoding, and hydrodynamic simulations. Meanwhile, FPGAs have also been considered as alternative suitable hardware accelerators due to their advancing computational capabilities and low power consumption. Moreover, the developments of high-Level Synthesis (HLS) allow users to generate FPGA designs directly from mainstream languages, e.g., C, C++, and OpenCL. However, writing efficient high-level programs with good performance is still a time-consuming task, and the lack of knowledge about FPGA architecture can lead to poor scalability and portability. In this paper, we propose an architecture design for Computational Fluid Dynamics (CFD) simulations based on the HLS method. Our design can adjust the performance by utilizing the parallelism inside both temporal and spatial domains of CFD simulations. We also discuss the data reuse buffer optimization choices while considering the potability of HLS codes. A performance model is introduced to guide the design space exploration under the constraints of available resources on FPGA. We evaluate our design via a Xilinx VCU1525 FPGA board and compare the results with other state-of-the-art studies. Experiment results show that VCU1525 can achieve 629.6 GFLOP/s in D2Q9 LBM-BGK model and the design and optimization methods can be used for developing various CFD applications.
Processing data streams arriving at high speed requires the development of models that can provide fast and accurate predictions. Although deep neural networks are the state-of-the-art for many machine learning tasks,...
详细信息
暂无评论