Coastal and Regional Ocean Community model (CROCO) is a modeling system used in oceanographic simulations. CROCO solves primitive hydrodynamic equations for momentum, heat and mass transport on a three dimensional, te...
Coastal and Regional Ocean Community model (CROCO) is a modeling system used in oceanographic simulations. CROCO solves primitive hydrodynamic equations for momentum, heat and mass transport on a three dimensional, terrain following grid. Several biogeochemical models are already implemented in CROCO, including the PISCES model, which it borrows from NEMO, a different oceanography modeling system. While PISCES is a highly complex model, used in many oceanographic applications, it lacks variable stoichiometry and some plankton functional types needed in certain applications. On the other hand, Biogeochemical Flux Model (BFM) is a dedicated biogeochemistry modeling system, customizable and of user-defined complexity. It can be run as a standalone model or in conjunction with an oceanographic simulation. Previously we have reported on our work regarding the incorporation of BFM into CROCO. In this work we address the performance aspects of this coupling. Namely, compared to e.g. PISCES, BFM is a much more complex model with more tracers, which makes it more computationally expensive. It is therefore of the essence to investigate, and to highlight possible areas of improvement in this coupling’s computational performance.
The promising applications of large language models are often limited by the constrained GPU memory capacity available on edge devices. Mixture-of-Experts (MoE) models help address this issue by activating only a subs...
详细信息
Meshless methods are an active and modern branch of numerical analysis with many intriguing benefits. One of the main open research questions related to local meshless methods is how to select the best possible stenci...
详细信息
With the immense computing power at our disposal, the numerical solution of partial differential equations (PDEs) is becoming a day-to-day task for modern computational scientists. However, the complexity of real-life...
详细信息
To support highly scalable and fine-grained computing paradigms such as microservices and serverless computing better, modern hardware-assisted confidential computing systems, such as Intel TDX and ARM CCA, introduce ...
To support highly scalable and fine-grained computing paradigms such as microservices and serverless computing better, modern hardware-assisted confidential computing systems, such as Intel TDX and ARM CCA, introduce permission table to achieve fine-grained and scalable memory isolation among different domains. However, it also adds an extra dimension to page walks besides page tables, leading to significantly more memory references (e.g., 4 12 for RISC-V Sv39) 1 . We observe that most costs (about 75%) caused by the extra dimension of page walks are used to validate page table pages. Based on this observation, this paper proposes HPMP (Hybrid Physical Memory Protection), a hardware-software co-design (on RISC-V) that protects page table pages using segment registers and normal pages using permission tables to balance scalability and performance. We have implemented HPMP and Penglai-HPMP (a TEE system based on HPMP) on FPGA with two RISC-V cores (both in-order and out-of-order). Evaluation results show that HPMP can reduce costs by 23.1%–73.1% on BOOM and significantly improve performance on real-world applications, including serverless computing (FunctionBench) and Redis.
One of the most popular methods employed in computational electromagnetics is the Finite Difference Time Domain (FDTD) method. We generalise it to a meshless setting using the Radial Basis Function generated Finite Di...
详细信息
The failure atomic and isolated execution of clients operations is a default requirement for a system that serve multiple loosely coupled clients at a server. However, disaggregated memory breaks this requirement in r...
详细信息
One of the most popular methods employed in computational electromagnetics is the Finite Difference Time Domain (FDTD) method. We introduce its generalisation to a meshless setting using the Radial Basis Function gene...
One of the most popular methods employed in computational electromagnetics is the Finite Difference Time Domain (FDTD) method. We introduce its generalisation to a meshless setting using the Radial Basis Function generated Finite Difference (RBF-FD) method and investigate its properties on a simple test problem.
Since the advent of mesh-free methods as a tool for the numerical analysis of systems of Partial Differential Equations (PDEs), many variants of differential operator approximation have been proposed. In this work, we...
详细信息
In this paper, we address a way to reduce the total computational cost of meshless approximation by reducing the required stencil size through spatially varying computational node regularity. Rather than covering the ...
详细信息
暂无评论