Learned Index Structures (LIS) view a sorted index as a model that learns the data distribution, takes a data element key as input, and outputs the predicted position of the key. The original LIS can only handle looku...
详细信息
It is well known that any graph admits a crossing-free straight-line drawing in R3 and that any planar graph admits the same even in R2. For a graph G and d ∈ {2, 3}, let ρ1d(G) denote the smallest number of lines i...
详细信息
We introduce a new complexity measure for finite strings using probabilistic finite-state automata (PFAs), in the same spirit as existing notions employing DFAs and NFAs, and explore its properties. The PFA complexity...
详细信息
The present paper establishes an explicit multi-dimensional state space collapse (SSC) for parallel-processingsystems with arbitrary compatibility constraints between servers and job types. This breaks major new grou...
详细信息
作者:
Enge, AndreasINRIA
Université de Bordeaux CNRS CANARI Talence33400 France
The FastECPP algorithm is currently the fastest approach to prove the primality of general numbers, and has the additional benefit of creating certificates that can be checked independently and with a lower complexity...
详细信息
High-performance platforms capable of running deep neural network (DNN)-based applications are necessary for embedded systems such as autonomous-driving systems. These systems must be compact and power-efficient, rath...
详细信息
High-performance platforms capable of running deep neural network (DNN)-based applications are necessary for embedded systems such as autonomous-driving systems. These systems must be compact and power-efficient, rather than relying on rich-computational power platforms such as graphics processing units (GPUs). Additionally, platforms capable of executing multiple applications in parallel in such large-scale systems are required. Clustered many-core platforms, such as the Kalray massively parallel processor array (MPPA) 3-80 Coolidge, have been designed to meet these requirements. Coolidge employs more computing cores compared with single-core or multi-core processors, allowing for reduced clock frequencies per core. Consequently, Coolidge is a high-performance platform with low-power usage. Additionally, in Coolidge, cores are grouped into clusters, each of which can independently run different applications. This enables a single Coolidge platform to support multiple applications simultaneously. In the realm of cyber-physical systems (CPS), which bridge the physical and digital domains, these platforms become crucial. CPS relies on real-time embedded systems, like those in autonomous vehicles, which necessitate low-power, high-performance platforms that can perform complex computations like DNNs for object detection. In our evaluation, we examine DNN inference speed and explore the performance of the Coolidge platform. The DNN task for the evaluation employs object detection, which is commonly used in autonomous-driving systems. The acceptable speed threshold for real-time applications in object detection is 30 frames per second. Our study reveals that the "you only look once"v5 model can exceed performance benchmarks with only one cluster in Coolidge for int8 data type and two clusters for the higher precision fp16 data type. Moreover, when two clusters are utilized for DNN tasks, the remaining clusters are available for other non-DNN applications, underlining the
Strategies and their evaluations play important roles in speeding up the computation of large smooth-degree isogenies. The concept of optimal strategies for such computation was introduced by De Feo et al., and virtua...
详细信息
We study the algorithmic complexity of the problem of deciding whether a Linear Time Invariant dynamical system with rational coefficients has bounded trajectories. Despite its ubiquitous and elementary nature in Syst...
详细信息
ISBN:
(纸本)9781665436595
We study the algorithmic complexity of the problem of deciding whether a Linear Time Invariant dynamical system with rational coefficients has bounded trajectories. Despite its ubiquitous and elementary nature in systems and Control, it turns out that this question is quite intricate, and, to the best of our knowledge, unsolved in the literature. We show that classical tools, such as Gaussian Elimination, the Routh-Hurwitz Criterion, and the Euclidean Algorithm for GCD of polynomials indeed allow for an algorithm that is polynomial in the bit size of the instance. However, all these tools have to be implemented with care, and in a non-standard way, which relies on an advanced analysis.
Coded computation is an emerging paradigm of applying coding theory to large-scale distributed computing to provide resilience against slow or otherwise unavailable workers. We propose a new approach to view coded com...
详细信息
Suppose a distributed storage system containing M files is replicated across N servers, and a user wants to privately retrieve one file by accessing the servers such that the identity of the retrieved file is kept sec...
详细信息
Suppose a distributed storage system containing M files is replicated across N servers, and a user wants to privately retrieve one file by accessing the servers such that the identity of the retrieved file is kept secret from any subset of up to T servers, where each file can be viewed as a vector over the q-ary finite field Fq. A scheme designed for this purpose is called a T-private information retrieval (T-PIR) scheme. We consider the problem of explicitly constructing capacity-achieving T-PIR schemes over small finite fields. In this paper, we first provide a general framework for constructing explicit capacity-achieving T-PIR schemes for all parameters, which only relies on an MDS array matrix with a special information set. To construct such an MDS array matrix, we propose a new family of matrices over finite fields, called the generalized minor matrices of the Moore matrix, and establish a series of key identities. By combining favourable properties of generalized minor matrices with our framework, we construct an explicit capacityachieving T-PIR scheme with optimal sub-packetization over the field Fq, as small as possible, for three classes of parameters N, T,M ≥ 3. Specifically, the first class of construction works for all N = d(2t − 1), T = dt, and the field size q is the least prime power satisfying qt−1 ≥ N. Moreover, this construction generalizes the scheme proposed by Xu and Wang in 2022, which only considers the case of N = d(2t−1), T = dt with 2t−1≥ N. For all N = d(2t + 1), T = dt, our second T-PIR scheme is the first explicit construction, and the field size q is the least prime power satisfying qt ≥ N, which is the smallest field size among all known explicit capacity-achieving T-PIR schemes. Particularly, when 2t≥ N, the field size of such constructions can be reduced to 2. In the case of N = 4s and T = 2s + 1, our scheme is the first one to reduce the field size to q = 2. Compared with all known explicitly capacity-achieving T-PIR schemes, the
暂无评论