作者:
Lai, YuanLiu, YifengTsinghua Univ
Sch Architecture Dept Urban Planning Beijing Peoples R China NYU
Marron Inst Urban Management New York NY 10012 USA Tsinghua Univ
Sch Architecture Dept Architecture Beijing Peoples R China
Informal urban settlements are critical for sustainable development but challenging for computational analytics due to a lack of data, raising issues on data representativeness and the digital divide. This study explo...
详细信息
ISBN:
(纸本)9781665416474
Informal urban settlements are critical for sustainable development but challenging for computational analytics due to a lack of data, raising issues on data representativeness and the digital divide. This study explores a hybrid approach integrating conventional urban design theory and digitizing methods to compute places and associated human activities in a data-absent environment. Using an informal settlement area in Beijing as a testing site, this study demonstrates a digitizing pipeline that could be applied to other data-poor sites. Research findings reveal spatial characteristics of observed human activity and associated factors relevant to the informal use of space. Such investigation in computing informal places and human activity brings vital value to address the current digital divide and spatial justice in urban development.
Motor-Imagery Brain-Machine Interfaces (MI-BMIs) promise direct and accessible communication between human brains and machines by analyzing brain activities recorded with Electroencephalography (EEG). Latency, reliabi...
详细信息
ISBN:
(数字)9781728169972
ISBN:
(纸本)9781728169972
Motor-Imagery Brain-Machine Interfaces (MI-BMIs) promise direct and accessible communication between human brains and machines by analyzing brain activities recorded with Electroencephalography (EEG). Latency, reliability, and privacy constraints make it unsuitable to offload the computation to the cloud. Practical use cases demand a wearable, battery-operated device with low average power consumption for long-term use. Recently, sophisticated algorithms, in particular deep learning models, have emerged for classifying EEG signals. While reaching outstanding accuracy, these models often exceed the limitations of edge devices due to their memory and computational requirements. In this paper, we demonstrate algorithmic and implementation optimizations for EEGNET, a compact Convolutional Neural network (CNN) suitable for many BMI paradigms. We quantize weights and activations to 8-bit fixed-point with a negligible accuracy loss of 0.4% on 4-class MI, and present an energy-efficient hardware-aware implementation on the Mr. Wolf parallel ultra-low power (PULP) System-on-Chip (SoC) by utilizing its custom RISC-V ISA extensions and 8-core compute cluster. With our proposed optimization steps, we can obtain an overall speedup of 64 x and a reduction of up to 85% in memory footprint with respect to a single-core layer-wise baseline implementation. Our implementation takes only 5.82 ms and consumes 0.627 mJ per inference. With 21.0 GMAC/s/W, it is 256x more energy-efficient than an EEGNET implementation on an ARM Cortex-M7 (0.082 GMAC/s/W).
Over the past decades, the volume of Internet of Things (IoT) data has exploded which raise the problem of indexing, storing and retrieving efficiently. Most studies are based on dividing the target dataset into subse...
详细信息
ISBN:
(纸本)9781665449199
Over the past decades, the volume of Internet of Things (IoT) data has exploded which raise the problem of indexing, storing and retrieving efficiently. Most studies are based on dividing the target dataset into subsets using balls with one or two pivots. However, in the era of big data, where efficient indexing is important, the subspace volumes grow exponentially, which could degenerate the index. This problem is due to the inherent inadequacy of space partitioning. The topology must avoid biased allocation of objects for separable sets and must not influence the index structure. To meet this criteria, in this paper a new indexing structure called QCCF-tree (Quad tree based on Containers at the Cloud- Fog computing level), is proposed, based on dividing the space into four balk with four pivots. For the enhancement of the retrieving time in this new structure, the query is searched in parallel at each node of the QCCF-tree. The experimental evaluation of the proposed system, compared with several indexing systems, show that QCCF-tree outperforms most indexing systems either in the construction or the similarity query search.
Applications augmented with adaptive capabilities are becoming common in parallelcomputing environments which share resources such as main memory, network, or disk I/O. For large-scale scientific applications, dynami...
详细信息
ISBN:
(纸本)0769526373
Applications augmented with adaptive capabilities are becoming common in parallelcomputing environments which share resources such as main memory, network, or disk I/O. For large-scale scientific applications, dynamic adjustments to a computationally-intensive part may lead to a large pay-off in facilitating efficient execution of the entire application while aiming at avoiding resource contention. Application-specific knowledge, often best revealed during the run-time, is required to initiate and time these adjustments. In particular General Atomic and Molecular Electronic Structure System (GAMESS) used for ab initio molecular quantum chemistry calculations has two different implementations of Self-Consistency Field (SCF) methods, each of which targets either disk I/O or memory. This paper describes a mechanism enabling switching of algorithms during GAMESS run-time and shows the effect of the adaptations on the performance of GAMESS calculations as well as on a parallel GAMESS execution for different resource availability. The test results indicate that, in the presence of I/O resource contention, parallel GAMESS enhanced with adaptive mechanism may sustain the performance similar to that of full resource availability.
作者:
Pianini, DaniloUniv Bologna
Alma Mater Studiorum Dipartimento Informat Sci & Ingn I-47522 Cesena FC Italy
Many interesting systems in several disciplines can be modeled as networks of nodes that can store and exchange data: pervasive systems, edge computing scenarios, and even biological and bio-inspired systems. These sy...
详细信息
ISBN:
(纸本)9783030781972;9783030781989
Many interesting systems in several disciplines can be modeled as networks of nodes that can store and exchange data: pervasive systems, edge computing scenarios, and even biological and bio-inspired systems. These systems feature inherent complexity, and often simulation is the preferred (and sometimes the only) way of investigating their behavior;this is true both in the design phase and in the verification and testing phase. In this tutorial paper, we provide a guide to the simulation of such systems by leveraging Alchemist, an existing research tool used in several works in the literature. We introduce its meta-model and its extensible architecture;we discuss reference examples of increasing complexity;and we finally show how to configure the tool to automatically execute multiple repetitions of simulations with different controlled variables, achieving reliable and reproducible results.
Computer architectures for high performance computing have traditionally been based on an assumption of one parallel application running alone on one machine. The current trend is, however, that huge computer installa...
详细信息
ISBN:
(纸本)9781424449217
Computer architectures for high performance computing have traditionally been based on an assumption of one parallel application running alone on one machine. The current trend is, however, that huge computer installations offer compute power to a set of users or customers, each demanding only a subset of the available compute resources. This places new requirements on the architecture, in that it must support dynamic partitioning of the resources into several virtual servers as demand changes. We introduce a novel framework which supports flexible formation of such virtual servers while preventing interference between the communication of different virtual servers. This paper investigates the impacts of a shared interconnection network on applications running on virtual compute servers. We show that the interconnect performance supplied to each job is highly unpredictable, and that a job can experience a performance degradation of 97% when its traffic interferes with the traffic of concurrent jobs. With a minor reduction in the utilization of each processing node, this can be considerably improved through a combination of routing-containment in the interconnection network and a carefully designed resource allocation strategy.
Decentralized learning is an emerging field of research that opens doors for many novel pervasive computing applications. In decentralized learning, model training is offloaded to devices in the edge, and in some appr...
详细信息
ISBN:
(纸本)9781665416474
Decentralized learning is an emerging field of research that opens doors for many novel pervasive computing applications. In decentralized learning, model training is offloaded to devices in the edge, and in some approaches, functions entirely without a central controller. SWARM is a tool for fast and large-scale simulations to test the performance of the practical implementations of decentralized learning algorithms that underlie many pervasive computing applications. In SWARM, developers can easily launch simulations for their algorithms by simply writing code that defines the behavior of a device when collaborating with others. The developer delegates to SWARM the emulation of the devices' encounters, given a pervasive computing scenario. By decoupling the encounter emulation and the learning algorithm execution, SWARM makes the configuration of diverse application scenarios easy and their simulations repeatable. Moreover, developers can evaluate the scalability of an algorithm in diverse and large-scale application contexts as SWARM can automatically deploy and manage multiple worker nodes. Finally, the SWARM Dashboard provides a visualization of the simulation progress and the algorithm performance.
Performing machine learning inference at the network edge, named Edge Inference, showing benefits like low latency, reduced data traffic, and improved user privacy, has attracted massive attention. computing Power Net...
详细信息
parallel file systems provide high-performance disk access by transparently striping data across multiple disks and I/O nodes. Similar to peer-to-peer systems (e.g. Freenet, Oceanstore, Chord/ CFS, Past), parallel fil...
详细信息
ISBN:
(纸本)1892512459
parallel file systems provide high-performance disk access by transparently striping data across multiple disks and I/O nodes. Similar to peer-to-peer systems (e.g. Freenet, Oceanstore, Chord/ CFS, Past), parallel file systems for clusters are employed on networked computers whose nodes are not guaranteed to be always available, due to node failures or network failure. However, different from peer-to-peer systems, cluster file systems like PVFS do not handle these failures very well. In this work, we explore how cluster file system can utilize certain peer-to-peer techniques that can handle failing nodes and thus allow for high data availability.
A timely and accurate prediction of the vehicle's trajectory can render a good service, help predict traffic flow, and in the more interesting case, detect potentially dangerous situations as early as possible. It...
详细信息
ISBN:
(纸本)9781538637906
A timely and accurate prediction of the vehicle's trajectory can render a good service, help predict traffic flow, and in the more interesting case, detect potentially dangerous situations as early as possible. It is widespread to use historical data to predict trajectories. However, due to the sparsity of trajectory, sometimes it is difficult to predicate trajectory. A novel approach to trajectory prediction is proposed which has the capability to predict the trajectory with sparse data. To achieve this, we denote city route and trajectory as a series of fixed points. Under normal conditions, only the most similarly trajectory is used to predict trajectories, which may mislead the result. To remedy this problem, we use several most similarity trajectories to predict the next point, and regard the most points as our result. The experimental results show that the approach improves the prediction accuracy 8%, coverage 2%, and greatly outperform the baseline algorithm (30 ms) compared with Spatial Iterative Grid Partition (SIGP).
暂无评论