Regenerating codes are new network codes proposed to reduce the data required for fault repair, which can improve the recovery efficiency of faulty nodes in data storage systems. However, unlike Reed-Solomon code, whi...
详细信息
We investigated the computational capabilities of FABRIC, a nationwide research infrastructure with nearly 40 sites, for scaling neuroscience simulations. From the hardware standpoint, single-site characterization sho...
详细信息
Open source code reuse and code cross-platform deployment accelerate the spread of software vulnerabilities, and pose challenges for accurate detection of cross-platform vulnerabilities. The binary vulnerability simil...
详细信息
Serverless edge computing has emerged as a new paradigm for running short-lived computations on edge devices. Considering the challenges posed by multiple edge servers and non-negligible cold start latency in serverle...
详细信息
This paper conducts in-depth research and discussion on data storage security in cloud computing environment. This study collected 1000 dummy data from different organizations as the basic data set for the study. This...
详细信息
Allocating the most competent crowdworkers to each upcoming task is a fundamental challenge in crowdsourcing. The mechanism becomes complicated when the arriving tasks require a high level of expertise within a constr...
详细信息
ISBN:
(纸本)9781450397964
Allocating the most competent crowdworkers to each upcoming task is a fundamental challenge in crowdsourcing. The mechanism becomes complicated when the arriving tasks require a high level of expertise within a constrained budget. The validation of skill matching between tasks and crowdworkers adds a new dimension to the traditional problem of task allocation. In addition, in real-world scenarios, the influx of both tasks and workers is dynamic, making it nearly impossible to predict the precise amount of computational resources required for the crowdsourcing platform to operate efficiently. Serverless computing is a new pay-per-use, autoscalable, Function-as-a-Service based model, that ensures parallel execution of lightweight event-driven functions. The developer with serverless can solely concentrate on writing application logic with zero effort on resource provision, server management, environmental configuration, and availability. Today, collaboration has become the new competition. In light of these considerations, we propose a novel framework to facilitate task allocation strategies for crowdsourcing applications, deployed within a serverless platform in order to improve performance. The results obtained are compared to the baseline, Online-Greedy, and simulations are run in both serverless and local environments.
In the context of mobile edge computing, efficiently deploying microservices to reduce finish time and enhance user service quality is a challenging task. However, existing research still has certain deficiencies in c...
详细信息
The growing demand for intelligent environments unleashes an extraordinary cycle of privacy-aware applications that makes individuals' life more comfortable and safe. Examples of these applications include pedestr...
详细信息
ISBN:
(纸本)9781665453783
The growing demand for intelligent environments unleashes an extraordinary cycle of privacy-aware applications that makes individuals' life more comfortable and safe. Examples of these applications include pedestrian tracking systems in large areas. Although the ubiquity of camera-based systems, they are not a preferable solution due to the vulnerability of leaking the privacy of pedestrians. In this paper, we introduce a novel privacy-preserving system for pedestrian tracking in smart environments using multiple distributed LiDARs of non-overlapping views. The system is designed to leverage LiDAR devices to track pedestrians in partially covered areas due to practical constraints, e.g., occlusion or cost. Therefore, the system uses the point cloud captured by different LiDARs to extract discriminative features that are used to train a metric learning model for pedestrian matching purposes. To boost the system's robustness, we leverage a probabilistic approach to model and adapt the dynamic mobility patterns of individuals and thus connect their sub-trajectories. We deployed the system in a largescale testbed with 70 colorless LiDARs and conducted three different experiments. The evaluation result at the entrance hall confirms the system's ability to accurately track the pedestrians with a 0.98 F-measure even with zero-covered areas. This result highlights the promise of the proposed system as the next generation of privacy-preserving tracking means in smart environments.
Graph-based approximate nearest neighbor algorithms have shown high performance and quality. However, such approaches require a large amount of memory and still take a long time to construct high-quality nearest neigh...
详细信息
As rich user-generated geotagged data, microblogs have been exploited in several data analytic contexts, e.g., popular topic trends, popular site detection, and geo-targeted recommendation. To support such analysis, w...
详细信息
ISBN:
(纸本)9781665421973
As rich user-generated geotagged data, microblogs have been exploited in several data analytic contexts, e.g., popular topic trends, popular site detection, and geo-targeted recommendation. To support such analysis, we developed an efficient multidimensional index structure and parallel processing approaches for top-k frequent spatiotemporal terms query: a common analytic query on geotagged social data. Given a spatiotemporal range, the query aggregates the frequencies of terms among the social posts and identifies the most frequent terms in that range. The present work is different from studies that extract this information from stream data because we focus on large historical datasets. The key challenge is to improve query performance with minimum storage requirements. We propose a distributed index structure that transforms spatiotemporal coordinates into unique codes to generate rowkeys in key-value stores (KVSs) and balances the data distribution across distributed systems. Then, we utilize data localization to calculate sorted term lists (STLs) in parallel. To reduce input/output between KVSs and the client, we theoretically estimate the necessary length of STLs to calculate the top-k frequent terms and send only a part of the STLs to the client. Several experiments, on both real and artificial datasets, showed our approach to have both lower space requirements and better query performance than baseline approaches.
暂无评论