Root cause analysis is a common data analysis task. While question-answering systems enable people to easily articulate a why question (e.g., why students in Massachusetts have high ACT Math scores on average) and obt...
详细信息
Deepfake detection remains challenging, particularly when identifying deepfakes generated by unseen forgery methods. Recent studies have shown that detectors trained on forgery data from Generative Adversarial Network...
详细信息
Raw flash file systems are essential in today's cost-sensitive embedded systems and legacy embedded devices. The performance of raw flash file systems is often limited by their inefficient garbage collection (GC) ...
详细信息
ISBN:
(纸本)9781450393409
Raw flash file systems are essential in today's cost-sensitive embedded systems and legacy embedded devices. The performance of raw flash file systems is often limited by their inefficient garbage collection (GC) due to the tight capacity of the onboard flash memory and CPU computing power. However, in real-world scenarios, we observed that the GC overhead can account for more than 85% of the total I/O time, which not only degrades the performance of the file system but also shortens the endurance of the flash chip. However, existing GC optimization used in high-end SSD or serverside file systems usually leads to high CPU and memory usage, which is not suitable for the embedded environment. In this paper, we propose LightGC, a low-complexity and high-efficiency GC scheme. We design an optimized clustering algorithm for page hotness measurement and a hot-delay GC victim selection strategy. LightGC can also be adapted to locality changes in workloads to maximize the effect of cold and heat separation. Our experimental results show that LightGC offers a relative improvement of up to 19% in GC efficiency, a 5%-15% reduction in write amplification, and a 10%-17% increase in flash memory endurance compared to other existing GC methods. In addition, driven by LightGC, we implement the UBIFS2 file system, an upgraded version of the UBIFS flash file system. We have done careful engineering design so that UBIFS2 and the original UBIFS are compatible with each other and users can switch between them freely.
The prevention of criminal activity has changed dramatically over the past two decades, largely due to the increased reliance on systems that provide crime data analysis. Created specifically for police, judicial sent...
详细信息
Bitcoin and the Dark Web present an interesting synergy that enables both legitimate anonymity and illicit activities, making it an important landscape to understand, especially as the Dark Web, with its hidden servic...
详细信息
ISBN:
(纸本)9798400714764
Bitcoin and the Dark Web present an interesting synergy that enables both legitimate anonymity and illicit activities, making it an important landscape to understand, especially as the Dark Web, with its hidden services, relies heavily on Bitcoin as a pseudonymous currency for transactions. However, a lack of scalable tools and timely datasets has limited systematic analysis of this ecosystem. To address this gap, we introduce Venom, a scalable framework for mapping Bitcoin activity on the Dark Web. Venom integrates multithreaded crawling, data extraction, and dataset generation, resulting in a comprehensive resource that allows us to easily collect snapshots of over 177,000 onion sites in roughly 24 hours. With the paper, we share both the tool and an example snapshot containing both per-site metadata and Bitcoin transaction data. Preliminary analysis reveals concentrated activity among key players and widespread content mirroring, offering new insights into the Dark Web's economic structure. Venom provides a critical resource for advancing research and monitoring in this domain.
Leafmap is a Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment. It is built upon several open-source packages, such as ipyleaflet and *** (for creating interac...
详细信息
Sparse direct solvers play a vital role in large-scale high performance computing in science and engineering. Existing distributed sparse direct methods employ multifrontal/supernodal patterns to aggregate columns of ...
详细信息
Mean-field models are an established method to analyze large stochastic systems with N interacting objects by means of simple deterministic equations that are asymptotically correct when N tends to infinity. For finit...
详细信息
We present DeLiBA-K, an improved version of the Development of Linux Block I/O Accelerators (DeLiBA) framework. DeLiBA-K operates at the Linux kernel level, bypassing the user-space interactions of DeLiBA-1 and -2 to ...
ISBN:
(纸本)9798350355543
We present DeLiBA-K, an improved version of the Development of Linux Block I/O Accelerators (DeLiBA) framework. DeLiBA-K operates at the Linux kernel level, bypassing the user-space interactions of DeLiBA-1 and -2 to interact with the block and network I/O kernel stack directly. Another critical feature of DeLiBA-K is implementing and benchmarking the modern io_uring Asynchronous I/O (AIO) API within a 16nm AMD Alveo U280 FPGA I/O framework. This allows for better parallelism and reduced latency in I/O operations. Our results show significant performance gains, up to a 3.2x improvement in I/O operations per second (IOPS) and 3.45x increased throughput for synthetic workloads. Real-world applications see a 30% reduction in execution time for data-intensive tasks. DeLiBA-K has been successfully tested in an industrial environment using real workloads, demonstrating its effectiveness in large-scale enterprise environments.
Autonomous vehicles are predicted to dominate the transportation industry in the foreseeable future. Safety is one of the major challenges to the early deployment of self-driving systems. To ensure safety, self-drivin...
详细信息
暂无评论