This research presents a new framework called Spark-CNN that incorporates Apache Sparks distributed computing capabilities with Convolutional Neural Networks (CNNs) for images work processing. The results of experimen...
详细信息
ISBN:
(数字)9798350366846
ISBN:
(纸本)9798350366853
This research presents a new framework called Spark-CNN that incorporates Apache Sparks distributed computing capabilities with Convolutional Neural Networks (CNNs) for images work processing. The results of experimental evaluation show that the proposed framework is indeed effective in handling large-scale image datasets, wherein substantial improvements have been noted regarding reduced processing time, scalability and accuracy. In Trial 1, Spark-CNN outperformed standalone –Spark (180 minutes), and all CNNs (240minutes) with an estimated processing season of 120 mins. Scale2 demonstrated the scalability of analyzing 2, sustaining constant processing times despite an increase in image dataset size from 1 million to 10 million. Further, in the third examination, Spark-CNN got an accuracy of 89%, surpassing standalone CNNs with a percentage score of 85%. Comparing the Trial 4 with Horovod-CNN, Spark-CNN showed a cutthroat processing time (120 minutes) and enhanced accuracy (89% vs., Trial 5 presented the influence of cluster setup on Spark-CNN revealing reduced execution times and better precision with growing numbers of executors beneficiaries per agent. These results establish Spark-CNN as a robust and effective solution to address scalable image processing challenges across different areas.
A novel logic block circuit consisting of two multimode logic cells is proposed for the design of a tile-based FPGA fabricated with a 0.5um SOI-CMOS logic process. Each logic cell contains two 3-LUTs. The proposed 3-L...
详细信息
A novel logic block circuit consisting of two multimode logic cells is proposed for the design of a tile-based FPGA fabricated with a 0.5um SOI-CMOS logic process. Each logic cell contains two 3-LUTs. The proposed 3-LUT based logic cell circuit increases logic density by about 12% compared with a traditional 4-LLJT implementation. The logic block can be used in two functional modes: LUT mode and distributed RAM mode, the latter of which can be configured in two modes: Single-Port RAM and DualPort RAM. Comparing with the published data on the CLB in Xilinx Spartan FPGA, the maximum LUT logic propagation delay has about 20% improvement and the distributed RAM average access time has about 21% improvement.
作者:
M. NikRaveshBISC Program
Computer Sciences Division EECS Department University of California Berkeley CA USA
Retrieving relevant information is a crucial component of cased-based reasoning systems for Internet applications such as search engines. The task is to use user-defined queries to retrieve useful information accordin...
详细信息
Retrieving relevant information is a crucial component of cased-based reasoning systems for Internet applications such as search engines. The task is to use user-defined queries to retrieve useful information according to certain measures. Even though techniques exist for locating exact matches, finding relevant partial matches might be a problem. It may not be also easy to specify query requests precisely and completely - resulting in a situation known as a fuzzy-querying. It is usually not a problem for small domains, but for large repositories such as World Wide Web, a request specification becomes a bottleneck. Thus, a flexible retrieval algorithm is required, allowing for imprecise specification or search. Therefore, we envision that non-classical techniques such as fuzzy logic based-clustering methodology based on perception, fuzzy similarity, fuzzy aggregation, and FLSI for automatic information retrieval and search with partial matches are required.
The emerging Smart Agriculture based on Internet of Things (IoT) is facing major challenges like data sharing, storage, and monitoring, primarily due to the distributed nature of IoT and massive scale. We performed a ...
详细信息
ISBN:
(数字)9781728184203
ISBN:
(纸本)9781728184210
The emerging Smart Agriculture based on Internet of Things (IoT) is facing major challenges like data sharing, storage, and monitoring, primarily due to the distributed nature of IoT and massive scale. We performed a review of the literature and found that blockchain performance, scalability, cost, and throughput are the major challenges in adopting blockchain for smart agriculture. To overcome these challenges, this paper proposes a scalable and distributed data sharing system integrating access control for smart agriculture. We demonstrate our approach in a smart agriculture setting, which consists of four tiers that are: smart agriculture, smart contract, Interplanetary File System (IPFS) and agriculture stakeholders (remote users). This paper explains in detail the different components of our proposed architecture. Our approach uses anonymous identities to ensure users' privacy. Our approach is fully scalable because a large number of resource owners can use their data sharing smart contracts to create, update or delete data sharing policies. In addition, our approach does not require transaction fees when the smart contract receives a large number of policy evaluation requests. For the sake of simplicity, we publish and test a single data sharing smart contract. However, in practice, multiple smart contracts need to be deployed to allow each resource owner to securely share agriculture data with stakeholders. Finally, we evaluate the performance of our proposed system on the EOS blockchain to show that the resource consumption (in terms of computing power and network bandwidth) introduced by our framework are insignificant compared to its scalability, cost and security benefits.
Machine learning (ML) often operates on data fragmented across silos through two paradigms: distributed or centralized. This study illuminates the underexplored signifi-cance of data integration (DI) metadata in both ...
详细信息
ISBN:
(数字)9798350317152
ISBN:
(纸本)9798350317169
Machine learning (ML) often operates on data fragmented across silos through two paradigms: distributed or centralized. This study illuminates the underexplored signifi-cance of data integration (DI) metadata in both methodologies. Our contribution is threefold. First, we formalize the complex relationships of data sources with DI metadata. Second, we propose an approach that transforms DI metadata into matrix representations, and streamlines data transformation and linear algebra operations over source datasets. Third, we present an optimization method, effectively deciding between factorization versus materialization. By leveraging logic-based pruning rules and an ML-based cost estimator, our approach outperforms state-of-the-art baselines and makes the trade-off of factorization and materialization with up to 90.5% accuracy.
With each new DDoS attack potentially becoming a higher intensity attack than the previous ones, current ISP measures of over-provisioning or employing a scrubbing service are becoming ineffective and inefficient. We ...
详细信息
ISBN:
(数字)9781728156842
ISBN:
(纸本)9781728156859
With each new DDoS attack potentially becoming a higher intensity attack than the previous ones, current ISP measures of over-provisioning or employing a scrubbing service are becoming ineffective and inefficient. We argue that we need an in-network solution (i.e., entirely in the data plane), to detect DDoS attacks, identify the corresponding traffic and mitigate promptly. In this paper, we propose the first distributed in-network defense architecture, DIDA, to cope with the sophisticated amplified reflection DDoS (AR-DDoS) attacks. We leverage programmable stateful data planes and efficient data structures and show that it is possible to keep track of per-user connections in an automated and distributed manner without overwhelming the network controller. Building on top of this data, DIDA can easily detect if unsolicited attack packets are sent towards a victim within an ISP network. Once an attack is detected, the routers at the network edge automatically block the malicious sources. We prototype DIDA in P4. Our preliminary experiments show that DIDA can detect and mitigate 99.8% of amplification attacks containing 7, 000 different sources while requiring less than 1% of the memory of current programmable switches.
Recently many researchers have been studying the feasibility of a support system for medical diagnosis through the World Wide Web, and there have been many attempts to construct a distributed database of medical image...
详细信息
Recently many researchers have been studying the feasibility of a support system for medical diagnosis through the World Wide Web, and there have been many attempts to construct a distributed database of medical images connected by the computer network. In these cases, the efficient and easy presentation of three-dimensional (3-D) medical images should be a quite important tool. The authors made a 3-D presentation system through the World Wide Web with a practical example image. The target 3-D object is a human head reconstructed from 177 MRI cross section slices, and the data for this display are described by the VRML 2.0 format. The observer can see the 3-D head model from any viewpoint, furthermore, he can operate the model through the network in a simple manner. Now, VRML is a popular and standard language to describe 3-D object, so one doesn't need to prepare the special software or special instruments. However, one needs some procedure to use it more efficiently. Here, the authors present an interactive 3-D presentation procedure for medical images on the network using VRML 2.0, and it is mentioned the presentation procedure is quite useful for remote medial service, teaching procedure for medical students and explanation for patients.
Many scientific applications suffer from the lack of a unified approach to support the management and efficient processing of large-scale data. The Twister MapReduce Framework, which not only supports the traditional ...
详细信息
Many scientific applications suffer from the lack of a unified approach to support the management and efficient processing of large-scale data. The Twister MapReduce Framework, which not only supports the traditional MapReduce programming model but also extends it by allowing iterations, addresses these problems. This paper describes how Twister is applied to several kinds of scientific applications such as BLAST, MDS Interpolation and GTM Interpolation in a non-iterative style and to MDS without interpolation in an iterative style. The results show the applicability of Twister to data parallel and EM algorithms with small overhead and increased efficiency.
With the accelerated growth of big spatial data volume produced by various devices, a plethora of development research to handle big spatial data has been done in the past decade. This research reviews the fundamental...
详细信息
ISBN:
(数字)9781665472159
ISBN:
(纸本)9781665472166
With the accelerated growth of big spatial data volume produced by various devices, a plethora of development research to handle big spatial data has been done in the past decade. This research reviews the fundamental components and characteristics included in analytic systems for effectively managing big spatial data. After that, an overview of recent researches on big spatial data is discussed using four main components: source, storage, processing, and visualization. The components are then described in detail, including examples of how they are used in existing work. Afterward, some researchers’ work on systems of big spatial data is discussed showing how they support these four components. Furthermore, a comparison between these works is given in terms of the important performance metrics. Finally, this paper addresses the future research directions.
暂无评论