The typical characteristic of today's LAN and WAN environments is one of mixture. Old systems are mixed with a wide variety of new systems. Establishing effective network security in multi-platform, multi-vendor, ...
ISBN (print): 9798350302936
Selecting the right resources for big data analytics jobs is hard because of the wide variety of configuration options like machine type and cluster size. As poor choices can have a significant impact on resource efficiency, cost, and energy usage, automated approaches are gaining popularity. Most existing methods rely on profiling recurring workloads to find near-optimal solutions over time. Due to the cold-start problem, this often leads to lengthy and costly profiling phases. However, big data analytics jobs across users can share many common properties: they often operate on similar infrastructure, using similar algorithms implemented in similar frameworks. The potential in sharing aggregated profiling runs to collaboratively address the cold start problem is largely unexplored. We present Karasu, an approach to more efficient resource configuration profiling that promotes data sharing among users working with similar infrastructures, frameworks, algorithms, or datasets. Karasu trains lightweight performance models using aggregated runtime information of collaborators and combines them into an ensemble method to exploit inherent knowledge of the configuration search space. Moreover, Karasu allows the optimization of multiple objectives simultaneously. Our evaluation is based on performance data from diverse workload executions in a public cloud environment. We show that Karasu is able to significantly boost existing methods in terms of performance, search time, and cost, even when few comparable profiling runs are available that share only partial common characteristics with the target job.
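The abstract stays at the conceptual level; as a rough illustration only, the following Python sketch shows how aggregated profiling runs shared by several collaborators could be turned into an ensemble of lightweight performance models that ranks candidate configurations. The class names, features and cost weighting are hypothetical and are not taken from the Karasu implementation.

```python
# Hypothetical sketch only: an ensemble of lightweight runtime models, one per
# collaborator's shared profiling runs, used to rank candidate cluster
# configurations. Class names, features and the cost weighting are assumptions.
from dataclasses import dataclass
from typing import Dict, List, Tuple

import numpy as np
from sklearn.ensemble import GradientBoostingRegressor


@dataclass(frozen=True)
class Config:
    machine_type: int  # encoded machine type, e.g. an index into a catalogue
    node_count: int


def to_features(cfg: Config) -> List[float]:
    return [float(cfg.machine_type), float(cfg.node_count)]


class CollaborativeEnsemble:
    """Averages per-collaborator runtime models to score unseen configurations."""

    def __init__(self, shared_runs: Dict[str, List[Tuple[Config, float]]]):
        # shared_runs: collaborator id -> list of (config, observed runtime in s)
        self.models = []
        for runs in shared_runs.values():
            X = np.array([to_features(cfg) for cfg, _ in runs])
            y = np.array([runtime for _, runtime in runs])
            self.models.append(GradientBoostingRegressor().fit(X, y))

    def predict_runtime(self, cfg: Config) -> float:
        x = np.array([to_features(cfg)])
        return float(np.mean([m.predict(x)[0] for m in self.models]))

    def best(self, candidates: List[Config], price_per_node_hour: float,
             cost_weight: float = 1.0) -> Config:
        # Toy scalarization of two objectives (runtime and monetary cost);
        # a real optimizer would handle this trade-off more carefully.
        def score(cfg: Config) -> float:
            runtime_s = self.predict_runtime(cfg)
            cost = (runtime_s / 3600.0) * cfg.node_count * price_per_node_hour
            return runtime_s + cost_weight * cost
        return min(candidates, key=score)
```

Given even a few shared runs per collaborator, best() can then pick a reasonable configuration from a candidate grid before any profiling of the target job, which is the cold-start situation the paper addresses.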
ISBN (print): 9781728136028
The article describes the architecture of a big data processing system based on the Apache Hadoop, Apache Flume and Apache Spark toolset. Application of the developed system is shown for the storage and analysis of a dataset containing events generated within GitHub repositories; GitHub is the world's largest web service for version control using Git. System performance results are evaluated using the chosen metrics.
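As a rough sketch of the kind of analysis such a system enables (not code from the article), a PySpark job could aggregate GitHub event records already landed in HDFS; the HDFS path and the "type" field name below are assumptions about the dataset layout.

```python
# Hypothetical sketch, not code from the article: a PySpark job aggregating
# GitHub event records that have already been landed in HDFS (e.g. via Flume).
# The HDFS path and the "type" field name are assumptions about the dataset.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("github-event-stats").getOrCreate()

events = spark.read.json("hdfs:///data/github/events/*.json")

event_counts = (events
                .groupBy("type")          # e.g. PushEvent, IssuesEvent, ForkEvent
                .count()
                .orderBy(F.desc("count")))

event_counts.show(20, truncate=False)
spark.stop()
```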
ISBN (print): 9781538650356
The suffix array is the key to efficient solutions for myriads of string processing problems in different application domains, like data compression, data mining, or bioinformatics. With the rapid growth of available data, suffix array construction algorithms have to be adapted to advanced computational models such as external memory and distributed computing. In this article, we present five suffix array construction algorithms utilizing the new algorithmic big data batch processing framework Thrill, which allows scalable processing of input sizes on distributed systems in orders of magnitude that have not been considered before.
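For readers unfamiliar with the data structure, a compact sequential construction clarifies what the distributed algorithms compute; the prefix-doubling sketch below is a standard O(n log² n) method and is not one of the five Thrill-based algorithms presented in the article.

```python
# Hypothetical sequential illustration (not one of the article's five Thrill
# algorithms): prefix doubling builds the suffix array in O(n log^2 n) time by
# repeatedly sorting suffixes on rank pairs covering 2^k characters.
def suffix_array(s: str) -> list:
    n = len(s)
    if n == 0:
        return []
    sa = list(range(n))
    rank = [ord(c) for c in s]  # initial ranks: first character of each suffix
    k = 1
    while True:
        # Sort suffixes by (rank of first k chars, rank of the next k chars).
        key = lambda i: (rank[i], rank[i + k] if i + k < n else -1)
        sa.sort(key=key)
        # Re-rank: equal keys keep the same rank, new keys increment it.
        new_rank = [0] * n
        for j in range(1, n):
            new_rank[sa[j]] = new_rank[sa[j - 1]] + (key(sa[j]) != key(sa[j - 1]))
        rank = new_rank
        if rank[sa[-1]] == n - 1:  # all ranks distinct: order is final
            break
        k *= 2
    return sa


assert suffix_array("banana") == [5, 3, 1, 0, 4, 2]
```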
The computer systems developed during the 1960s and 1970s made very little impact on management decision-making. Management Information System design was constrained by three factors: the technology was large-scale and inevitably centralised and controlled by data processing staff; the systems were designed by specialist staff who rarely understood the business requirements; and managers themselves had little knowledge or "hands-on" experience of computers. In the 1980s, a greater awareness of the need for planning and better use of personnel information, coupled with the development of distributed processing systems, has presented personnel management with opportunities to use computing technology as a means of increasing the professionalism of practising personnel managers. Effective use will only occur if the implementation of technology is matched by an appraisal of skills and organisation within personnel departments. Staff will need a minimum level of computing expertise and some managers will need skills in modelling, particularly financial modelling. The relationship between personnel and data processing needs careful redefinition to build a link between the two, and data processing staff need to design and communicate an end-user strategy.
ISBN (print): 9781728108582
In our research we built a data processing pipeline for storing railway KPI data based on open-source big data technologies: Apache Hadoop, Kafka, the Karim HDFS Connector, Spark, Airflow and PostgreSQL. The created methodology for data load testing allowed us to iteratively perform data load tests with increasing data sizes, evaluate the required cluster software and hardware resources and, finally, detect bottlenecks in the solution. As a result of the research we proposed an architecture for data processing and storage and gave recommendations on data pipeline optimization. In addition, we calculated approximate cluster machine sizing for the current dataset volume for the data processing and storage services.
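As an illustration of how such a pipeline can be orchestrated (not the authors' actual code), a minimal Airflow DAG could chain an HDFS landing check, a Spark aggregation and the PostgreSQL load; Airflow 2.x is assumed, and all paths, job names and the daily schedule are assumptions.

```python
# Hypothetical sketch (not the authors' pipeline): a minimal Airflow 2.x DAG that
# checks the daily HDFS landing partition, runs a Spark aggregation job and then
# loads the results into PostgreSQL. Paths, script names and schedule are assumed.
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

with DAG(
    dag_id="railway_kpi_pipeline",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    # A Kafka-to-HDFS connector sinks raw events continuously; here we only
    # verify that the daily partition has landed before processing it.
    wait_for_landing = BashOperator(
        task_id="check_hdfs_partition",
        bash_command="hdfs dfs -test -d /data/railway/kpi/{{ ds }}",
    )

    aggregate = BashOperator(
        task_id="spark_aggregate_kpis",
        bash_command=(
            "spark-submit --master yarn aggregate_kpis.py "
            "--input /data/railway/kpi/{{ ds }} --output /data/railway/agg/{{ ds }}"
        ),
    )

    load_postgres = BashOperator(
        task_id="load_into_postgres",
        bash_command="python load_to_postgres.py --partition {{ ds }}",
    )

    wait_for_landing >> aggregate >> load_postgres
```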
Presents major findings from an in-depth multi-company American study on the effective management of the distributed computing environment. The study, led by Cambridge Technology Partners, focuses on the management ap...
Web archives constitute valuable sources for researchers in various disciplines. However, their sheer size, the typically broad scope and their temporal dimension make them difficult to work with. We have identified three views to access and explore Web archives from different perspectives: user-, data- and graph-centric. The natural way to look at the information in a Web archive is through a Web browser, just like the live Web is consumed. This is what we consider the user-centric view. The most commonly used tool to access a Web archive this way is the Wayback Machine, the Internet Archive's replay tool to render archived webpages. To facilitate the discovery of a page if the URL or timestamp of interest is unknown, we propose effective approaches to search Web archives by keyword with a temporal dimension through social bookmarks and labeled hyperlinks. Another way for users to find and access archived pages is through past information on the current Web that is linked to the corresponding evidence in a Web archive. A presented tool for this purpose ensures coherent archived states of webpages related to a common object as rich temporal representations to be referenced and shared. Besides accessing a Web archive by closely reading individual pages like users do, distant reading methods enable analyzing archival collections at scale. This data-centric view enables analysis of the Web and its dynamics itself as well as the contents of archived pages. We address both angles: 1. by presenting a retrospective analysis of crawl metadata on the size, age and growth of a Web dataset, 2. by proposing a programming framework for efficiently processing archival collections. ArchiveSpark operates on standard formats to build research corpora from Web archives and facilitates the process of filtering as well as data extraction and derivation at scale. The third perspective is what we call the graph-centric view. Here, websites, pages or extracted facts are considered nodes in
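ArchiveSpark itself is a Scala/Spark library, so the fragment below is only a generic PySpark illustration of the filter-then-derive pattern on CDX-style capture metadata, not the ArchiveSpark API; the input path and field positions are assumptions.

```python
# Hypothetical sketch of the filter-then-derive pattern on CDX-style capture
# metadata using plain PySpark; this is NOT the ArchiveSpark API. The input
# path and the CDX field positions are assumptions.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("webarchive-crawl-stats").getOrCreate()

# One capture per line; assume the common CDX order:
# urlkey, timestamp, original url, mimetype, status, ...
raw = spark.read.csv("hdfs:///archive/collection.cdx", sep=" ")
cdx = raw.select(
    F.col("_c1").alias("timestamp"),   # 14-digit capture timestamp
    F.col("_c2").alias("url"),
    F.col("_c3").alias("mime"),
)

captures_per_year = (cdx
    .filter(F.col("mime") == "text/html")                 # filtering step
    .withColumn("year", F.substring("timestamp", 1, 4))   # derivation step
    .groupBy("year").count()
    .orderBy("year"))

captures_per_year.show()
spark.stop()
```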