ISBN (Print): 9781479965137
Big data are extremely large-scale data in terms of quantity, complexity, semantics, distribution, and processing costs in computer science, cognitive informatics, web-based computing, cloud computing, and computational intelligence. Censuses and elections are a typical paradigm of big data engineering in modern digital democracy and social networks. This paper analyzes the mechanisms of voting systems and collective opinions using big data analysis technologies. A set of numerical and fuzzy models for collective opinion analysis is presented for applications in social networks, online voting, and general elections. A fundamental insight into the collective opinion equilibrium is revealed among electoral distributions and voting systems. Fuzzy analysis methods for collective opinions are rigorously developed and applied to poll data mining, collective opinion determination, and quantitative electoral data processing.
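The abstract names numerical and fuzzy models for collective opinion analysis without detailing them. The sketch below is only a minimal illustration of the general idea, assuming graded [0, 1] opinions, an optional respondent weighting, and a piecewise-linear membership function; none of these choices are taken from the paper.

```python
# Illustrative sketch only: the paper's formal models are not given in the
# abstract, so the aggregation and membership function below are assumptions.
# Poll responses are assumed to be graded on a [0, 1] scale (0 = oppose,
# 1 = support), optionally with per-respondent weights.

from typing import Optional, Sequence


def collective_opinion(grades: Sequence[float],
                       weights: Optional[Sequence[float]] = None) -> float:
    """Aggregate graded opinions into one collective-opinion value (weighted mean)."""
    if weights is None:
        weights = [1.0] * len(grades)
    return sum(g * w for g, w in zip(grades, weights)) / sum(weights)


def fuzzy_support(grade: float, threshold: float = 0.5, spread: float = 0.2) -> float:
    """Fuzzy membership of a graded opinion in the set 'supports the motion'.

    A simple piecewise-linear ramp is used purely for illustration.
    """
    if grade <= threshold - spread:
        return 0.0
    if grade >= threshold + spread:
        return 1.0
    return (grade - (threshold - spread)) / (2 * spread)


if __name__ == "__main__":
    poll = [0.9, 0.4, 0.7, 0.2, 0.8]
    print("collective opinion:", collective_opinion(poll))   # 0.6
    print("fuzzy support of 0.55:", fuzzy_support(0.55))     # 0.625
```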
Context: MapReduce is a processing model used in big data to facilitate the analysis of large datasets under a distributed architecture. Objective: The aim of this study is to identify and categorize the state of the art of software testing in MapReduce applications, determining trends and gaps. Method: A systematic mapping study discusses and classifies, according to international standards, 54 relevant studies with respect to reasons for testing, types of testing, quality characteristics, test activities, tools, roles, processes, test levels, and research validations. Results: The principal reasons for testing MapReduce applications are performance issues, potential failures, issues related to the data, or the need to satisfy agreements with efficient use of resources. The efforts are focused on performance and, to a lesser degree, on functionality. Performance testing is carried out through simulation and evaluation, whereas functional testing considers program characteristics such as specification and structure. Regardless of the type of testing, the majority of efforts focus on the unit and integration test levels of specific MapReduce functions without considering other parts of the technology stack. Conclusions: Researchers have both opportunities and challenges in performance and functional testing, and there is room to improve their research through the use of mature and standard validation methods.
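As an illustration of the unit- and integration-level testing of MapReduce functions discussed in the mapping study, a minimal sketch is given below; the word-count job, function names, and pytest-style tests are hypothetical and not taken from any of the surveyed studies.

```python
# Minimal sketch of unit-level testing of MapReduce functions, assuming a
# hypothetical word-count job; not taken from any of the surveyed studies.

from collections import defaultdict


def wc_map(line):
    """Map phase: emit (word, 1) for every word in a line of text."""
    return [(word.lower(), 1) for word in line.split()]


def wc_reduce(word, counts):
    """Reduce phase: sum the partial counts emitted for one word."""
    return word, sum(counts)


def test_map_emits_one_pair_per_word():
    assert wc_map("Big data Big") == [("big", 1), ("data", 1), ("big", 1)]


def test_reduce_sums_counts():
    assert wc_reduce("big", [1, 1, 1]) == ("big", 3)


def test_map_reduce_pipeline():
    # Integration-style check: group the mapped pairs by key, then reduce.
    grouped = defaultdict(list)
    for key, value in wc_map("to be or not to be"):
        grouped[key].append(value)
    result = dict(wc_reduce(k, v) for k, v in grouped.items())
    assert result == {"to": 2, "be": 2, "or": 1, "not": 1}
```

Tests of this kind exercise the map and reduce callables in isolation from the surrounding MapReduce runtime, which is precisely the unit/integration scope the study reports as dominant.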
In this study, we delve into efficient big data engineering and Extract, Transform, Load (ETL) processes within the healthcare sector, leveraging the robust foundation provided by the MIMIC-III Clinical Database. Our investigation entails a comprehensive exploration of various methodologies aimed at enhancing the efficiency of ETL processes, with a primary emphasis on optimizing time and resource utilization. Through careful experimentation on a representative dataset, we shed light on the advantages of incorporating PySpark and Docker containerized applications. Our research demonstrates significant advancements in time efficiency, process streamlining, and resource optimization attained through the use of PySpark for distributed computing within big data engineering workflows. Additionally, we underscore the strategic integration of Docker containers, delineating their pivotal role in augmenting scalability and reproducibility within the ETL pipeline. This paper summarizes the key insights from our experiments, accentuating the practical implications and benefits of adopting PySpark and Docker. By streamlining big data engineering and ETL processes in the context of clinical big data, our study contributes to the ongoing discourse on optimizing data processing efficiency in healthcare applications. The source code is available on request.
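A minimal sketch of the kind of PySpark-based ETL step described above is shown here. The file paths, table, columns, and transformation are hypothetical and do not reproduce the authors' MIMIC-III pipeline; in their setting such a script would typically run inside a Docker container that bundles Spark and its dependencies.

```python
# Minimal PySpark ETL sketch for a clinical table; illustrative only.
# Paths, column names, and the derived metric are assumptions, not the
# authors' actual MIMIC-III pipeline.

from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = (
    SparkSession.builder
    .appName("mimic-etl-sketch")
    .getOrCreate()
)

# Extract: read a raw CSV export (e.g., an admissions-like table).
raw = spark.read.csv("data/raw/admissions.csv", header=True, inferSchema=True)

# Transform: drop rows missing key identifiers and derive a length-of-stay column.
clean = (
    raw.dropna(subset=["subject_id", "admittime", "dischtime"])
       .withColumn(
           "los_days",
           (F.col("dischtime").cast("timestamp").cast("long")
            - F.col("admittime").cast("timestamp").cast("long")) / 86400.0,
       )
)

# Load: write the curated table as Parquet for downstream analysis.
clean.write.mode("overwrite").parquet("data/curated/admissions")

spark.stop()
```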
Big data are pervasively generated by human cognitive processes, formal inferences, and system quantifications. This paper presents the cognitive foundations of big data systems towards big data science. The key perceptual model of big data systems is the recursively typed hyperstructure (RTHS). The RTHS model reveals the inherent complexity and unprecedented difficulty of big data engineering. This finding leads to a set of mathematical and computational models for efficiently processing big data systems. The cognitive relationship between data, information, knowledge, and intelligence is formally described.
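The abstract does not give the formal definition of the recursively typed hyperstructure. Purely as a rough illustration of a recursively typed, nested data structure, one might sketch the following, where the class, field names, and size metric are all assumptions rather than the paper's model.

```python
# Rough illustration only: the abstract does not define RTHS formally, so this
# sketch merely shows a recursively typed, nested structure in which a node is
# either an atomic datum or a typed collection of further nodes.

from dataclasses import dataclass, field
from typing import List


@dataclass
class Hypernode:
    """A node carrying a type tag and either a value or nested hypernodes."""
    type_tag: str
    value: object = None
    children: List["Hypernode"] = field(default_factory=list)

    def size(self) -> int:
        """Recursively count the atomic data items reachable from this node."""
        if not self.children:
            return 1
        return sum(child.size() for child in self.children)


if __name__ == "__main__":
    record = Hypernode("record", children=[
        Hypernode("int", 42),
        Hypernode("list", children=[Hypernode("str", "a"), Hypernode("str", "b")]),
    ])
    print(record.size())  # 3 atomic items
```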
Big data are products of human collective intelligence that are exponentially increasing in all facets of quantity, complexity, semantics, distribution, and processing costs in computer science, cognitive informatics, web-based computing, cloud computing, and computational intelligence. This paper presents fundamental big data analysis and mining technologies in the domain of social networks as a typical paradigm of big data engineering. A key principle of computational sociology, known as the characteristic opinion equilibrium, is revealed in social networks and electoral systems. A set of numerical and fuzzy models for collective opinion analysis is formally presented. Fuzzy data mining methodologies are rigorously described for collective opinion elicitation and benchmarking in order to enhance conventional counting and statistical methodologies for big data analytics.
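To illustrate how a fuzzy elicitation can refine conventional counting, a toy contrast is sketched below; the confidence-weighted tally is an assumed stand-in, not the paper's methodology.

```python
# Toy contrast between crisp vote counting and a confidence-weighted (fuzzy)
# tally; the weighting scheme is an assumption, not the paper's methodology.

from collections import Counter, defaultdict

# Each response is (chosen option, degree of conviction in [0, 1]).
responses = [("A", 0.9), ("B", 0.6), ("A", 0.2), ("B", 0.8), ("A", 0.4)]

# Conventional counting: every response contributes exactly one vote.
crisp = Counter(option for option, _ in responses)

# Fuzzy elicitation: each response contributes its degree of conviction.
fuzzy = defaultdict(float)
for option, conviction in responses:
    fuzzy[option] += conviction

print("crisp counts:", dict(crisp))   # {'A': 3, 'B': 2}
print("fuzzy totals:", dict(fuzzy))   # {'A': 1.5, 'B': 1.4} -> a much closer contest
```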
Big data have become an integral part of various research fields due to the rapid advancements in the digital technologies available for dealing with data. The construction industry is no exception and has seen a spike in the data being generated due to the introduction of various disruptive digital technologies. However, despite the availability of data and the introduction of such technologies, the construction industry is lagging in harnessing big data. This paper critically explores literature published since 2010 to identify data trends and how the construction industry can benefit from big data. The presence of tools such as computer-aided drawing (CAD) and building information modelling (BIM) provides a great opportunity for researchers in the construction industry to further improve how infrastructure can be developed, monitored, or improved in the future. The gaps in the existing research data have been explored, and a detailed analysis was carried out to identify the different ways in which big data analysis and storage work in relation to the construction industry. Big data engineering (BDE) and statistics are among the most crucial steps for integrating big data technology in construction. The results of this study suggest that while existing research has set the stage for improving big data research, the integration of the associated digital technologies into the construction industry is not yet clear. Among the future opportunities, big data research into construction safety, site management, heritage conservation, project waste minimization, and quality improvement are key areas.