Search query: subject term = "Spark SQL"
71 records; showing 1-10

A Cost Model for Spark SQL
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, vol. 31, no. 5, pp. 819-832
Authors: Baldacci, Lorenzo; Golfarelli, Matteo
Affiliation: Univ Bologna, DISI, I-40126 Bologna, Italy
In this paper, we propose a novel cost model for Spark SQL. The cost model covers the class of Generalized Projection, Selection, Join (GPSJ) queries. The cost model keeps into account the network and IO costs as well...

Handling Data Skew for Aggregation in Spark SQL Using Task Stealing
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2020, vol. 48, no. 6, pp. 941-956
Authors: He, Zeyu; Huang, Qiuli; Li, Zhifang; Weng, Chuliang
Affiliation: East China Normal Univ, Sch Data Sci & Engn, Shanghai, Peoples R China
In distributed in-memory computing systems, data distribution has a large impact on performance. Designing a good partition algorithm is difficult and requires users to have adequate prior knowledge of data, which mak...

QHB+: Accelerated Configuration Optimization for Automated Performance Tuning of Spark SQL Applications
IEEE ACCESS, 2024, vol. 12, pp. 60138-60148
Authors: Jang, Deokyeon; Yoon, Hyunsik; Jung, Kijung; Chung, Yon Dohn
Affiliation: Korea Univ, Dept Comp Sci & Engn, Seoul 02841, South Korea
Apache Spark stands out as a well-known solution for big data processing because of its efficiency and rapid processing capabilities. One of its modules, Spark SQL, serves as a prominent big data query engine. However...

Stargate: A data source connector based on Spark SQL
2nd International Conference on Machine Learning and Soft Computing (ICMLSC)
Authors: Tao, Yuzheng; Wu, Gang; Kang, Yi
Affiliations: Shanghai Jiao Tong Univ, Dept Software Engn, Shanghai, Peoples R China; Transwarp Technol Co Ltd, Dept Framework, Shanghai, Peoples R China
Spark SQL has become a landing solution when a lot of enterprises in the face of massive data analysis and processing issues. To quickly and conveniently connect computing engines to data sources on different storage ...

Indexing for Large Scale Data Querying based on Spark SQL
14th IEEE International Conference on e-Business Engineering (ICEBE)
Authors: Cui, Yi; Li, Guoqiang; Cheng, Hao; Wang, Daoyuan
Affiliations: Shanghai Jiao Tong Univ, Sch Software Engn, Shanghai, Peoples R China; Intel APAC Corp, Shanghai, Peoples R China
Spark SQL lets Spark programmers query structured data inside Spark programs using SQL statements. It provides Spark programmers with great convenience to leverage the benefits of relational processing, and its intern...

Query Optimization Approach with Middle Storage Layer for Spark SQL
22nd IEEE International Conference on Computer Supported Cooperative Work in Design (CSCWD)
Authors: Song, Aibo; Zhai, Mingyu; Xue, Yingying; Chen, Peng; Du, Mingyang; Wan, Yutong
Affiliations: NARI Technol Dev Co Ltd, Nanjing, Jiangsu, Peoples R China; Southeast Univ, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
Currently, Spark SQL cannot optimize the multi-query tasks: tasks provided by batch processing are translated into different Spark jobs, and these jobs cannot share input data. To solve this problem, this paper explor...

Workload Driven Comparison and Optimization of Hive and Spark SQL
4th International Conference on Information Science and Control Engineering (ICISCE)
Authors: Zhang, Man; Liu, Fang; Lu, Yutong; Chen, Zhiguang
Affiliation: Natl Univ Def Technol, Coll Comp, Changsha, Hunan, Peoples R China
This paper proposes how to conduct the specific job performance optimization of Hive and Spark SQL, and make a comparison of them at the same time. First, we compare Hive and Spark SQL by ten SQL queries. By analyzing...

DQN-based Join Order Optimization by Learning Experiences of Running Queries on Spark SQL
20th IEEE International Conference on Data Mining (ICDM)
Authors: Lee, Kyeong-Min; Kim, InA; Lee, Kyu-Chul
Affiliation: Chungnam Natl Univ, Dept Comp Engn, Daejeon, South Korea
In a smart grid, various types of queries such as adhoc queries and analytic queries are requested for data. There is a limit to query evaluation based on a single node database engines because queries are requested f...

Rover: An Online Spark SQL Tuning Service via Generalized Transfer Learning
29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD)
Authors: Shen, Yu; Ren, Xinyuyang; Lu, Yupeng; Jiang, Huaijun; Xu, Huanyong; Peng, Di; Li, Yang; Zhang, Wentao; Cui, Bin
Affiliations: Peking Univ, Sch CS, Beijing, Peoples R China; ByteDance Inc, Beijing, Peoples R China; Peking Univ, Ctr Data Sci, Beijing, Peoples R China; Mila Quebec AI Inst, Montreal, PQ, Canada; Peking Univ, Inst Computat Social Sci, Sch CS, Beijing, Peoples R China
Distributed data analytic engines like Spark are common choices to process massive data in industry. However, the performance of Spark SQL highly depends on the choice of configurations, where the optimal ones vary wi...

Optimization of Row Pattern Matching over Sequence Data in Spark SQL
30th International Conference on Database and Expert Systems Applications (DEXA)
Authors: Nakabasami, Kosuke; Kitagawa, Hiroyuki; Nasu, Yuya
Affiliations: Railway Tech Res Inst, Hikari Cho 2-8-38, Kokubunji, Tokyo, Japan; Univ Tsukuba, Ctr Computat Sci, Tennodai 1-1-1, Tsukuba, Ibaraki, Japan; Univ Tsukuba, Grad Sch Syst & Informat Engn, Tennodai 1-1-1, Tsukuba, Ibaraki, Japan
Due to the advance of information and communications technology and sensor technology, a large quantity of sequence data (time series data, log data, etc.) are generated and processed every day. Row pattern matching f...