Search Results - Inner Mongolia University Library

Speech-to-SQL: toward speech-driven SQL query generation from natural language question

VLDB Journal, 2024, Vol. 33, No. 4, pp. 1179-1201

Authors: Song, Yuanfeng; Wong, Raymond Chi-Wing; Zhao, Xuefang
Affiliations: Hong Kong Univ Sci & Technol, Clear Water Bay, Hong Kong, Peoples R China; WeBank Co Ltd, AI Grp, Shenzhen, Peoples R China

Speech-based inputs have been gaining significant momentum with the popularity of smartphones and tablets in our daily lives, since voice is the most popular and efficient way for human-computer interaction. This paper works toward designing more effective speech-based interfaces to query the structured data in relational databases. We first identify a new task named Speech-to-SQL, which aims to understand the information conveyed by human speech and directly translate it into structured query language (SQL) statements. A naive solution to this problem can work in a cascaded manner, that is, an automatic speech recognition (ASR) component followed by a text-to-SQL component. However, it requires a high-quality ASR system and also suffers from the error compounding problem between the two components, resulting in limited performance. To handle these challenges, we propose a novel end-to-end neural architecture named SpeechSQLNet to directly translate human speech into SQL queries without an external ASR step. SpeechSQLNet has the advantage of making full use of the rich linguistic information presented in speech. To the best of our knowledge, this is the first attempt to directly synthesize SQL based on common natural language questions in spoken form, rather than a natural language-based version of SQL. To validate the effectiveness of the proposed problem and model, we further construct a dataset named SpeechQL, by piggybacking on the widely used text-to-SQL datasets. Extensive experimental evaluations on this dataset show that SpeechSQLNet can directly synthesize high-quality SQL queries from human speech, outperforming various competitive counterparts as well as the cascaded methods in terms of exact match accuracies. We expect Speech-to-SQL to inspire more research on more effective and efficient human-machine interfaces that lower the barrier to using relational databases.

Keywords: Speech-to-SQL; SQL query generation; speech-driven querying system; AI/NLP/speech for database

Translating Natural Language Queries to SQL Using the T5 Model

18th Annual IEEE International Systems Conference (SysCon)

Authors: Wong, Albert; Pham, Lien; Lee, Young; Chan, Shek; Sadaya, Razel; Khmelevsky, Youry; Clement, Mathias; Cheng, Florence Wing Yau; Mahony, Joe; Ferri, Michael
Affiliations: Langara Coll, Math & Stat, Vancouver, BC, Canada; Okanagan Coll, Math & Stat, Kelowna, BC, Canada; Okanagan Coll, Comp Sci, Kelowna, BC, Canada; Harris SmartWorks, Res & Dev, Ottawa, ON, Canada

ISBN: (print) 9798350358810; 9798350358803

This paper presents the development process of a natural-language-to-SQL model using the T5 model as the basis. The models, developed in August 2022 for an online transaction processing system and a data warehouse, have 73% and 84% exact match accuracy, respectively. These models, in conjunction with other work completed in the research project, were implemented for several companies and used successfully on a daily basis. The approach used in the model development could be implemented in a similar fashion for other database environments and with a more powerful pre-trained language model.

Keywords: Natural Language Processing; Data Query System; Text-to-SQL; Speech-to-SQL; Deep Learning; Machine Learning; T5 Model; Human-Machine-Systems; Energy Systems

VOICEQUERYSYSTEM: A Voice-driven Database Querying System Using Natural Language Questions

International Conference on Management of Data (SIGMOD)

Authors: Song, Yuanfeng; Wong, Raymond Chi-Wing; Zhao, Xuefang; Jiang, Di
Affiliations: Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China; WeBank Co Ltd, AI Grp, Shenzhen, Guangdong, Peoples R China

ISBN: (print) 9781450392495

With recent developments in natural language processing (NLP) and automatic speech recognition (ASR), voice-based interfaces have become a necessity for applications such as chatbots, search engines, and databases. In this demonstration, we introduce VOICEQUERYSYSTEM, a voice-based database querying system that enables users to conduct data operations with natural language questions (NLQs). Different from existing voice-based interfaces such as SpeakQL or EchoQuery, which restrict the voice input to be exact SQL or to follow a pre-defined template, VOICEQUERYSYSTEM attempts to achieve data manipulation via common NLQs, and thus does not require the user to have a technical background in SQL. The technique underlying VOICEQUERYSYSTEM is a new task named Speech-to-SQL, which aims to understand the semantics in speech and then translate it into SQL queries. We explore two approaches toward Speech-to-SQL translation: the cascaded one and the end-to-end (E2E) one. The cascaded method first converts the user's voice-based NLQs into text by a self-developed ASR module, and then conducts downstream SQL generation via a text-to-SQL model (i.e., IRNet). In contrast, the E2E method is a novel neural architecture named SpeechSQLNet designed by us, which converts the speech signals into SQL queries directly, without text as an intermediate medium. Extensive experiments and demonstrations validate the rationale of the Speech-to-SQL task and the effectiveness of the proposed SpeechSQLNet model. To the best of our knowledge, this is the first system that provides a voice-based querying functionality on DBMS from common NLQs.

Keywords: Speech-to-SQL; relational database; SQL query generation; voice-based interface; speech-driven querying system

A Survey of Natural Language Processing Implementation for Data Query Systems

IEEE International Conference on Recent Advances in Systems Science and Engineering (RASSE)

Authors: Wong, Albert; Joiner, Dakota; Chiu, Chunyin; Elsayed, Mohamed; Pereira, Keegan; Khmelevsky, Youry; Mahony, Joe
Affiliations: Langara Coll, Math & Stat, Vancouver, BC, Canada; Okanagan Coll, Comp Sci, Kelowna, BC, Canada; Harris SmartWorks, Res & Dev, Ottawa, ON, Canada

ISBN: (print) 9781665434416

With the complexity and volume of collected data continuing to rise, it is becoming ever more important to develop systems with high interactability. Businesses with an interest in big data continue to seek solutions that limit cost while providing effective, simplified solutions to current issues in data retrieval. Combined analysis and application of a multi-factorial system will likely lead to promising results in ease of reporting of complex data by nontechnical end users. This survey is focused on natural language processing (NLP) implementations for data query systems, especially related to massive data sets (1TB+) in OLTP databases, OLAP databases, and data warehouses. We are seeking the most up-to-date and effective uses of NLP for Speech-to-SQL and Text-to-SQL generation, and the most recent advancements in data warehousing to optimize ELT efficiency and data retrieval, focusing on the highest performing code implementations on the Spider and WikiSQL datasets. Many models, including sequence-to-sequence (seq2seq), sequence-to-SQL (Seq2SQL), and fuzzy semantic to SQL (F-SemtoSQL), among others, are briefly described and compared. As well, recent advancements in data warehousing technology like multi-disk buffering in the ELT process and hybrid multi-dimensional and relational OLAP databases (HOLAPs) are discussed. The learning gathered here is applied to fill a gap in the current industrial knowledge base in service of increased efficiency in data access, retrieval, and reporting in a customer-facing environment.

Keywords: Natural Language Processing; Data Query System; Text-to-SQL; Speech-to-SQL; Deep Learning; Machine Learning; Human-Machine-Systems; Energy Systems
