TY - GEN
T1 - Utilizing Past User Feedback for More Accurate Text-to-SQL
AU - Urban, Matthias
AU - Ding, Jialin
AU - Kernert, David
AU - Vaidya, Kapil
AU - Kraska, Tim
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/7/8
Y1 - 2025/7/8
N2 - In the classical problem formulation of Text-to-SQL in academia, each question is translated independently of the others into SQL. This differs from the setting in practice, where questions enter the Text-to-SQL system in sequence. Thus, for all but the first few questions, a translation history is available that contains past questions and how they were translated by the system. So far, it has not been sufficiently explored how Text-to-SQL systems can make use of the translation history to improve future translations. Another crucial difference from the academic setting is that in practice, users generally have a conversation with the Text-to-SQL system. More concretely, if the initial translation contains a mistake, the user can follow up with feedback messages to allow the system to fix the mistake. In this case, it might be helpful to remember these user feedback messages to avoid repeating past mistakes. Thus, in this paper, we explore how a history of such past conversations between users and the Text-to-SQL system can be used to make future Text-to-SQL translations more accurate. We explore several approaches for extracting relevant experiences and insights from this conversation history and show in an evaluation that utilizing them can improve translation accuracy by up to 14.9%.
AB - In the classical problem formulation of Text-to-SQL in academia, each question is translated independently of the others into SQL. This differs from the setting in practice, where questions enter the Text-to-SQL system in sequence. Thus, for all but the first few questions, a translation history is available that contains past questions and how they were translated by the system. So far, it has not been sufficiently explored how Text-to-SQL systems can make use of the translation history to improve future translations. Another crucial difference from the academic setting is that in practice, users generally have a conversation with the Text-to-SQL system. More concretely, if the initial translation contains a mistake, the user can follow up with feedback messages to allow the system to fix the mistake. In this case, it might be helpful to remember these user feedback messages to avoid repeating past mistakes. Thus, in this paper, we explore how a history of such past conversations between users and the Text-to-SQL system can be used to make future Text-to-SQL translations more accurate. We explore several approaches for extracting relevant experiences and insights from this conversation history and show in an evaluation that utilizing them can improve translation accuracy by up to 14.9%.
KW - NL2SQL
KW - Natural Language Interface for Databases
KW - Text-to-SQL
UR - https://www.scopus.com/pages/publications/105012239066
UR - https://www.scopus.com/inward/citedby.url?scp=105012239066&partnerID=8YFLogxK
U2 - 10.1145/3736733.3736739
DO - 10.1145/3736733.3736739
M3 - Conference contribution
AN - SCOPUS:105012239066
T3 - HILDA 2025 - Workshop on Human-In-the-Loop Data Analytics, Co-located with SIGMOD 2025
BT - HILDA 2025 - Workshop on Human-In-the-Loop Data Analytics, Co-located with SIGMOD 2025
PB - Association for Computing Machinery, Inc
T2 - 2025 Workshop on Human-In-the-Loop Data Analytics, HILDA 2025, Co-located with SIGMOD 2025
Y2 - 22 June 2025 through 27 June 2025
ER -