SQL Query Evaluation with Large Language Models and Abstract Syntax Trees
SQL is the foundational language for data analysis and manipulation and plays a pivotal role in database education. Proficiency in SQL is essential for students seeking to excel in data-related fields. However, conventional approaches to assessing SQL queries rely heavily on manual grading, and existing automated assessment tools usually produce only binary correct/incorrect decisions for submitted queries. Our primary research objective is to develop effective methods for evaluating the quality of SQL queries. To meet this objective, we introduce two approaches: structure-based analysis and evaluation by an instruction-tuned large language model (LLM). The first approach deconstructs queries into Abstract Syntax Trees (ASTs) and employs cosine similarity to assess student submissions. The second approach fine-tunes a pre-trained LLM, FLAN-T5, to predict the quality of student submissions. Both methods are tested on a SQL dataset and evaluated against a grading rubric with categories ranging from “good” to “unacceptable”. The experimental results demonstrate that these approaches improve grading efficiency, and that the LLM-based approach classifies the assessed SQL statements more accurately. In addition, our research contributes to Computer Science (CS) education by providing an auto-grading system that enhances both the learning experience and the evaluation process.
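To make the structure-based approach concrete, the following is a minimal Python sketch, not the paper's implementation: it uses the sqlparse library's token types as a lightweight stand-in for AST node labels, maps each query to a bag-of-node-types vector, and compares a submission with a reference solution via cosine similarity. The example queries and the keyword-normalization step are illustrative assumptions.

import math
from collections import Counter

import sqlparse  # third-party SQL parser; pip install sqlparse


def structure_vector(sql: str) -> Counter:
    # Map a query to a bag of token-type labels, a rough proxy for AST node types.
    statement = sqlparse.parse(sql)[0]
    counts: Counter = Counter()
    for token in statement.flatten():
        if token.is_whitespace:
            continue
        label = str(token.ttype)
        # Keep the keyword text itself so that, e.g., SELECT and DELETE
        # count as distinct structural elements.
        if token.ttype in sqlparse.tokens.Keyword:
            label += ":" + token.normalized
        counts[label] += 1
    return counts


def cosine_similarity(a: Counter, b: Counter) -> float:
    # Standard cosine similarity over sparse count vectors.
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


reference = "SELECT name FROM students WHERE gpa > 3.5 ORDER BY name"
submission = "SELECT name FROM students ORDER BY name"  # missing the WHERE clause
print(cosine_similarity(structure_vector(reference), structure_vector(submission)))

A score near 1.0 indicates close structural agreement with the reference; here the missing WHERE clause lowers the score, which a grading rubric could map to an intermediate category.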
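For the LLM-based approach, the sketch below shows inference with a FLAN-T5 checkpoint via the Hugging Face transformers library. The checkpoint name, prompt template, and intermediate rubric labels are illustrative assumptions; the paper fine-tunes its own model, so an off-the-shelf checkpoint will not reproduce its grades.

from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Stand-in checkpoint; fine-tuned weights would be loaded the same way.
model_name = "google/flan-t5-base"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

# Hypothetical prompt format: question, reference solution, and student submission.
prompt = (
    "Grade the student's SQL query as good, fair, poor, or unacceptable.\n"
    "Question: List student names with GPA above 3.5, sorted by name.\n"
    "Reference: SELECT name FROM students WHERE gpa > 3.5 ORDER BY name\n"
    "Submission: SELECT name FROM students ORDER BY name\n"
    "Grade:"
)

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=5)  # the label is a short sequence
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Framing grading as text-to-text generation is what makes an instruction-tuned sequence-to-sequence model such as FLAN-T5 a natural fit: the predicted rubric category is simply the decoded output string.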