A Fast and Accurate Machine Learning Autograder for the Breakout Assignment (SIGCSE TS 2024 - Papers)

Who

Evan Liu, David Yuan, Syed Ahmed, Elyse Cornwall, Juliette Woodrow, Kaylee Burns, Allen Nie, Emma Brunskill, Chris Piech, Chelsea Finn

Track

SIGCSE TS 2024 Papers

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Fri 22 Mar 2024 11:10 - 11:35 at Oregon Ballroom 204 - LLM - tools Chair(s): Geoffrey Herman

Abstract

In this paper, we detail the successful deployment of a machine learning autograder that significantly decreases the grading labor required in the Breakout computer science assignment. This assignment — which tasks students with programming a game consisting of a controllable paddle and a ball that bounces off the paddle to break bricks — is popular for engaging students with introductory computer science concepts, but creates a large grading burden. Due to the game’s interactive nature, grading defies traditional unit tests and instead typically requires 8+ minutes of manually playing each student’s game to search for bugs. This amounts to 45+ hours of grading in a standard course offering and prevents further widespread adoption of the assignment. Our autograder alleviates this burden by playing each student’s game with a reinforcement learning agent and providing videos of discovered bugs to instructors. In an A/B test with manual grading, we find that our autograder reduces grading time by 44%, while slightly improving grading accuracy by 6%, ultimately saving roughly 30 hours over our deployment in two offerings of the assignment. Our results further suggest the practicality of grading other interactive assignments (e.g., other games or building websites) via similar machine learning techniques.

DOI

https://doi.org/10.1145/3626252.3630759

Evan Liu

Stanford University

United States

David Yuan

Stanford University

United States

Syed Ahmed

Oakland University

Elyse Cornwall

Stanford University

United States

Juliette Woodrow

Stanford University

United States

Kaylee Burns

Stanford University

United States

Allen Nie

Stanford University

United States

Emma Brunskill

Stanford University

United States

Chris Piech

Stanford University

United States

Chelsea Finn

Stanford University

United States

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Fri 22 Mar
Displayed time zone: Pacific Time (US & Canada) change

10:45 - 12:00	LLM - toolsPapers at Oregon Ballroom 204 Chair(s): Geoffrey Herman University of Illinois at Urbana-Champaign

10:45 25m Talk		Evaluating Automatically Generated Contextualised Programming ExercisesGlobal Papers Andre del Carpio Gutierrez The University of Auckland, Paul Denny The University of Auckland, Andrew Luxton-Reilly The University of Auckland DOI
11:10 25m Talk		A Fast and Accurate Machine Learning Autograder for the Breakout Assignment Papers Evan Liu Stanford University, David Yuan Stanford University, Syed Ahmed Oakland University, Elyse Cornwall Stanford University, Juliette Woodrow Stanford University, Kaylee Burns Stanford University, Allen Nie Stanford University, Emma Brunskill Stanford University, Chris Piech Stanford University, Chelsea Finn Stanford University DOI
11:35 25m Talk		Beyond Traditional Teaching: Designing a virtual teaching assistant using LLMs for CS educationGlobal Papers Mengqi Liu Mcgill university, Faten M'Hiri Mcgill university DOI