Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation

ACL ARR 2024 April Submission693 Authors

16 Apr 2024 (modified: 20 May 2024)ACL ARR 2024 April SubmissionEveryoneRevisionsBibTeXCC BY 4.0
Abstract: While Large language model (LLM)-based programming assistants such as CoPilot and ChatGPT can help improve the productivity of professional software developers, they can also facilitate cheating in introductory computer programming courses. Assuming instructors have limited control over the industrial-strength models, this paper investigates the baseline performance of 5 widely used LLMs on a collection of introductory programming problems, examines adversarial perturbations to degrade their performance, and describes the results of a user study aimed at measuring the efficacy of such perturbations in hindering actual code generation for introductory programming assignments. The user study suggests that i) perturbations combinedly reduced the average correctness score by 77%, ii) the drop in correctness caused by these perturbations was affected based on their detectability.
Paper Type: Long
Research Area: NLP Applications
Research Area Keywords: educational applications; code generation;
Contribution Types: Model analysis & interpretability, NLP engineering experiment
Languages Studied: Python Programming Language
Submission Number: 693
Loading