TY - GEN
T1 - Iterative Student Program Planning using Transformer-Driven Feedback
AU - Rivera, Elijah
AU - Steinmaurer, Alexander
AU - Fisler, Kathi
AU - Krishnamurthi, Shriram
N1 - Publisher Copyright:
© 2024 Owner/Author.
PY - 2024/7/3
Y1 - 2024/7/3
N2 - Problem planning is a fundamental programming skill, and aids students in decomposing tasks into manageable subtasks. While feedback on plans is beneficial for beginners, providing this in a scalable and timely way is an enormous challenge in large courses. Recent advances in LLMs raise the prospect of helping here. We utilize LLMs to generate code based on students' plans, and evaluate the code against expert-defined test suites. Students receive feedback on their plans and can refine them. In this report, we share our experience with the design and implementation of this workflow. This tool was used by 544 students in a CS1 course at an Austrian university. We developed a codebook to evaluate their plans and manually applied it to a sample. We show that LLMs can play a valuable role here. However, we also highlight numerous cautionary aspects of using LLMs in this context, many of which will not be addressed merely by having more powerful models (and indeed may be exacerbated by it).
AB - Problem planning is a fundamental programming skill, and aids students in decomposing tasks into manageable subtasks. While feedback on plans is beneficial for beginners, providing this in a scalable and timely way is an enormous challenge in large courses. Recent advances in LLMs raise the prospect of helping here. We utilize LLMs to generate code based on students' plans, and evaluate the code against expert-defined test suites. Students receive feedback on their plans and can refine them. In this report, we share our experience with the design and implementation of this workflow. This tool was used by 544 students in a CS1 course at an Austrian university. We developed a codebook to evaluate their plans and manually applied it to a sample. We show that LLMs can play a valuable role here. However, we also highlight numerous cautionary aspects of using LLMs in this context, many of which will not be addressed merely by having more powerful models (and indeed may be exacerbated by it).
KW - automated feedback
KW - llms
KW - program planning
UR - http://www.scopus.com/inward/record.url?scp=85198903482&partnerID=8YFLogxK
U2 - 10.1145/3649217.3653607
DO - 10.1145/3649217.3653607
M3 - Conference paper
AN - SCOPUS:85198903482
T3 - Annual Conference on Innovation and Technology in Computer Science Education, ITiCSE
SP - 45
EP - 51
BT - ITiCSE 2024 - Proceedings of the 2024 Conference Innovation and Technology in Computer Science Education
PB - Association of Computing Machinery
T2 - 29th Conference Innovation and Technology in Computer Science Education
Y2 - 8 July 2024 through 10 July 2024
ER -