Conference papers

Using Machine Learning to Identify Patterns in Learner-Submitted Code for the Purpose of Assessment

Botond Tarcsay, Technological University Dublin, Dublin, Ireland
Fernando Perez-Tellez, Technological University Dublin, Dublin, Ireland
Jelena Vasic, Technological University Dublin, Dublin, Ireland

Author ORCID Identifier

https://orcid.org/0000-0003-3012-8278

Document Type

Article

Disciplines

1.2 COMPUTER AND INFORMATION SCIENCE, Computer Sciences

Publication Details

https://link.springer.com/chapter/10.1007/978-3-031-33783-3_5

Tarcsay, B., Perez-Tellez, F., Vasic, J. (2023). Using Machine Learning to Identify Patterns in Learner-Submitted Code for the Purpose of Assessment. In: Rodríguez-González, A.Y., Pérez-Espinosa, H., Martínez-Trinidad, J.F., Carrasco-Ochoa, J.A., Olvera-López, J.A. (eds) Pattern Recognition. MCPR 2023. Lecture Notes in Computer Science, vol 13902. Springer, Cham.

https://doi.org/10.1007/978-3-031-33783-3_5

Abstract

Programming has become an important skill in today’s world and is taught widely both in traditional and online settings. Instructors need to grade increasing amounts of student work. Unit testing can contribute to the automation of the grading process but it cannot assess the structure or partial correctness of code, which is needed for finely differentiated grading. This paper builds on previous research that investigated machine learning models for determining the correctness of programs from token-based features of source code and found that some such models can be successful in classifying source code with respect to whether it passes unit tests. This paper makes two further contributions. First, these results are scrutinized under conditions of varying similarity between code instances used for model training and testing, for a better understanding of how well the models generalize. It was found that the models do not generalize outside of groups of code instances performing very similar tasks (corresponding to similar coding assignments). Second, selected binary classification models are used as a base for multi-class prediction with two different methods. Both of these exhibit prediction success well above the random baseline, with potential to contribute to automated assessment with multi-valued measures of quality (grading schemes), in contrast to the binary pass/fail measure associated with unit testing.

DOI

https://doi.org/10.1007/978-3-031-33783-3_5

Recommended Citation

Tarcsay, Botond; Perez-Tellez, Fernando; and Vasic, Jelena, "Using Machine Learning to Identify Patterns in Learner-Submitted Code for the Purpose of Assessment" (2023). Conference papers. 401.
https://arrow.tudublin.ie/scschcomcon/401

Creative Commons License

This work is licensed under a Creative Commons Attribution-Share Alike 4.0 International License.

Download

Included in

Computer Engineering Commons

COinS

Conference papers

Using Machine Learning to Identify Patterns in Learner-Submitted Code for the Purpose of Assessment

Author ORCID Identifier

Document Type

Disciplines

Publication Details

Abstract

DOI

Recommended Citation

Creative Commons License

Included in

Search

Browse

Author Corner

Links

Conference papers

Using Machine Learning to Identify Patterns in Learner-Submitted Code for the Purpose of Assessment

Authors

Author ORCID Identifier

Document Type

Disciplines

Publication Details

Abstract

DOI

Recommended Citation

Creative Commons License

Included in

Share

Search

Browse

Author Corner

Links