Articles

Integrating elements of data science into high-school teaching: Naïve Bayes-classification algorithm and programming in Python

Published:
November 4, 2020
Authors
View
Keywords
License

Copyright (c) 2020 Ödön Vancsó, Péter Princz

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

How To Cite
Selected Style: APA
Vancsó, Ödön, & Princz, P. (2020). Integrating elements of data science into high-school teaching: Naïve Bayes-classification algorithm and programming in Python. Teaching Mathematics and Computer Science, 18(4), 307-316. https://doi.org/10.5485/TMCS.2020.0494
Abstract

Probability theory and mathematical statistics are traditionally one of the most difficult chapters of mathematics to teach. One of the authors, Péter Princz has experience in teaching various topics via computer programming of the problem at hand as a class activity. The proposed method is to involve programming as a didactic tool in hard-to-teach topics. The intended goal in this case is to implement a naïve Bayes-classifier algorithm in Python and demonstrate the machine-learning capabilities of it by applying it to a real-world dataset of edible or poisonous mushrooms. The students would implement the algorithm in a playful and interactive way. The proposed incremental development process aligns well with the spirit of Tamás Varga who considered computers as modern tools of experimental problem solving as early as in the 1960s.

Subject Classification: 97D40, 97D50, 97K50, 97K99, 97M60, 97P40, 97P50, 97U50