Vol. 6 No. 3 (2021)
Mechanical and Vehicle Engineering Sciences

LSI with Support Vector Machine for Text Categorization – a practical example with Python

Published November 19, 2021
Dr. József Menyhárt
University of Debrecen, Faculty of Engineering, Department of Air and Road Vehicles
Joao Henrique Gomes Da Costa Cavalcanti

APA

Dr. Menyhárt, J., & Gomes Da Costa Cavalcanti, J. H. (2021). LSI with Support Vector Machine for Text Categorization – a practical example with Python. International Journal of Engineering and Management Sciences, 6(3). https://doi.org/10.21791/IJEMS.2021.3.2.

Artificial intelligence is becoming a powerful tool of modern science, and there is even a scientific consensus that our society is turning into a data-driven one. Machine learning is a branch of artificial intelligence that can learn from data and capture its behavior. Python, which addresses the challenges of this new era, is becoming one of the most popular languages for general programming and scientific computing. With these circumstances in mind, this article aims to show an example of how to use one supervised machine learning method, the Support Vector Machine, to predict a movie's genre from its description in Python. First, the official OMDb API was used to gather data about movies; then a tuned Support Vector Machine model built on latent semantic indexing (LSI) was coded to predict movie genres from their plots. The performance of the model proved satisfactory considering the small dataset used and the occurrence of movies with hybrid genres. Testing the model with a larger dataset and using multi-label classification models were proposed to improve the model.
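
The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' code: the abstract does not name the libraries used, so scikit-learn is an assumption here, with `TruncatedSVD` applied to TF-IDF vectors standing in for LSI and `GridSearchCV` standing in for the tuning step. The plots and genre labels are invented toy data, and the step names `tfidf`, `lsi` and `svc` are arbitrary.

```python
from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.svm import SVC

# Invented toy corpus (NOT data from the article): plot text -> single genre.
plots = [
    "A haunted house terrorizes a family with ghosts and screams at night",
    "A masked killer stalks teenagers in a dark forest",
    "Demons possess a young girl and a priest fights the evil spirit",
    "Zombies attack a small town and survivors hide in fear",
    "Two clumsy friends start a silly business and hilarious chaos follows",
    "A wedding goes wrong in funny ways when the goofy best man loses the ring",
    "An awkward teacher tells jokes and wins a comedy contest",
    "A goofy dog causes hilarious trouble at a family picnic",
]
genres = ["horror"] * 4 + ["comedy"] * 4

# TF-IDF term weighting, then a truncated SVD of the document-term matrix
# (the usual way to compute LSI), then a linear Support Vector Machine.
pipeline = Pipeline([
    ("tfidf", TfidfVectorizer(stop_words="english")),
    ("lsi", TruncatedSVD(n_components=2, random_state=0)),
    ("svc", SVC(kernel="linear")),
])

# Assumed tuning step: a small cross-validated grid search over the SVM
# regularization strength C. The grid values are illustrative only.
search = GridSearchCV(pipeline, {"svc__C": [0.1, 1.0, 10.0]}, cv=2)
search.fit(plots, genres)

new_plots = [
    "Ghosts scream in a dark haunted forest",
    "A silly teacher causes hilarious chaos at a wedding",
]
predictions = search.predict(new_plots)
print(list(predictions))
```

With a corpus this small the predictions are not meaningful; the sketch only shows how the three stages fit together. Handling the hybrid-genre movies mentioned in the abstract would require a multi-label setup instead of one label per plot.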