Gépészeti és járműmérnöki tudományok

LSI with Support Vector Machine for Text Categorization – a practical example with Python

Megjelent:
2021-11-19
Szerzők
Megtekintés
Kulcsszavak
Licenc

Copyright (c) 2021 Joao Henrique Gomes Da Costa Cavalcanti, József Menyhárt

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

How To Cite
Kiválasztott formátum: APA
Gomes Da Costa Cavalcanti, J. H., & Menyhárt, J. (2021). LSI with Support Vector Machine for Text Categorization – a practical example with Python . International Journal of Engineering and Management Sciences, 6(3), 18-29. https://doi.org/10.21791/IJEMS.2021.3.2.
Beküldött 2021-09-06
Elfogadott 2021-10-07
Publikált 2021-11-19
Absztrakt

Artificial intelligence is becoming a powerful tool of modernity science, there is even a science consensus about how our society is turning to a data-driven society. Machine learning is a branch of Artificial intelligence that has the ability to learn from data and understand its behavers. Python programming language aiming the challenges of this new era is becoming one of the most popular languages for general programming and scientific computing. Keeping all this new era circumstances in mind, this article has as a goal to show one example of how to use one supervised machine learning method, Support Vector Machine, and to predict movie’s genre according to its description using the programming language of the moment, python. Firstly, Omdb official API was used to gather data about movies, then tuned Support Vector Machine model for Latent semantic indexing capable of predicting movies genres according to its plot was coded. The performance of the model occurred to be satisfactory considering the small dataset used and the occurrence of movies with hybrid genres. Testing the model with larger dataset and using multi-label classification models were purposed to improve the model.

Adatbázis logók