Author and genre identification of Turkish news texts using deep learning algorithms
Click here to view fulltext PDF
Permanent link:
https://www.ias.ac.in/article/fulltext/sadh/047/0194
Nowadays, the increasing amount of data has brought the need to classify the data. Text classification is the process of categorizing similar text data. This paper aims to make a modeling study for author and genre identification, which is one of the important challenges of text classification, for Turkish news texts byusing machine and deep learning algorithms. For this purpose, firstly, a total of 13 large-scale datasets having multi classes are built as new datasets. In the modeling stage, Multinomial Naı¨ve Bayes (MNB), Random Forest (RF), Convolutional Neural Network (CNN), and Long Short Term Memory (LSTM) algorithms were applied to the datasets. Results showed that for dataset AI-TNKU-7, the CNN algorithm demonstrated the highest accuracy for author identification at 95.81%. In relation to genre identification, the LSTM algorithm for the dataset GITNKU- 6 demonstrated the highest accuracy at 96.73%.
Volume 48, 2023
All articles
Continuous Article Publishing mode
Click here for Editorial Note on CAP Mode
© 2022-2023 Indian Academy of Sciences, Bengaluru.