• Author and genre identification of Turkish news texts using deep learning algorithms

    • Fulltext

       

        Click here to view fulltext PDF


      Permanent link:
      https://www.ias.ac.in/article/fulltext/sadh/047/0194

    • Keywords

       

      Author identification; genre identification; deep learning; text classification; Turkish news datasets; machine learning.

    • Abstract

       

      Nowadays, the increasing amount of data has brought the need to classify the data. Text classification is the process of categorizing similar text data. This paper aims to make a modeling study for author and genre identification, which is one of the important challenges of text classification, for Turkish news texts byusing machine and deep learning algorithms. For this purpose, firstly, a total of 13 large-scale datasets having multi classes are built as new datasets. In the modeling stage, Multinomial Naı¨ve Bayes (MNB), Random Forest (RF), Convolutional Neural Network (CNN), and Long Short Term Memory (LSTM) algorithms were applied to the datasets. Results showed that for dataset AI-TNKU-7, the CNN algorithm demonstrated the highest accuracy for author identification at 95.81%. In relation to genre identification, the LSTM algorithm for the dataset GITNKU- 6 demonstrated the highest accuracy at 96.73%.

    • Author Affiliations

       

      PINAR TÜFEKCI1 MELİKE BEKTAŞ2

      1. Department of Computer Engineering, Corlu Faculty of Engineering, Tekirdag Namik Kemal University, Tekirdag, Turkey
      2. Department of Information Technologies, Bursa Technical University, Bursa, Turkey
    • Dates

       
  • Sadhana | News

    • Editorial Note on Continuous Article Publication

      Posted on July 25, 2019

      Click here for Editorial Note on CAP Mode

© 2022-2023 Indian Academy of Sciences, Bengaluru.