• Fulltext

       

        Click here to view fulltext PDF


      Permanent link:
      https://www.ias.ac.in/article/fulltext/sadh/044/08/0181

    • Keywords

       

      Natural language processing; Word Sense Disambiguation; supervised methodology; lemmatization; bootstrapping.

    • Abstract

       

      An attempt is made in this paper to report how a supervised methodology has been adopted for the task of Word Sense Disambiguation (WSD) in Bengali with necessary modifications. At the initial stage, four commonly used supervised methods, Decision Tree (DT), Support Vector Machine (SVM), Artificial NeuralNetwork (ANN) and Naı¨ve Bayes (NB), are developed at the baseline. These algorithms are applied individually on a data set of 13 most frequently used Bengali ambiguous words. On experimental basis, the baseline strategyis modified with two extensions: (a) inclusion of lemmatization process into the system and (b) bootstrapping of the operational process. As a result, the levels of accuracy of the baseline methods are slightly improved, which is a positive signal for the whole process of disambiguation as it opens scope for further modification of the existing method for better result. In this experiment, the data sets are prepared from the Bengali corpus, developed in the Technology Development for Indian Languages (TDIL) project of the Government of India andfrom the Bengali WordNet, which is developed at the Indian Statistical Institute, Kolkata. The paper reports the challenges and pitfalls of the work that have been closely observed during the experiment.

    • Author Affiliations

       

      ALOK RANJAN PAL1 DIGANTA SAHA2 NILADRI SEKHAR DASH3 SUDIP KUMAR NASKAR2 ANTARA PAL4

      1. Department of Computer Science and Engineering, College of Engineering and Management, Kolaghat 721171, India
      2. Department of Computer Science and Engineering, Jadavpur University, Kolkata 700032, India
      3. Linguistic Research Unit, Indian Statistical Institute, Kolkata 700108, India
      4. Department of Computer Science and Engineering, National Institute of Technology, Durgapur 713209, India
    • Dates

       
  • Sadhana | News

    • Editorial Note on Continuous Article Publication

      Posted on July 25, 2019

      Click here for Editorial Note on CAP Mode

© 2022-2023 Indian Academy of Sciences, Bengaluru.