• Knowledge discovery through text-based similarity searches for astronomy literature

    • Fulltext


        Click here to view fulltext PDF

      Permanent link:

    • Keywords


      Natural language processing; methods: statistical.

    • Abstract


      The increase in the number of researchers coupled with the ease of publishing and distribution of scientific papers (due to technological advancements) has resulted in a dramatic increase in astronomy literature.This has likely led to the predicament that the body of the literature is too large for traditional human consumption and that related and crucial knowledge is not discovered by researchers. In addition to the increased production of astronomical literature, recent decades have also brought several advancements in computational linguistics. Especially, the machine-aided processing of literature dissemination might make it possible to convert this stream of papers into a coherent knowledge set. In this paper, we present the application of computational linguistics techniques to astronomy literature. In particular, we developed a tool that will find similar articles purely based on text content from an input paper. We find that our technique performs robustly in comparisonwith other tools recommending articles given a reference paper (known as recommender system). Our novel tool shows great power in combining computational linguistics with astronomy literature and suggests thatadditional research in this endeavor will likely produce even better tools that will help researchers cope with vast amounts of knowledge being produced.

    • Author Affiliations



      1. Center for Cosmology and Particle Physics, New York University, 726 Broadway, New York, NY 10003, USA.
      2. European Southern Observatory, Karl-Schwarzschild-Strasse 2, 85748 Garching, Germany.
    • Dates

  • Journal of Astrophysics and Astronomy | News

    • Continuous Article Publication

      Posted on January 27, 2016

      Since January 2016, the Journal of Astrophysics and Astronomy has moved to Continuous Article Publishing (CAP) mode. This means that each accepted article is being published immediately online with DOI and article citation ID with starting page number 1. Articles are also visible in Web of Science immediately. All these have helped shorten the publication time and have improved the visibility of the articles.

    • Editorial Note on Continuous Article Publication

      Posted on July 25, 2019

      Click here for Editorial Note on CAP Mode

© 2017-2019 Indian Academy of Sciences, Bengaluru.