• Fulltext

       



      Permanent link:
      https://www.ias.ac.in/article/fulltext/jbsc/040/03/0571-0577

    • Keywords

       

      Data archival; data dissemination; file format; RNA; RNA secondary structure

    • Abstract

       

      Given the importance of RNA secondary structure in defining the biological role of RNA molecules, it would be convenient for researchers seeking RNA data if both sequence and structural information were made available together. Current nucleotide data repositories archive only RNA sequence data. Furthermore, storage formats that can frugally represent RNA sequence and structure data in a single file are currently unavailable. This article proposes a novel storage format, 'FASTR', for concomitant representation of RNA sequence and structure. The storage efficiency of the proposed FASTR format has been evaluated using RNA data from various microorganisms. Results indicate that the size of FASTR-formatted files (containing both RNA sequence and structure information) is equivalent to that of FASTA-format files, which contain only RNA sequence information. RNA secondary structure is typically represented using a string of nucleotide characters together with the corresponding dot-bracket notation indicating structural attributes. FASTR, the novel storage format proposed in the present study, enables a frugal representation of both RNA sequence and structural information in the form of a single string. Despite having a smaller storage footprint, the resultant 'fastr' string(s) retain all the sequence and secondary structural information that could be stored using dot-bracket notation. An implementation of the 'FASTR' methodology is available for download at http://metagenomics.atc.tcs.com/compression/fastr.
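      As a rough illustration of the two ideas mentioned in the abstract, the sketch below first recovers base pairs from a dot-bracket string and then folds sequence and structure into one string of the same length. The abstract does not describe the actual FASTR encoding, so the 12-symbol table `TOY_CODE` and the helper names `base_pairs`, `merge` and `split` are purely hypothetical assumptions made for this example; it also assumes a pseudoknot-free structure written with only `.`, `(` and `)`.

```python
# Illustrative sketch only: this is NOT the published FASTR encoding.
BASES = "ACGU"

# Hypothetical lookup table (an assumption for this example):
# one output character per (structure mark, base) combination.
TOY_CODE = {".": "ACGU", "(": "acgu", ")": "1234"}
TOY_DECODE = {
    symbols[i]: (BASES[i], mark)
    for mark, symbols in TOY_CODE.items()
    for i in range(len(BASES))
}


def base_pairs(structure):
    """Return sorted (i, j) index pairs encoded by a dot-bracket string."""
    stack, pairs = [], []
    for i, mark in enumerate(structure):
        if mark == "(":
            stack.append(i)            # remember the opening position
        elif mark == ")":
            if not stack:
                raise ValueError("unbalanced ')' at position %d" % i)
            pairs.append((stack.pop(), i))
        elif mark != ".":
            raise ValueError("unexpected symbol %r" % mark)
    if stack:
        raise ValueError("unbalanced '(' in structure")
    return sorted(pairs)


def merge(sequence, structure):
    """Fold sequence and dot-bracket structure into a single string."""
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure lengths differ")
    return "".join(
        TOY_CODE[mark][BASES.index(base)]
        for base, mark in zip(sequence, structure)
    )


def split(merged):
    """Recover (sequence, structure) from a string produced by merge()."""
    decoded = [TOY_DECODE[ch] for ch in merged]
    sequence = "".join(base for base, _ in decoded)
    structure = "".join(mark for _, mark in decoded)
    return sequence, structure


if __name__ == "__main__":
    seq, struct = "GGGAAAUCC", "(((...)))"
    print(base_pairs(struct))
    merged = merge(seq, struct)
    print(merged, split(merged) == (seq, struct))
```

      The point of the toy table is only that a per-position alphabet of 4 × 3 = 12 symbols is enough to carry both a nucleotide and its dot-bracket mark, so the merged string is no longer than the sequence alone. The real format may work quite differently.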

    • Author Affiliations

       

      Tungadri Bose¹, Anirban Dutta¹, Mohammed Mh¹, Hemang Gandhi¹, Sharmila S Mande¹

      1. Bio-Sciences R&D Division, TCS Innovation Labs, Tata Consultancy Services Limited, Pune 411 013, India
  • Journal of Biosciences | News

© 2017-2019 Indian Academy of Sciences, Bengaluru.