pp 375-378 July 2012
pp 379-397 July 2012 Articles
This article provides a retrospective on the ABC initiative in the area of all-atom molecular dynamics (MD) simulations including explicit solvent on all tetranucleotide steps of duplex B-form DNA duplex, ca. 2012. The ABC consortium has completed two phases of simulations, the most current being a set of 50–100 trajectories based on the AMBER ff99 force field together with the parmbsc0 modification. Some general perspectives on the field of MD on DNA and sequence effects on DNA structure are provided, followed by an overview our MD results, including a detailed comparison of the ff99/parmbsc0 results with crystal and NMR structures available for d(CGCGAATTCGCG). Some projects inspired by or related to the ABC initiative and database are also reviewed, including methods for the trajectory analyses, informatics of dealing with the large database of results, compressions of trajectories for efficacy of distribution, DNA solvation by water and ions, parameterization of coarse-grained models with applications and gene finding and genome annotation
pp 399-421 July 2012 Articles
Detailed analyses of the sequence-dependent solvation and ion atmosphere of DNA are presented based on molecular dynamics (MD) simulations on all the 136 unique tetranucleotide steps obtained by the ABC consortium using the AMBER suite of programs. Significant sequence effects on solvation and ion localization were observed in these simulations. The results were compared to essentially all known experimental data on the subject. Proximity analysis was employed to highlight the sequence dependent differences in solvation and ion localization properties in the grooves of DNA. Comparison of the MD-calculated DNA structure with canonical A- and B-forms supports the idea that the G/C-rich sequences are closer to canonical A- than B-form structures, while the reverse is true for the poly A sequences, with the exception of the alternating ATAT sequence. Analysis of hydration density maps reveals that the flexibility of solute molecule has a significant effect on the nature of observed hydration. Energetic analysis of solute–solvent interactions based on proximity analysis of solvent reveals that the GC or CG base pairs interactmore strongly with watermolecules in the minor groove of DNA that the AT or TA base pairs, while the interactions of the AT or TA pairs in the major groove are stronger than those of the GC or CG pairs. Computation of solvent-accessible surface area of the nucleotide units in the simulated trajectories reveals that the similarity with results derived from analysis of a database of crystallographic structures is excellent. The MD trajectories tend to follow Manning’s counterion condensation theory, presenting a region of condensed counterions within a radius of about 17 Å from the DNA surface independent of sequence. The GC and CG pairs tend to associate with cations in the major groove of the DNA structure to a greater extent than the AT and TA pairs. Cation association is more frequent in the minor groove of AT than the GC pairs. In general, the observed water and ion atmosphere around the DNA sequences is the MD simulation is in good agreement with experimental observations.
pp 423-431 July 2012 Articles
Promoter regions in the genomes of all domains of life show similar trends in several structural properties such as stability, bendability, curvature, etc. In current study we analysed the stability and bendability of various classes of promoter regions (based on the recent identification of different classes of transcription start sites) of Helicobacter pylori 26695 strain. It is found that primary TSS and operon-associated TSS promoters show significantly strong features in their promoter regions. DNA free-energy-based promoter prediction tool PromPredict was used to annotate promoters of different classes, and very high recall values (∼80%) are obtained for primary TSS. Orthologous genes from other strains of H. pylori show conservation of structural properties in promoter regions as well as coding regions. PromPredict annotates promoters of orthologous genes with very high recall and precision.
pp 433-444 July 2012 Articles
We present here a novel methodology for predicting new genes in prokaryotic genomes on the basis of inherent energetics of DNA. Regions of higher thermodynamic stability were identified, which were filtered based on already known annotations to yield a set of potentially new genes. These were then processed for their compatibility with the stereo-chemical properties of proteins and tripeptide frequencies of proteins in Swissprot data, which results in a reliable set of new genes in a genome. Quite surprisingly, the methodology identifies new genes even in well-annotated genomes. Also, the methodology can handle genomes of any GC-content, size and number of annotated genes.
pp 445-455 July 2012 Articles
The three-dimensional structure of DNA contains various sequence-dependent structural information, which control many cellular processes in life, such as replication, transcription, DNA repair, etc. For the above functions, DNA double helices need to unwind or melt locally, which is different from terminal melting, as often seen in molecular dynamics (MD) simulations or even in many DNA crystal structures. We have carried out detailed MD simulations of DNA double helices of regular oligonucleotide fragments as well as in polymeric constructs with water and charge-neutralizing counter-ions at several different temperatures. We wanted to eliminate the end-effect or terminal melting propensity by employing MD simulation of DNA oligonucleotides in such a manner that gives rise to properties of polymeric DNA of infinite length. The polymeric construct is expected to allow us to see local melting at elevated temperatures. Comparative structural analysis of oligonucleotides and its corresponding virtual polymer at various temperatures ranging from 300 K to 400 K is discussed. The general behaviour, such as volume expansion coefficients of both the simulations show high similarity, indicating polymeric construct, does not give many artificial constraints. Local melting of a polymer, even at elevated temperature, may need a high nucleation energy that was not available in the short (7 ns) simulations. We expected to observe such nucleation followed by cooperative melting of the polymers in longer MD runs. Such simulations of different polymeric sequences would facilitate us to predict probable melting origins in a polymeric DNA.
pp 457-474 July 2012 Articles
Nucleic acid interaction with nanoscale objects like carbon nanotubes (CNTs) and dendrimers is of fundamental interest because of their potential application in CNT separation, gene therapy and antisense therapy. Combining nucleic acids with CNTs and dendrimers also opens the door towards controllable self-assembly to generate various supra-molecular and nano-structures with desired morphologies. The interaction between these nanoscale objects also serve as a model system for studying DNA compaction, which is a fundamental process in chromatin organization. By using fully atomistic simulations, here we report various aspects of the interactions and binding modes of DNA and small interfering RNA (siRNA) with CNTs, graphene and dendrimers. Our results give a microscopic picture and mechanism of the adsorption of single- and double-strand DNA (ssDNA and dsDNA) on CNT and graphene. The nucleic acid–CNT interaction is dominated by the dispersive van der Waals (vdW) interaction. In contrast, the complexation of DNA (both ssDNA and dsDNA) and siRNA with various generations of poly-amido-amine (PAMAM) dendrimers is governed by electrostatic interactions. Our results reveal that both the DNA and siRNA form stable complex with the PAMAM dendrimer at a physiological pH when the dendrimer is positively charged due to the protonation of the primary amines. The size and binding energy of the complex increase with increase in dendrimer generation. We also give a summary of the current status in these fields and discuss future prospects.
pp 475-481 July 2012 Articles
One class of small molecules with therapeutic potential for treatment of cancer functions as transcription inhibitors via interaction with double-stranded DNA. Majority of the studies of the interaction with DNA have so far been reported under conditions nonexistent in vivo. Inside the cell, DNA is present in the nucleus as a complex with proteins known as chromatin. For the last few years we have been studying the interaction of these DNA-binding small molecules at the chromatin level with emphasis on the drug-induced structural alterations in chromatin. Our studies have shown that at the chromatin level these molecules could be classified in two broad categories: single-binding and dual-binding molecules. Single-binding molecules access only DNA in the chromatin, while the dual-binding molecules could bind to both DNA and the associated histone(s). Structural effects of the DNA-binding molecules upon chromatin in light of the above broad categories and the associated biological implications of the two types of binding are discussed.
pp 483-491 July 2012 Articles
Zinc finger proteins interact via their individual fingers to three base pair subsites on the target DNA. The four key residue positions −1, 2, 3 and 6 on the alpha-helix of the zinc fingers have hydrogen bond interactions with the DNA. Mutating these key residues enables generation of a plethora of combinatorial possibilities that can bind to any DNA stretch of interest. Exploiting the binding specificity and affinity of the interaction between the zinc fingers and the respective DNA can help to generate engineered zinc fingers for therapeutic purposes involving genome targeting. Exploring the structure–function relationships of the existing zinc finger–DNA complexes can aid in predicting the probable zinc fingers that could bind to any target DNA. Computational tools ease the prediction of such engineered zinc fingers by effectively utilizing information from the available experimental data. A study of literature reveals many approaches for predicting DNA-binding specificity in zinc finger proteins. However, an alternative approach that looks into the physico-chemical properties of these complexes would do away with the difficulties of designing unbiased zinc fingers with the desired affinity and specificity. We present a physico-chemical approach that exploits the relative strengths of hydrogen bonding between the target DNA and all combinatorially possible zinc fingers to select the most optimum zinc finger protein candidate.
pp 493-502 July 2012 Articles
HIV Integrase (IN) is an enzyme that is responsible for the integration of the proviral genome into the human genome, and this integration step is the first step of the virus hijacking the human cell machinery for its propagation and replication. 10-23 DNAzyme has the potential to suppress gene expressions through sequence-specific mRNA cleavage. We have designed three novel DNAzymes, DIN54, DIN116, and DIN152, against HIV-1 Integrase gene using Mfold software and evaluated them for target site cleavage activity on the in vitro transcribed mRNA. All DNAzymes were tested for its inhibition of expression of HIV Integrase protein in the transiently transfected cell lines. DIN116 and DIN152 inhibited IN-EGFP expression by 80% and 70% respectively.
pp 503-517 July 2012 Articles
DNA is continuously attacked by reactive species that can affect its structure and function severely. Structural modifications to DNA mainly arise from modifications in its bases that primarily occur due to their exposure to different reactive species. Apart from this, DNA strand break, inter- and intra-strand crosslinks and DNA–protein crosslinks can also affect the structure of DNA significantly. These structural modifications are involved in mutation, cancer and many other diseases. As it has the least oxidation potential among all the DNA bases, guanine is frequently attacked by reactive species, producing a plethora of lethal lesions. Fortunately, living cells are evolved with intelligent enzymes that continuously protect DNA from such damages. This review provides an overview of different guanine lesions formed due to reactions of guanine with different reactive species. Involvement of these lesions in inter- and intra-strand crosslinks, DNA–protein crosslinks and mutagenesis are discussed. How certain enzymes recognize and repair different guanine lesions in DNA are also presented.
pp 519-532 July 2012 Articles
It is now established that a small fraction of genomic DNA does adopt the non-canonical B-DNA structure or ‘unusual’ DNA structure. The unusual DNA structures like DNA-hairpin, cruciform, Z-DNA, triplex and tetraplex are represented as hotspots of chromosomal breaks, homologous recombination and gross chromosomal rearrangements since they are prone to the structural alterations. Friedreich’s ataxia (FRDA), the autosomal recessive degenerative disorder of nervous and muscles tissue, is caused by the massive expansion of (GAA) repeats that occur in the first intron of Frataxin gene X25 on chromosome 9q13-q21.1. The purine strand of the DNA in the expanded (GAA) repeat region folds back to form the (R∙R*Y) type of triplex, which further inhibits the frataxin gene expression, and this clearly suggests that the shape of DNA is the determining factor in the cellular function. FRDA is the only disease known so far to be associated with DNA triplex. Structural characterization of GAA-containing DNA triplexes using some simple biophysical methods like UV melting, UV absorption, circular dichroic spectroscopy and electrophoretic mobility shift assay are discussed. Further, the clinical aspects and genetic analysis of FRDA patients who carry (GAA) repeat expansions are presented. The potential of some small molecules that do not favour the DNA triplex formation as therapeutics for FRDA are also briefly discussed.
pp 533-538 July 2012 Articles
The energy landscape of RNA is known to be extremely rugged, and hence finding low-energy structures starting from a random structure is a challenging task for any optimization algorithm. In the current work, we have investigated the ability of one Monte Carlo–based optimization algorithm, Temperature Basin Paving, to explore the energy landscape of a small RNA T-loop hairpin. In this method, the history of the simulation is used to increase the probability of states less visited in the simulation. It has been found that using both energy and end-to-end distance as the biasing parameters in the simulation, the partially folded structure of the hairpin starting from random structures could be obtained.
pp 539-552 July 2012 Articles
Studies on RNA targeting by small molecules to specifically control certain cellular functions is an area of remarkable current interest. For this purpose, a basic understanding of the molecular aspects of the interaction of small molecules with various RNA structures is essential. Alkaloids are a group of natural products with potential therapeutic utility, and very recently, their interaction with many RNA structures have been reported. Especially noteworthy are the protoberberines and aristolochia alkaloids distributed widely in many botanical families. Many of the alkaloids of these group exhibit excellent binding affinity to many RNA structures that may be exploited to develop RNA targeted therapeutics. This review attempts to present the current status on the understanding of the interaction of these alkaloids with various RNA structures, mainly highlighting the biophysical aspects.
pp 553-561 July 2012 Articles
Molecular docking, molecular mechanics, molecular dynamics and relaxation matrix simulation protocols have been extensively used to generate the structural details of ligand–receptor complexes in order to understand the binding interactions between the two entities. Experimental methods like NMR spectroscopy and X-ray crystallography are known to provide structural information about ligand–receptor complexes. In addition, fluorescence spectroscopy, circular dichroism (CD) spectroscopy and molecular docking have also been utilized to decode the phenomenon of the ligand–DNA interactions, with good correlation between experimental and computational results. The DNA binding affinity was demonstrated by analysing fluorescence spectral data. Structural rigidity of DNA upon ligand binding was identified by CD spectroscopy. Docking is carried out using the DNA-Dock program which results in the binding affinity data along with structural information like interatomic distances and H-bonding, etc. The complete structural analyses of various drug–DNA complexes have afforded results that indicate a specific DNA binding pattern of these ligands. It also exhibited that certain structural features of ligands can make a ligand to be AT- or GC-specific. It was also demonstrated that changing specificity from AT base pairs to GC base pairs further improved the DNA topoisomerase inhibiting activity in certain ligands. Thus, a specific molecular recognition signature encrypted in the structure of ligand can be decoded and can be effectively employed in designing more potent antiviral and antitumour agents.
pp 563-571 July 2012 Articles
Enhanced disease susceptibility 1 (EDS1), a plant-specific protein has homology with the eukaryotic lipase in their N-terminal halves and a unique domain at its C-termini. EDS1 is known to be an important regulator of biotic stress and an essential component of basal immunity. EDS1 interacts with its positive co-regulator phytoalexin deficient 4 (PAD4), resulting in mobilization of the salicylic acid defence pathway. Limited information regarding this interaction in rice is available. To study this interaction, a model of EDS1 and PAD4 proteins from rice was generated and validated with Accelrys DS software version 3.1 using bioinformatics interface. The in silico docking between the two proteins showed a significant protein–protein interaction between rice EDS1 and PAD4, suggesting that they form a dimeric protein complex, which, similar to that in Arabidopsis, is perhaps also important for triggering the salicylic acid signalling pathway in plants.
pp 573-577 July 2012 Articles
Proteins manifest themselves as phenotypic traits, retained or lost in living systems via evolutionary pressures. Simply put, survival is essentially the ability of a living system to synthesize a functional protein that allows for a response to environmental perturbations (adaptation). Loss of functional proteins leads to extinction. Currently there are no universally applicable quantitative metrics at the molecular level for either measuring ‘evolvability’ of life or for assessing the conditions under which a living system would go extinct and why. In this work, we show emergence of the first such metric by utilizing the recently discovered stoichiometric margin of life for all known naturally occurring (and functional) proteins. The constraint of having well-defined stoichiometries of the 20 amino acids in naturally occurring protein sequences requires utilization of the full scope of degeneracy in the genetic code, i.e. usage of all codons coding for an amino acid, by only 11 of the 20 amino acids. This shows that the non-availability of individual codons for these 11 amino acids would disturb the fine stoichiometric balance resulting in non-functional proteins and hence extinction. Remarkably, these amino acids are found in close proximity of any given amino acid in the backbones of thousands of known crystal structures of folded proteins. On the other hand, stoichiometry of the remaining 9 amino acids, found to be farther/distal from any given amino acid in backbones of folded proteins, is maintained independent of the number of codons available to synthesize them, thereby providing some robustness and hence survivability.