High correlation between the turnover of nucleotides under mutational pressure and the DNA composition
공공데이터포털
Background Any DNA sequence is a result of compromise between the selection and mutation pressures exerted on it during evolution. It is difficult to estimate the relative influence of each of these pressures on the rate of accumulation of substitutions. However, it is important to discriminate between the effect of mutations, and the effect of selection, when studying the phylogenic relations between taxa. Results We have tested in computer simulations, and analytically, the available substitution matrices for many genomes, and we have found that DNA strands in equilibrium under mutational pressure have unique feature: the fraction of each type of nucleotide is linearly dependent on the time needed for substitution of half of nucleotides of a given type, with a correlation coefficient close to 1. Substitution matrices found for sequences under selection pressure do not have this property. A substitution matrix for the leading strand of the Borrelia burgdorferi genome, having reached equilibrium in computer simulation, gives a DNA sequence with nucleotide composition and asymmetry corresponding precisely to the third positions in codons of protein coding genes located on the leading strand. Conclusions Parameters of mutational pressure allow us to count DNA composition in equilibrium with this mutational pressure. Comparing any real DNA sequence with the sequence in equilibrium it is possible to estimate the distance between these sequences, which could be used as a measure of the selection pressure. Furthermore, the parameters of the mutational pressure enable direct estimation of the relative mutation rates in any DNA sequence in the studied genome.
Distinct functions of
공공데이터포털
Background In most organisms proper reductional chromosome segregation during meiosis I is strongly correlated with the presence of crossover recombination structures (chiasmata); recombination deficient mutants lack crossovers and suffer meiosis I nondisjunction. We report that these functions are separable in the fission yeast Schizosaccharomyces pombe. Results Intron mapping and expression studies confirmed that Rec12 is a member of the Spo11/Top6A topoisomerase family required for the formation of meiotic dsDNA breaks and recombination. rec12-117, rec12-D15 (null), and rec12-Y98F (active site) mutants lacked most crossover recombination and chromosomes segregated abnormally to generate aneuploid meiotic products. Since S. pombe contains only three chromosome pairs, many of those aneuploid products were viable. The types of aberrant chromosome segregation were inferred from the inheritance patterns of centromere linked markers in diploid meiotic products. The rec12-117 and rec12-D15 mutants manifest segregation errors during both meiosis I and meiosis II. Remarkably, the rec12-Y98F (active site) mutant exhibited essentially normal meiosis I segregation patterns, but still exhibited meiosis II segregation errors. Conclusions Rec12 is a 345 amino acid protein required for most crossover recombination and for chiasmatic segregation of chromosomes during meiosis I. Rec12 also participates in a backup distributive (achiasmatic) system of chromosome segregation during meiosis I. In addition, catalytically-active Rec12 mediates some signal that is required for faithful equational segregation of chromosomes during meiosis II.
The molecular genetic linkage map of the model legume
공공데이터포털
Background The legume Medicago truncatula has emerged as a model plant for the molecular and genetic dissection of various plant processes involved in rhizobial, mycorrhizal and pathogenic plant-microbe interactions. Aiming to develop essential tools for such genetic approaches, we have established the first genetic map of this species. Two parental homozygous lines were selected from the cultivar Jemalong and from the Algerian natural population (DZA315) on the basis of their molecular and phenotypic polymorphism. Results An F2 segregating population of 124 individuals between these two lines was obtained using an efficient manual crossing technique established for M. truncatula and was used to construct a genetic map. This map spans 1225 cM (average 470 kb/cM) and comprises 289 markers including RAPD, AFLP, known genes and isoenzymes arranged in 8 linkage groups (2n = 16). Markers are uniformly distributed throughout the map and segregation distortion is limited to only 3 linkage groups. By mapping a number of common markers, the eight linkage groups are shown to be homologous to those of diploid alfalfa (M. sativa), implying a good level of macrosynteny between the two genomes. Using this M. truncatula map and the derived F3 populations, we were able to map the Mtsym6 symbiotic gene on linkage group 8 and the SPC gene, responsible for the direction of pod coiling, on linkage group 7. Conclusions These results demonstrate that Medicago truncatula is amenable to diploid genetic analysis and they open the way to map-based cloning of symbiotic or other agronomically-important genes using this model plant.
Polytene chromosomes as indicators of phylogeny in several species groups of
공공데이터포털
Background Polytene chromosome banding patterns have long been used by Drosophila evolutionists to infer degree of relatedness among taxa. Recently, nucleotide sequences have preempted this traditional method. We place the classical Drosophila evolutionary biology tools of polytene chromosome inversion analysis in a phylogenetic context and assess their utility in comparison to nucleotide sequences. Results A simultaneous analysis framework was used to examine the congruence of the chromosomal inversion data with more recent DNA sequence data in four Drosophila species groups – the melanogaster, virilis, repleta, and picture wing. Inversions and nucleotides were highly congruent with one another based on incongruence length difference and partitioned Bremer support values. Inversion phylogenies were less resolved because of fewer numbers of characters. Partitioned Bremer supports, corrected for the number of characters in each matrix, were higher for inversion matrices. Conclusions Polytene chromosome data are highly congruent with DNA sequence data and, when placed in a simultaneous analysis framework, are shown to be more information rich than nucleotide data.
A clustering method for repeat analysis in DNA sequences
공공데이터포털
Background A computational system for analysis of the repetitive structure of genomic sequences is described. The method uses suffix trees to organize and search the input sequences; this data structure has been used previously for efficient computation of exact and degenerate repeats. Results The resulting software tool collects all repeat classes and outputs summary statistics as well as a file containing multiple sequences (multi fasta), that can be used as the target of searches. Its use is demonstrated here on several complete microbial genomes, the entire Arabidopsis thaliana genome, and a large collection of rice bacterial artificial chromosome end sequences. Conclusions We propose a new clustering method for analysis of the repeat data captured in suffix trees. This method has been incorporated into a system that can find repeats in individual genome sequences or sets of sequences, and that can organize those repeats into classes. It quickly and accurately creates repeat databases from small and large genomes. The associated software (RepeatFinder), should prove helpful in the analysis of repeat structure for both complete and partial genome sequences.
Evidence for the adaptive significance of an LTR retrotransposon sequence in a
공공데이터포털
Background The potential adaptive significance of transposable elements (TEs) to the host genomes in which they reside is a topic that has been hotly debated by molecular evolutionists for more than two decades. Recent genomic analyses have demonstrated that TE fragments are associated with functional genes in plants and animals. These findings suggest that TEs may contribute significantly to gene evolution. Results We have analyzed two transposable elements associated with genes in the sequenced Drosophila melanogaster y; cn bw sp strain. A fragment of the Antonia long terminal repeat (LTR) retrotransposon is present in the intron of Chitinase 3 (Cht3), a gene located within the constitutive heterochromatin of chromosome 2L. Within the euchromatin of chromosome 2R a full-length Burdock LTR retrotransposon is located immediately 3' to cathD, a gene encoding cathepsin D. We tested for the presence of these two TE/gene associations in strains representing 12 geographically diverse populations of D. melanogaster. While the cathD insertion variant was detected only in the sequenced y; cn bw sp strain, the insertion variant present in the heterochromatic Cht3 gene was found to be fixed throughout twelve D. melanogaster populations and in a D. mauritiana strain suggesting that it maybe of adaptive significance. To further test this hypothesis, we sequenced a 685bp region spanning the LTR fragment in the intron of Cht3 in strains representative of the two sibling species D. melanogaster and D. mauritiana (~2.7 million years divergent). The level of sequence divergence between the two species within this region was significantly lower than expected from the neutral substitution rate and lower than the divergence observed between a randomly selected intron of the Drosophila Alcohol dehydrogenase gene (Adh). Conclusions Our results suggest that a 359 bp fragment of an Antonia retrotransposon (complete LTR is 659 bp) located within the intron of the Drosophila melanogaster Cht3 gene is of adaptive evolutionary significance. Our results are consistent with previous suggestions that the presence of TEs in constitutive heterochromatin may be of significance to the expression of heterochromatic genes.
A hierarchical statistical model for estimating population properties of quantitative genes
공공데이터포털
Background Earlier methods for detecting major genes responsible for a quantitative trait rely critically upon a well-structured pedigree in which the segregation pattern of genes exactly follow Mendelian inheritance laws. However, for many outcrossing species, such pedigrees are not available and genes also display population properties. Results In this paper, a hierarchical statistical model is proposed to monitor the existence of a major gene based on its segregation and transmission across two successive generations. The model is implemented with an EM algorithm to provide maximum likelihood estimates for genetic parameters of the major locus. This new method is successfully applied to identify an additive gene having a large effect on stem height growth of aspen trees. The estimates of population genetic parameters for this major gene can be generalized to the original breeding population from which the parents were sampled. A simulation study is presented to evaluate finite sample properties of the model. Conclusions A hierarchical model was derived for detecting major genes affecting a quantitative trait based on progeny tests of outcrossing species. The new model takes into account the population genetic properties of genes and is expected to enhance the accuracy, precision and power of gene detection.
The genetic structure of recombinant inbred mice: high-resolution consensus maps for complex trait analysis
공공데이터포털
Background Recombinant inbred (RI) strains of mice are an important resource used to map and analyze complex traits. They have proved particularly effective in multidisciplinary genetic studies. Widespread use of RI strains has been hampered by their modest numbers and by the difficulty of combining results derived from different RI sets. Results We have increased the density of typed microsatellite markers two- to five-fold in each of several major RI sets that share C57BL/6 as a parental strain (AXB, BXA, BXD, BXH and CXB). A common set of 490 markers was genotyped in just over 100 RI strains. Genotypes of around 1,100 additional microsatellites in one or more RI sets were generated, collected and checked for errors. Consensus RI maps that integrate genotypes of approximately 1,600 microsatellite loci were assembled. The genomes of individual strains typically incorporate 45-55 recombination breakpoints. The collected RI set - termed the BXN set - contains approximately 5,000 breakpoints. The distribution of recombinations approximates a Poisson distribution and distances between breakpoints average about 0.5 centimorgans (cM). Locations of most breakpoints have been defined with a precision of < 2 cM. Genotypes deviate from Hardy-Weinberg equilibrium in only a small number of intervals. Conclusions Consensus maps derived from RI strains conform almost exactly to theoretical expectation and are close to the length predicted by the Haldane-Waddington equation (x3.6 for a 2-3 cM interval between markers). Non-syntenic associations between different chromosomes introduce predictable distortions in quantitative trait locus (QTL) datasets that can be partly corrected using two-locus correlation matrices.
The society of genes: networks of functional links between genes from comparative genomics
공공데이터포털
Comparative genomics provides at least three methods for identifying functional links between genes: examination of phylogenetic distributions, analysis of conserved proximity and observations of fusions of genes into a multidomain gene in another organism. We show that the functional networks obtained by applying these methods have different topologies and that the information they provide is largely additive. In particular, the combined networks of functional links contain an average of 57% of an organism's complete genetic complement, uncover substantial portions of known pathways, and suggest the function of previously unannotated genes. In addition, the combined networks are qualitatively different from the networks obtained using individual methods.