데이터셋 상세
미국
The process of genome shrinkage in the obligate symbiont
Background Very small genomes have evolved repeatedly in eubacterial lineages that have adopted obligate associations with eukaryotic hosts. Complete genome sequences have revealed that small genomes retain very different gene sets, raising the question of how final genome content is determined. To examine the process of genome reduction, the tiny genome of the endosymbiont Buchnera aphidicola was compared to the larger ancestral genome, reconstructed on the basis of the phylogenetic distribution of gene orthologs among fully sequenced relatives of Escherichia coli and Buchnera. Results The reconstructed ancestral genome contained 2,425 open reading frames (ORFs). The Buchnera genome, containing 564 ORFs, consists of 153 fragments of 1-34 genes that are syntenic with reconstructed ancestral regions. On the basis of this reconstruction, 503 genes were eliminated within syntenic fragments, and 1,403 genes were lost from the gaps between syntenic fragments, probably in connection with genome rearrangements. Lost regions are sometimes large, and often span functionally unrelated genes. In addition, individual genes and regulatory regions have been lost or eroded. For the categories of DNA repair genes and rRNA genes, most lost loci fall in regions between syntenic fragments. This history of gene loss is reflected in the sequences of intergenic spacers at positions where genes were once present. Conclusions The most plausible interpretation of this reconstruction is that Buchnera lost many genes through the fixation of large deletions soon after the acquisition of an obligate endosymbiotic lifestyle. An implication is that final genome composition may be partly the chance outcome of initial deletions and that neighboring genes influence the likelihood of loss of particular genes and pathways.
데이터 정보
연관 데이터
Evidence for large domains of similarly expressed genes in the
공공데이터포털
Background Transcriptional regulation in eukaryotes generally operates at the level of individual genes. Regulation of sets of adjacent genes by mechanisms operating at the level of chromosomal domains has been demonstrated in a number of cases, but the fraction of genes in the genome subject to regulation at this level is unknown. Results Drosophila gene-expression profiles that were determined from over 80 experimental conditions using high-density oligonucleotide microarrays were searched for groups of adjacent genes that show similar expression profiles. We found about 200 groups of adjacent and similarly expressed genes, each having between 10 and 30 members; together these groups account for over 20% of assayed genes. Each group covers between 20 and 200 kilobase pairs of genomic sequence, with a mean group size of about 100 kilobase pairs. Groups do not appear to show any correlation with polytene banding patterns or other known chromosomal structures, nor were genes within groups functionally related to one another. Conclusions Groups of adjacent and co-regulated genes that are not otherwise functionally related in any obvious way can be identified by expression profiling in Drosophila. The mechanism underlying this phenomenon is not yet known.
DNA loops and semicatenated DNA junctions
공공데이터포털
Background Alternative DNA conformations are of particular interest as potential signals to mark important sites on the genome. The structural variability of CA microsatellites is particularly pronounced; these are repetitive poly(CA) · poly(TG) DNA sequences spread in all eukaryotic genomes as tracts of up to 60 base pairs long. Many in vitro studies have shown that the structure of poly(CA) · poly(TG) can vary markedly from the classical right handed DNA double helix and adopt diverse alternative conformations. Here we have studied the mechanism of formation and the structure of an alternative DNA structure, named Form X, which was observed previously by polyacrylamide gel electrophoresis of DNA fragments containing a tract of the CA microsatellite poly(CA) · poly(TG) but had not yet been characterized. Results Formation of Form X was found to occur upon reassociation of the strands of a DNA fragment containing a tract of poly(CA) · poly(TG), in a process strongly stimulated by the nuclear proteins HMG1 and HMG2. By inserting Form X into DNA minicircles, we show that the DNA strands do not run fully side by side but instead form a DNA knot. When present in a closed DNA molecule, Form X becomes resistant to heating to 100°C and to alkaline pH. Conclusions Our data strongly support a model of Form X consisting in a DNA loop at the base of which the two DNA duplexes cross, with one of the strands of one duplex passing between the strands of the other duplex, and reciprocally, to form a semicatenated DNA junction also called a DNA hemicatenane.
Conservation of long-range synteny and microsynteny between the genomes of two distantly related nematodes
공공데이터포털
To assess whether the pattern of high rates of genome rearrangement, with a bias towards within-chromosome events is true of nematodes in general, genome sequence was used to compare the model Caenorhabditis elegans and the filarial parasite Brugia malayi. It is suggested that intrachromosomal rearrangement is a major force driving chromosomal organization in nematodes.
Transcriptional territories in the genome
공공데이터포털
An analysis of numerous Drosophila microarray experiments reveals that the genome has many large groups of adjacent genes that are expressed similarly but are not functionally related.
Long-term experimental evolution in
공공데이터포털
Background Twelve populations of the bacterium, Escherichia coli, adapted to a simple, glucose-limited, laboratory environment over 10,000 generations. As a consequence, these populations tended to lose functionality on alternative resources. I examined whether these populations in turn became inferior competitors in four alternative environments. These experiments are among the first to quantify and compare dimensions of the fundamental and realized niches. Results Three clones were isolated from each of the twelve populations after 10,000 generations of evolution. Direct competition between these clones and the ancestor in the selective environment revealed average fitness improvements of ~50%. When grown in the wells of Biolog plates, however, evolved clones grew 25% worse on average than the ancestor on a variety of different carbon sources. Next, I competed each evolved population versus the ancestor in four foreign environments (10-fold higher and lower glucose concentration, added bile salts, and dilute LB media). Surprisingly, nearly all populations were more fit than the ancestor in each foreign environment, though the margin of improvement was least in the most different environment. Most populations also evolved increased sensitivity to novobiocin. Conclusions Reduced functionality on numerous carbon sources suggested that the fundamental niche of twelve E. coli populations had narrowed after adapting to a specific laboratory environment. However, in spite of these results, the same populations were competitively superior in four novel environments. These findings suggest that adaptation to certain dimensions of the environment may compensate for other functional losses and apparently enhance the realized niche.
Long-term experimental evolution in
공공데이터포털
Background Experimental populations of Escherichia coli have evolved for 20,000 generations in a uniform environment. Their rate of improvement, as measured in competitions with the ancestor in that environment, has declined substantially over this period. This deceleration has been interpreted as the bacteria approaching a peak or plateau in a fitness landscape. Alternatively, this deceleration might be caused by non-transitive competitive interactions, in particular such that the measured advantage of later genotypes relative to earlier ones would be greater if they competed directly. Results To distinguish these two hypotheses, we performed a large set of competitions using one of the evolved lines. Twenty-one samples obtained at 1,000-generation intervals each competed against five genetically marked clones isolated at 5,000-generation intervals, with three-fold replication. The pattern of relative fitness values for these 315 pairwise competitions was compared with expectations under transitive and non-transitive models, the latter structured to produce the observed deceleration in fitness relative to the ancestor. In general, the relative fitness of later and earlier generations measured by direct competition agrees well with the fitness inferred from separately competing each against the ancestor. These data thus support the transitive model. Conclusion Non-transitive competitive interactions were not a major feature of evolution in this population. Instead, the pronounced deceleration in its rate of fitness improvement indicates that the population early on incorporated most of those mutations that provided the greatest gains, and subsequently relied on beneficial mutations that were fewer in number, smaller in effect, or both.
Genome trees constructed using five different approaches suggest new major bacterial clades
공공데이터포털
Background The availability of multiple complete genome sequences from diverse taxa prompts the development of new phylogenetic approaches, which attempt to incorporate information derived from comparative analysis of complete gene sets or large subsets thereof. Such attempts are particularly relevant because of the major role of horizontal gene transfer and lineage-specific gene loss, at least in the evolution of prokaryotes. Results Five largely independent approaches were employed to construct trees for completely sequenced bacterial and archaeal genomes: i) presence-absence of genomes in clusters of orthologous genes; ii) conservation of local gene order (gene pairs) among prokaryotic genomes; iii) parameters of identity distribution for probable orthologs; iv) analysis of concatenated alignments of ribosomal proteins; v) comparison of trees constructed for multiple protein families. All constructed trees support the separation of the two primary prokaryotic domains, bacteria and archaea, as well as some terminal bifurcations within the bacterial and archaeal domains. Beyond these obvious groupings, the trees made with different methods appeared to differ substantially in terms of the relative contributions of phylogenetic relationships and similarities in gene repertoires caused by similar life styles and horizontal gene transfer to the tree topology. The trees based on presence-absence of genomes in orthologous clusters and the trees based on conserved gene pairs appear to be strongly affected by gene loss and horizontal gene transfer. The trees based on identity distributions for orthologs and particularly the tree made of concatenated ribosomal protein sequences seemed to carry a stronger phylogenetic signal. The latter tree supported three potential high-level bacterial clades,: i) Chlamydia-Spirochetes, ii) Thermotogales-Aquificales (bacterial hyperthermophiles), and ii) Actinomycetes-Deinococcales-Cyanobacteria. The latter group also appeared to join the low-GC Gram-positive bacteria at a deeper tree node. These new groupings of bacteria were supported by the analysis of alternative topologies in the concatenated ribosomal protein tree using the Kishino-Hasegawa test and by a census of the topologies of 132 individual groups of orthologous proteins. Additionally, the results of this analysis put into question the sister-group relationship between the two major archaeal groups, Euryarchaeota and Crenarchaeota, and suggest instead that Euryarchaeota might be a paraphyletic group with respect to Crenarchaeota. Conclusions We conclude that, the extensive horizontal gene flow and lineage-specific gene loss notwithstanding, extension of phylogenetic analysis to the genome scale has the potential of uncovering deep evolutionary relationships between prokaryotic lineages.
Interkingdom gene fusions
공공데이터포털
Background: Genome comparisons have revealed major lateral gene transfer between the three primary kingdoms of life - Bacteria, Archaea, and Eukarya. Another important evolutionary phenomenon involves the evolutionary mobility of protein domains that form versatile multidomain architectures. We were interested in investigating the possibility of a combination of these phenomena, with an invading gene merging with a pre-existing gene in the recipient genome. Results: Complete genomes of fifteen bacteria, four archaea and one eukaryote were searched for interkingdom gene fusions (IKFs); that is, genes coding for proteins that apparently consist of domains originating from different primary kingdoms. Phylogenetic analysis supported 37 cases of IKF, each of which includes a 'native' domain and a horizontally acquired 'alien' domain. IKFs could have evolved via lateral transfer of a gene coding for the alien domain (or a larger protein containing this domain) followed by recombination with a native gene. For several IKFs, this scenario is supported by the presence of a gene coding for a second, stand-alone version of the alien domain in the recipient genome. Among the genomes investigated, the greatest number of IKFs has been detected in Mycobacterium tuberculosis, where they are almost always accompanied by a stand-alone alien domain. For most of the IKF cases detected in other genomes, the stand-alone counterpart is missing. Conclusions: The results of comparative genome analysis show that IKF formation is a real, but relatively rare, evolutionary phenomenon. We hypothesize that IKFs are formed primarily via the proposed two-stage mechanism, but other than in the Actinomycetes, in which IKF generation seems to be an active, ongoing process, most of the stand-alone intermediates have been eliminated, perhaps because of functional redundancy.
A tandem repeats database for bacterial genomes: application to the genotyping of
공공데이터포털
Background Some pathogenic bacteria are genetically very homogeneous, making strain discrimination difficult. In the last few years, tandem repeats have been increasingly recognized as markers of choice for genotyping a number of pathogens. The rapid evolution of these structures appears to contribute to the phenotypic flexibility of pathogens. The availability of whole-genome sequences has opened the way to the systematic evaluation of tandem repeats diversity and application to epidemiological studies. Results This report presents a database () of tandem repeats from publicly available bacterial genomes which facilitates the identification and selection of tandem repeats. We illustrate the use of this database by the characterization of minisatellites from two important human pathogens, Yersinia pestis and Bacillus anthracis. In order to avoid simple sequence contingency loci which may be of limited value as epidemiological markers, and to provide genotyping tools amenable to ordinary agarose gel electrophoresis, only tandem repeats with repeat units at least 9 bp long were evaluated. Yersinia pestis contains 64 such minisatellites in which the unit is repeated at least 7 times. An additional collection of 12 loci with at least 6 units, and a high internal conservation were also evaluated. Forty-nine are polymorphic among five Yersinia strains (twenty-five among three Y. pestis strains). Bacillus anthracis contains 30 comparable structures in which the unit is repeated at least 10 times. Half of these tandem repeats show polymorphism among the strains tested. Conclusions Analysis of the currently available bacterial genome sequences classifies Bacillus anthracis and Yersinia pestis as having an average (approximately 30 per Mb) density of tandem repeat arrays longer than 100 bp when compared to the other bacterial genomes analysed to date. In both cases, testing a fraction of these sequences for polymorphism was sufficient to quickly develop a set of more than fifteen informative markers, some of which show a very high degree of polymorphism. In one instance, the polymorphism information content index reaches 0.82 with allele length covering a wide size range (600-1950 bp), and nine alleles resolved in the small number of independent Bacillus anthracis strains typed here.
Genomic comparisons among
공공데이터포털
Background Insertion Sequence (IS) elements are mobile genetic elements widely distributed among bacteria. Their activities cause mutations, promoting genetic diversity and sometimes adaptation. Previous studies have examined their copy number and distribution in Escherichia coli K-12 and natural isolates. Here, we map most of the IS elements in E. coli B and compare their locations with the published genomes of K-12 and O157:H7. Results The genomic locations of IS elements reveal numerous differences between B, K-12, and O157:H7. IS elements occur in hok-sok loci (homologous to plasmid stabilization systems) in both B and K-12, whereas these same loci lack IS elements in O157:H7. IS elements in B and K-12 are often found in locations corresponding to O157:H7-specific sequences, which suggests IS involvement in chromosomal rearrangements including the incorporation of foreign DNA. Some sequences specific to B are identified, as reported previously for O157:H7. The extent of nucleotide sequence divergence between B and K-12 is <2% for most sequences adjacent to IS elements. By contrast, B and K-12 share only a few IS locations besides those in hok-sok loci. Several phenotypic features of B are explained by IS elements, including differential porin expression from K-12. Conclusions These data reveal a high level of IS activity since E. coli B, K-12, and O157:H7 diverged from a common ancestor, including IS association with deletions and incorporation of horizontally acquired genes as well as transpositions. These findings indicate the important role of IS elements in genome plasticity and divergence.