A rapid method to map mutations in
공공데이터포털
Background Genetic screens in Drosophila have provided a wealth of information about a variety of cellular and developmental processes. It is now possible to screen for mutant phenotypes in virtually any cell at any stage of development by performing clonal screens using the flp/FRT system. The rate-limiting step in the analysis of these mutants is often the identification of the mutated gene, however, because traditional mapping strategies rely mainly on genetic and cytological markers that are not easily linked to the molecular map. Results Here we describe the development of a single-nucleotide polymorphism (SNP) map for chromosome arm 3R. The map contains 73 polymorphisms between the standard FRT chromosome, and a mapping chromosome that carries several visible markers (rucuca), at an average density of one SNP per 370 kilobases (kb). Using this collection, we show that mutants can be mapped to a 400 kb interval in a single meiotic mapping cross, with only a few hundred SNP detection reactions. Discovery of further SNPs in the region of interest allows the mutation to be mapped with the same recombinants to a region of about 50 kb. Conclusion The combined use of standard visible markers and molecular polymorphisms in a single mapping strategy greatly reduces both the time and cost of mapping mutations, because it requires at least four times fewer SNP detection reactions than a standard approach. The use of this map, or others developed along the same lines, will greatly facilitate the identification of the molecular lesions in mutants from clonal screens.
A draft annotation and overview of the human genome
공공데이터포털
Background The recent draft assembly of the human genome provides a unified basis for describing genomic structure and function. The draft is sufficiently accurate to provide useful annotation, enabling direct observations of previously inferred biological phenomena. Results We report here a functionally annotated human gene index placed directly on the genome. The index is based on the integration of public transcript, protein, and mapping information, supplemented with computational prediction. We describe numerous global features of the genome and examine the relationship of various genetic maps with the assembly. In addition, initial sequence analysis reveals highly ordered chromosomal landscapes associated with paralogous gene clusters and distinct functional compartments. Finally, these annotation data were synthesized to produce observations of gene density and number that accord well with historical estimates. Such a global approach had previously been described only for chromosomes 21 and 22, which together account for 2.2% of the genome. Conclusions We estimate that the genome contains 65,000-75,000 transcriptional units, with exon sequences comprising 4%. The creation of a comprehensive gene index requires the synthesis of all available computational and experimental evidence.
Full-length messenger RNA sequences greatly improve genome annotation
공공데이터포털
Background Annotation of eukaryotic genomes is a complex endeavor that requires the integration of evidence from multiple, often contradictory, sources. With the ever-increasing amount of genome sequence data now available, methods for accurate identification of large numbers of genes have become urgently needed. In an effort to create a set of very high-quality gene models, we used the sequence of 5,000 full-length gene transcripts from Arabidopsis to re-annotate its genome. We have mapped these transcripts to their exact chromosomal locations and, using alignment programs, have created gene models that provide a reference set for this organism. Results Approximately 35% of the transcripts indicated that previously annotated genes needed modification, and 5% of the transcripts represented newly discovered genes. We also discovered that multiple transcription initiation sites appear to be much more common than previously known, and we report numerous cases of alternative mRNA splicing. We include a comparison of different alignment software and an analysis of how the transcript data improved the previously published annotation. Conclusions Our results demonstrate that sequencing of large numbers of full-length transcripts followed by computational mapping greatly improves identification of the complete exon structures of eukaryotic genes. In addition, we are able to find numerous introns in the untranslated regions of the genes.