데이터셋 상세
미국
The ESAT-6 gene cluster of
Background The genome of Mycobacterium tuberculosis H37Rv has five copies of a cluster of genes known as the ESAT-6 loci. These clusters contain members of the CFP-10 (lhp) and ESAT-6 (esat-6) gene families (encoding secreted T-cell antigens that lack detectable secretion signals) as well as genes encoding secreted, cell-wall-associated subtilisin-like serine proteases, putative ABC transporters, ATP-binding proteins and other membrane-associated proteins. These membrane-associated and energy-providing proteins may function to secrete members of the ESAT-6 and CFP-10 protein families, and the proteases may be involved in processing the secreted peptide. Results Finished and unfinished genome sequencing data of 98 publicly available microbial genomes has been analyzed for the presence of orthologs of the ESAT-6 loci. The multiple duplicates of the ESAT-6 gene cluster found in the genome of M. tuberculosis H37Rv are also conserved in the genomes of other mycobacteria, for example M. tuberculosis CDC1551, M. tuberculosis 210, M. bovis, M. leprae, M. avium, and the avirulent strain M. smegmatis. Phylogenetic analyses of the resulting sequences have established the duplication order of the gene clusters and demonstrated that the gene cluster known as region 4 (Rv3444c-3450c) is ancestral. Region 4 is also the only region for which an ortholog could be found in the genomes of Corynebacterium diphtheriae and Streptomyces coelicolor. Conclusions Comparative genomic analysis revealed that the presence of the ESAT-6 gene cluster is a feature of some high-G+C Gram-positive bacteria. Multiple duplications of this cluster have occurred and are maintained only within the genomes of members of the genus Mycobacterium.
데이터 정보
연관 데이터
LJFO00000000, LJFQ00000000, LJFT00000000, LJFU00000000, LJFX00000000, LJFY00000000
공공데이터포털
Draft genome sequences of six Mycobacterium immunogenum strains isolated from a chloraminated drinking water distribution system simulator subjected to changes in operational parameters. This dataset is associated with the following publication: Gomez-Alvarez, V., and R. Revetta. Draft Genome Sequences of Six Mycobacterium immunogenum, Obtained from a Chloraminated Drinking Water Distribution System Simulator. Genome Announcements. American Society for Microbiology, Washington, DC, USA, 4(1): e01538-15, (2016).
Data from: Draft genome sequences of eight streptogramin-resistant Enterococcus species isolates from animal and environmental sources in the United States
공공데이터포털
,Draft genome sequences of five Enterococcus faecium, two Enterococcus hirae, and one Enterococcus gallinarum from enviromental sources and chicken carcass rinsates. Isolates were selected for their resistance to the streptogramin antibiotic, Quinupristin-Dalfopristin and were all collected in the United States between 2001 and 2004. Antimicrobial resistance genes were identified conferring resistance to the macrolide-lincosamide-streptogramins, aminoglycosides, tetracycline, beta-lactams, and glycopeptides.,,
YRC 13 Loci SPM
공공데이터포털
BSE file of allele counts by population for 13 loci derived from the two individual genotype files (YRC baseline 4loci.txt and YRC baseline 9loci.txt).
YRC Baseline 9 Loci
공공데이터포털
TXT file of individual genotypes at 9 microsatellite loci. First row has the microsatellite loci names. Columns 1 and 2 are individual identifiers, followed by allele sizes for the 9 microsatellites, two alleles per locus.
YRC Baseline 4 Loci
공공데이터포털
TXT file of individual genotypes at 4 microsatellite loci. First row has the microsatellite loci names. Columns 1 and 2 are individual identifiers, followed by allele sizes for the 4 microsatellites, two alleles per locus.
RefSeq: NCBI Reference Sequence Database
공공데이터포털
A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
Genome In A Bottle - v2.0 Genome Stratifications (Deprecated)
공공데이터포털
These stratification BED files from the Global Alliance for Genomics and Health (GA4GH) Benchmarking Team and the Genome in a Bottle Consortium are intended as a standard resource of BED files for use in stratifying true positive, false positive, and false negative variant calls. These v2.0 stratification BED files from the Global Alliance for Genomics and Health (GA4GH) Benchmarking Team and the Genome in a Bottle Consortium are intended as a standard resource of BED files for use in stratifying true positive, false positive, and false negative variant calls. v2.0 stratifications have been deprecated and replaced by v3.0 genome-stratifications.
Illumina sequencing
공공데이터포털
These data are bacterial 16S rRNA sequences and a taxonomic summary table for biofilm samples from the bio-reactors. The data may provide background/supporting information for other researchers who have a similar experimental plan with a microbial electrochemical cell reactor. This dataset is associated with the following publication: Santodomingo, J., H. Lee, B. Dhar, J. An, B. Rittmann, H. Ren, and J. Chae. The Roles of Biofilm Conductivity and Donor Substrate Kinetics in a Mixed-Culture Biofilm Anod. ENVIRONMENTAL SCIENCE & TECHNOLOGY. American Chemical Society, Washington, DC, USA, 50(23): 12799-12807, (2016).