Evaluation of DNA yield from various sources for use in single nucleotide polymorphism panels
Public Data Portal
Wildlife managers and researchers use genetic studies to draw inferences about populations of a species of interest. Microsatellites have been the primary method for gaining these insights; however, the field is currently shifting from microsatellites to single nucleotide polymorphisms (SNPs). Because the DNA requirements of the two methods differ, an investigation into which samples can provide adequate DNA yield is warranted. Using samples collected during previous genetic projects in regions of the USA from 2014 to 2021, we investigated the DNA yield of eight sample categories to determine which provided adequate DNA for use in three SNP panels. We found four sample categories that met the DNA requirements for all three panels, three sample categories that met the requirements for only two panels, and one sample category that did not meet the requirements of any of the three panels. Additionally, we used linear random-effects models to determine which covariates would have the greatest influence on DNA yield. We determined that all covariates (tissue type, storage method, preservative, DNA quality, time until DNA extraction, and time after DNA extraction) could influence DNA yield.
Phenotype-Genotype Integrator (PheGenI)
Public Data Portal
Supports finding human phenotype/genotype relationships with queries by phenotype, chromosome location, gene, and SNP identifiers. Currently includes information from dbGaP, the National Human Genome Research Institute (NHGRI) genome-wide association study (GWAS) Catalog, and Genotype-Tissue Expression (GTEx).
Sequencing Data Set of Sediment Layers
Public Data Portal
A table (DP_SRA.xlsx) lists one sample per row, with columns giving the BioSample accession number (NCBI), collection date, library strategy, target (source), and sequencing technology for each sample. The zip file (Genome_Set01.zip) contains nine FASTA files (DP_bin_02.fasta, DP_bin_04.fasta, DP_bin_09.fasta, DP_bin_10.fasta, DP_bin_14.fasta, DP_bin_15.fasta, DP_bin_16a.fasta, DP_bin_20.fasta, DP_bin_23.fasta) with the contig sequences (i.e., bins) for each metagenome-assembled genome (MAG). These data are available from the NCBI Sequence Read Archive (SRA) under the BioProject (https://www.ncbi.nlm.nih.gov/bioproject) with accession number PRJNA646252 and the following BioSample numbers: SAMN15536103 to SAMN15536108. This dataset is associated with the following publication: Gomez-Alvarez, V., H. Liu, J. Pressman, and D. Wahman. Metagenomic Profile of Microbial Communities in a Drinking Water Storage Tank Sediment after Sequential Exposure to Monochloramine, Free Chlorine, and Monochloramine. ENVIRONMENTAL SCIENCE & TECHNOLOGY. American Chemical Society, Washington, DC, USA, 1(5): 1283-1294, (2021).
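As a rough illustration of working with the files described above, the following Python sketch expands the stated BioSample range into individual accession numbers and counts the contigs in each FASTA member of a zip archive. The function names (expand_biosamples, parse_fasta, contigs_per_bin) are hypothetical helpers introduced here, and the sketch assumes the archive holds plain-text, UTF-8 FASTA files with unique header lines; it is not part of the dataset or its documentation.

```python
import io
import zipfile


def expand_biosamples(first: str, last: str) -> list[str]:
    """Expand an inclusive accession range, e.g. SAMN15536103 to SAMN15536108."""
    prefix = first.rstrip("0123456789")          # alphabetic part, e.g. "SAMN"
    width = len(first) - len(prefix)             # digits to preserve when formatting
    start, stop = int(first[len(prefix):]), int(last[len(prefix):])
    return [f"{prefix}{n:0{width}d}" for n in range(start, stop + 1)]


def parse_fasta(text: str) -> dict[str, str]:
    """Parse FASTA text into {header: sequence}; assumes headers are unique."""
    records: dict[str, str] = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line              # join wrapped sequence lines
    return records


def contigs_per_bin(zip_source) -> dict[str, int]:
    """Count contigs in every .fasta member of a zip (path or file-like object)."""
    counts: dict[str, int] = {}
    with zipfile.ZipFile(zip_source) as archive:
        for name in archive.namelist():
            if name.endswith(".fasta"):
                text = archive.read(name).decode("utf-8")  # assumed encoding
                counts[name] = len(parse_fasta(text))
    return counts
```

Under these assumptions, calling contigs_per_bin("Genome_Set01.zip") on a local copy of the archive would return one contig count per bin file, and expand_biosamples("SAMN15536103", "SAMN15536108") would list the six BioSample accessions named in the description.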
Sequence Read Archive (SRA)
Public Data Portal
The Sequence Read Archive (SRA) stores sequencing data from the next generation of sequencing platforms including Roche 454 GS System®, Illumina Genome Analyzer®, Life Technologies AB SOLiD System®, Helicos Biosciences Heliscope®, Complete Genomics®, and Pacific Biosciences SMRT®.
Database of Short Genetic Variations (dbSNP)
Public Data Portal
Database of Short Genetic Variations (dbSNP) contains human single nucleotide variations, microsatellites, and small-scale insertions and deletions along with publication, population frequency, molecular consequence, and genomic and RefSeq mapping information for both common variations and clinical mutations.