데이터셋 상세
미국
dbSNP
,dbSNP is a database of single nucleotide polymorphisms (SNPs) and multiple small-scale variations that include insertions/deletions, microsatellites, and non-polymorphic variants.,
데이터 정보
연관 데이터
dbVar
공공데이터포털
,dbVar is a database of genomic structural variation. It accepts data from all species and includes clinical data. It can accept diverse types of events, including inversions, insertions and translocations. Additionally, both germline and somatic variants are accepted.,
High-throughput genotyping of single nucleotide polymorphisms with rolling circle amplification
공공데이터포털
Background Single nucleotide polymorphisms (SNPs) are the foundation of powerful complex trait and pharmacogenomic analyses. The availability of large SNP databases, however, has emphasized a need for inexpensive SNP genotyping methods of commensurate simplicity, robustness, and scalability. We describe a solution-based, microtiter plate method for SNP genotyping of human genomic DNA. The method is based upon allele discrimination by ligation of open circle probes followed by rolling circle amplification of the signal using fluorescent primers. Only the probe with a 3' base complementary to the SNP is circularized by ligation. Results SNP scoring by ligation was optimized to a 100,000 fold discrimination against probe mismatched to the SNP. The assay was used to genotype 10 SNPs from a set of 192 genomic DNA samples in a high-throughput format. Assay directly from genomic DNA eliminates the need to preamplify the target as done for many other genotyping methods. The sensitivity of the assay was demonstrated by genotyping from 1 ng of genomic DNA. We demonstrate that the assay can detect a single molecule of the circularized probe. Conclusions Compatibility with homogeneous formats and the ability to assay small amounts of genomic DNA meets the exacting requirements of automated, high-throughput SNP scoring.
SNP analysis of the inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) gene by a fluorescence-adapted SSCP method
공공데이터포털
Background Single-nucleotide polymorphisms (SNPs) are considered to be useful polymorphic markers for genetic studies of polygenic traits. Single-stranded conformational polymorphism (SSCP) analysis has been widely applied to detect SNPs, including point mutations in cancer and congenital diseases. In this study, we describe an application of the fluorescent labeling of PCR fragments using a fluorescent-adapted primer for SSCP analysis as a novel method. Methods Single-nucleotide polymorphisms (SNPs) of the inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) gene were analyzed using a fluorescence-adapted SSCP method. The method was constructed from two procedures: 1) a fluorescent labeling reaction of PCR fragments using fluorescence-adapted primers in a single tube, and 2) electrophoresis on a non-denaturing polyacrylamide gel. Results This method was more economical and convenient than the single-stranded conformational polymorphism (SSCP) methods previously reported in the detection of the labeled fragments obtained. In this study, eight SNPs of the IHRP gene were detected by the fluorescence-adapted SSCP. One of the SNPs was a new SNP resulting in an amino acid substitution, while the other SNPs have already been reported in the public databases. Six SNPs of the IHRP were associated with two haplotypes. Conclusions The fluorescence-adapted SSCP was useful for detecting and genotyping SNPs.
Biospecimen Repository Access and Data Sharing (BRADS)
공공데이터포털
,BRADS is a repository for data and biospecimens from population health research initiatives and clinical or interventional trials designed and implemented by NICHD’s Division of Intramural Population Health Research (DIPHR). Topics include human reproduction and development, pregnancy, child health and development, and women’s health. The website is maintained by DIPHR.,
NIST test dataset for assessing baseline nucleic acid sequence screening
공공데이터포털
This repository contains the dataset used in the manuscript "Inter-tool analysis of a NIST dataset for assessing baseline nucleic acid sequence screening". NIST constructed the test dataset based on the current screening recommendations from HHS. The dataset is a FASTA formatted file with blinded numerical sequence headers. The dataset was sent to sequence screening tool developers for initial testing and to obtain feedback about its utility for assessing baseline sequence screening. An additional metadata file provides the NIST-assigned label for each sequence, along with a more detailed description derived from the source database.
U.S. population data for human identification markers
공공데이터포털
The primary data consist of allele or haplotype frequencies for N=1036 anonymized U.S. population samples. Additional files are supplements to the associated publications. Any changes to spreadsheets are listed in the "Change Log" tab within each spreadsheet. DOI numbers for associated publications are listed below, under "References".
Detection of mutations in the dystrophin gene via automated DHPLC screening and direct sequencing
공공데이터포털
Background Currently molecular diagnostic laboratories focus only on the identification of large deletion and duplication mutations (spanning one exon or more) for Duchenne Muscular Dystrophy (DMD) yielding 65% of causative mutations. These mutations are detected by an existing set of multiplexed polymerase chain reaction (PCR) primer pairs. Due to the large size of the dystrophin gene (79 exons), finding point mutations (substitutions, deletions or insertions of one or several nucleotides) has been prohibitively expensive and laborious. The aim of this project was to develop an effective and convenient method of finding all, or most, mutations in the dystrophin gene with only a moderate increase in cost. Results Using denaturing high performance liquid chromatography (DHPLC) screening and direct sequencing, 86 PCR amplicons of genomic DNA from the dystrophin gene were screened for mutations in eight patients diagnosed with DMD who had tested negative for large DNA rearragements. Mutations likely to be disease-causative were found in six of the eight patients. All 86 amplicons from the two patients in whom no likely disease-causative mutations were found were completely sequenced and only polymorphisms were found. Conclusions We have shown that it is now feasible for clinical laboratories to begin testing for both point mutations and large deletions/duplications in the dystrophin gene. The detection rate will rise from 65% to greater than 92% with only a moderate increase in cost.
Protein Data Bank (PDB)
공공데이터포털
,The Protein Data Bank (PDB) archive is the single worldwide repository of information about the 3D structures of large biological molecules, including proteins and nucleic acids found in all organisms including bacteria, yeast, plants, flies, other animals, and humans.,