Data from: Genetic variation among 481 diverse soybean accessions
공공데이터포털
,This data is from the manuscript titled: "Genetic variation among 481 diverse soybean accessions, inferred from genomic re-sequencing". SNP calls were obtained from resequencing 481 diverse soybean lines comprising 52 wild (Glycine soja) and 429 cultivated (Glycine max). This dataset contains 6 gzipped VCF (Variant Call Format) files with variant calls for all 481 USB accessions, all G. max accessions, G. soja accessions, accessions sequenced at 15x coverage, accessions sequenced at 40x coverage, and 106 accessions re-sequenced from a previous study (Valliyodan et al. 2016). SNPs were called using the Haplotype caller algorithm from the Genome Analysis Toolkit (GATK) version gatk-2.5-2-gf57256b. A total of 7.8 million SNPs were identified between the 481 re-sequenced accessions. SNPs were assigned IDs using the script "assign_name.awk" available at https://github.com/soybase/SoySNP-Names. SNP effects were predicted using SnpEff 3.0.,Dataset also available at https://soybase.org/data/v2/Glycine/max/diversity/Wm82.gnm2.div.Valliyodan_Brown_2021/,Funding support provided by the United Soybean Board for the large-scale sequencing of soybean genomes (project #1320-532-5615), Bayer (previously Monsanto and Bayer), and Corteva (previously Dow AgroSciences), with in-kind support for analysis from USDA Agricultural Research Service project 5030-21000-069-00-D.,Resources in this dataset:,,
Switchgrass ESTs and SNPs
공공데이터포털
,As part of our project, “Developing Association Mapping in Polyploid Perennial Biofuel Grasses” (DOE-USDA Plant Feedstock Genomics for Bioenergy Program grant DE-A102-07ER64454)*, two SNP discovery initiatives were carried out. The earlier one (2009) was an approach based on EST sequences. The latest initiative (2011-12) adopted a more powerful approach, based on GBS (Genotyping by Sequencing). We believe that the SNP markers identified in these studies will greatly enhance breeding efforts that target the improvement of key biofuel traits and the development of new switchgrass cultivars.,To enable genome-wide association study (GWAS) and genomic selection (GS) in switchgrass, we genotyped a full-sib population (n =130), a half-sib population (n =168) and association populations (66 pops, n =540). The parents of the linkage populations are upland tetraploids. The association populations are primarily of the upland ecotype, both tetraploid and octoploid, with a few lowland tetraploids as well. A total of 350 GB of sequence was generated from 840 individuals using GBS. Over 1.2 million putative SNPs were discovered with the UNEAK pipeline. In addition, ultra-high density paternal and maternal linkage maps, of 41K and 46K SNPs, respectively, were also constructed based on the conserved synteny between switchgrass and foxtail millet.,The data associated with this study are listed here:,
Shannon Dillon - OzWheat V1.0 SNP data assets
공공데이터포털
Single nucleotide polymorphism genotypes for ~300 varieties from the OzWheat diversity panel generated with alternative platforms combined in a single matrix. Original platform source files are also provided. Transcript SNPs: 63,222 SNP loci. 298 wheat lines. File=OzWheat_Transsnp_filtered_CS1.0_231130.hmp.txt 90K Illumina SNP array: 23362 SNP loci. 286 wheat lines from OzWheat panel 1 and 300 lines from OzWheat panel 2. File=V1_V2_commonSNP_merged.hmp 40K Illumina SNP array: 10670 SNP loci. 286 wheat lines. File=OW1_40K_filtered.hmp.txt DARTseq: 56467 SNP loci. 282 wheat lines. File=Report_DW21-6152_SNP_mapping_2.csv Combined platform SNP - joins SNPs called from transcriptome data, 90K and 40K array and DARTseq. 115,260 SNP loci. 296 wheat lines. File=OzWheat_Transsnp40k90kDArT_filtered_CS1.0_alleleinfo_240102.hmp.txt Keeble-Gagnère G, Pasam R, Forrest KL, et al. Novel Design of Imputation-Enabled SNP Arrays for Breeding and Research Applications Supporting Multi-Species Hybridization. Front Plant Sci. 2021;12:756877. Published 2021 Dec 22. doi:10.3389/fpls.2021.756877
A genome‑wide association and meta‑analysis: Candidate genes
공공데이터포털
Seed size is an important trait for yield and commercial value in dry-grain cowpea. Seed size varies widely among different cowpea accessions, and the genetic basis of such variation is not yet well understood. To better decipher the genetic basis of seed size, a genome-wide association study (GWAS) and meta-analysis were conducted on a panel of 368 cowpea diverse accessions from 51 countries. Four traits, including seed weight, length, width and density were evaluated across three locations. Using 51,128 single nucleotide polymorphisms covering the cowpea genome, 17 loci were identified for these traits. One locus was common to weight, width and length, suggesting pleiotropy. By integrating synteny-based analysis with common bean, six candidate genes (Vigun05g036000, Vigun05g039600, Vigun05g204200, Vigun08g217000, Vigun11g187000, and Vigun11g191300) which are implicated in multiple functional categories related to seed size such as endosperm development, embryo development, and cell elongation were identified. These results suggest that a combination of GWAS meta-analysis with synteny comparison in a related plant is an efficient approach to identify candidate gene (s) for complex traits in cowpea. The identified loci and candidate genes provide useful information for improving cowpea varieties and for molecular investigation of seed size.
Data from: Assessing metabolomic and chemical diversity of a soybean lineage representing 35 years of breeding
공공데이터포털
,Information on crop genotype- and phenotype-metabolite associations can be of value to trait development as well as to food security and safety. The unique study presented here assessed seed metabolomic and ionomic diversity in a soybean (Glycine max) lineage representing ~35 years of breeding (launch years 1972–2008) and increasing yield potential. Selected varieties included six conventional and three genetically modified (GM) glyphosate-tolerant lines. A metabolomics approach utilizing capillary electrophoresis (CE)-time-of-flight-mass spectrometry (TOF-MS), gas chromatography (GC)-TOF-MS and liquid chromatography (LC)-quadrupole (q)-TOFMS resulted in measurement of a total of 732 annotated peaks. Ionomics through inductively-coupled plasma (ICP)-MS profiled twenty mineral elements. Orthogonal partial least squares-discriminant analysis (OPLS-DA) of the seed data successfully differentiated newer higher-yielding soybean from earlier lower-yielding accessions at both field sites. This result reflected genetic fingerprinting data that demonstrated a similar distinction between the newer and older soybean. Correlation analysis also revealed associations between yield data and specific metabolites. There were no clear metabolic differences between the conventional and GM lines. Overall, observations of metabolic and genetic differences between older and newer soybean varieties provided novel and significant information on the impact of varietal development on biochemical variability. Proposed applications of omics in food and feed safety assessments will need to consider that GM is not a major source of metabolite variability and that trait development in crops will, of necessity, be associated with biochemical variation.,,
SoyBase and the Soybean Breeder's Toolbox
공공데이터포털
,SoyBase is a repository for genetics, genomics and related data resources for soybean. It contains current genetic, physical and genomic sequence maps integrated with qualitative and quantitative traits.,SoyBase database was established in the 1990s as the USDA Soybean Genetics Database. Originally, it contained only genetic information about soybeans such as genetic maps and information about the Mendelian genetics of soybean. In time SoyBase was expanded to include molecular data regarding soybean genes and sequences as they became available. In 2010, the soybean genome sequence was published and it and supporting gene sequences have been integrated into the SoyBase sequence browser. SoyBase genetic maps were used in the assembly of both the Williams 82 2010 assembly (Wm82.a1.v1) and the newest genome assembly (Wm82.a2.v1).,SoyBase also incorporates information about mutant and other soybean genetic stocks and serves as a contact point for ordering strains from those populations. As association analyses continue due to various re-sequencing efforts SoyBase will also incorporate those data into the soybean genome browser as they become available. Gene expression patterns are also available at SoyBase through the SoyBase expression pages and the Soybean Gene Atlas. Other expression/transcriptome/methylomic data sets also have been and continue to be incorporated into the SoyBase genome browser.,Project No:3625-21000-062-00D Accession No: 0425040,
Data from: Genome-wide Association and Genomic Prediction Identifies Soybean Cyst Nematode Resistance in Common Bean Including a Syntenic Region to Soybean Rhg1 Locus
공공데이터포털
,A panel of single nucleotide polymorphisms (SNPs) for 363 common bean accessions was generated. A genome-wide association study (GWAS) was applied to detect SNPs significantly associated with resistance to Heterodera glycines (HG) also known as the soybean cyst nematode (SCN) in the core collection of common bean, Phaseolus vulgaris. There were 84,416 SNPs identified in 363 common bean accessions.,,
A genome‑wide association and meta‑analysis: Cowpea seed weight
공공데이터포털
Seed size is an important trait for yield and commercial value in dry-grain cowpea. Seed size varies widely among different cowpea accessions, and the genetic basis of such variation is not yet well understood. To better decipher the genetic basis of seed size, a genome-wide association study (GWAS) and meta-analysis were conducted on a panel of 368 cowpea diverse accessions from 51 countries. Four traits, including seed weight, length, width and density were evaluated across three locations. Using 51,128 single nucleotide polymorphisms covering the cowpea genome, 17 loci were identified for these traits. One locus was common to weight, width and length, suggesting pleiotropy. By integrating synteny-based analysis with common bean, six candidate genes (Vigun05g036000, Vigun05g039600, Vigun05g204200, Vigun08g217000, Vigun11g187000, and Vigun11g191300) which are implicated in multiple functional categories related to seed size such as endosperm development, embryo development, and cell elongation were identified. These results suggest that a combination of GWAS meta-analysis with synteny comparison in a related plant is an efficient approach to identify candidate gene (s) for complex traits in cowpea. The identified loci and candidate genes provide useful information for improving cowpea varieties and for molecular investigation of seed size.
Data from: A Community Resource for Exploring and Utilizing Genetic Diversity in the USDA Pea Single Plant Plus Collection
공공데이터포털
,Included in this dataset are SNP and fasta data for the Pea Single Plant Plus Collection (PSPPC) and the PSPPC augmented with 25 P. fulvum accessions.,These 6 datasets can be roughly divided into two groups. Group 1 consists of three datasets labeled PSPPC which refer to SNP data pertaining to the USDA Pea Single Plant Plus Collection. Group 2 consists of three datasets labeled PSPPC + P. fulvum which refer to SNP data pertaining to the USDA PSPPC with 25 accessions of Pisum fulvum added. SNPs for each of these groups were called independently; therefore SNP names that are shared between the PSPPC and PSPPC + P. fulvum groups should NOT be assumed to refer to the same locus.,For analysis, SNP data is available in two widely used formats: hapmap and vcf. These formats can be successfully loaded into TASSEL v. 5.2.25 (http://www.maizegenetics.net/tassel). Explanations of fields (columns) in the VCF files are contained within commented (##) rows at the top of the file.,Descriptions of the first 11 columns in the hapmap file are as follows:,The fasta sequences containing the SNPs are also available for such downstream applications as development of primers for platform-specific markers.,For more information about this dataset, contact Clarice Coyne at Clarice.Coyne@usda.gov or coynec@wsu.edu.,,
A genome‑wide association and meta‑analysis: Cowpea seed size phenotypes
공공데이터포털
Seed size is an important trait for yield and commercial value in dry-grain cowpea. Seed size varies widely among different cowpea accessions, and the genetic basis of such variation is not yet well understood. To better decipher the genetic basis of seed size, a genome-wide association study (GWAS) and meta-analysis were conducted on a panel of 368 cowpea diverse accessions from 51 countries. Four traits, including seed weight, length, width and density were evaluated across three locations. Using 51,128 single nucleotide polymorphisms covering the cowpea genome, 17 loci were identified for these traits. One locus was common to weight, width and length, suggesting pleiotropy. By integrating synteny-based analysis with common bean, six candidate genes (Vigun05g036000, Vigun05g039600, Vigun05g204200, Vigun08g217000, Vigun11g187000, and Vigun11g191300) which are implicated in multiple functional categories related to seed size such as endosperm development, embryo development, and cell elongation were identified. These results suggest that a combination of GWAS meta-analysis with synteny comparison in a related plant is an efficient approach to identify candidate gene (s) for complex traits in cowpea. The identified loci and candidate genes provide useful information for improving cowpea varieties and for molecular investigation of seed size.