데이터셋 상세
미국
Astragalus species complex genetic data from southeast Utah (Grand County and San Juan County), USA
These data were compiled to investigate the evolutionary history of Astragalus iselyi, A. sabulosus var. sabulosus, and A. sabulosus var. vehiculus. The data release consists of six text files. One file is a bash script (astragalus_MLE_msms.sh) for generating simulated genetic data. Four files contain individual-level (astragalus.fasta, astragalus.nothin.recode.vcf, astragalus.recode.vcf) or population-level (astragalus_dadi.txt) information on genetic variation. One file contains sampling site-specific data for various soil and climatic variables (astragalus_env.txt). The .vcf files contain the full sequence information that is contained in the other files, but the file structures vary based on the programs used for analysis. These files may be opened and edited in a text editor program, such as Notepad ++ (PC) or BBEdit (Mac). The .vcf file can be loaded into the Stacks population program (Catchen et al. 2013) or VCFtools (Danecek et al. 2011) to calculate genetic diversity statistics (astragalus.nothin.recode.vcf contains multiple genetic variants per locus, whereas astragalus.recode.vcf contains only one genetic variant per locus).The astragalus_dadi.txt file can be used as an input file with the program dadi (Gutenkunst et al. 2009) to infer population history. Finally, astragalus.fasta can be loaded as an input file into RAxML (Stamatakis 2008) to perform phylogenetic analyses.
데이터 정보
연관 데이터
Astragalus species complex genetic data from southeast Utah (Grand County and San Juan County), USA
공공데이터포털
These data were compiled to investigate the evolutionary history of Astragalus iselyi, A. sabulosus var. sabulosus, and A. sabulosus var. vehiculus. The data release consists of six text files. One file is a bash script (astragalus_MLE_msms.sh) for generating simulated genetic data. Four files contain individual-level (astragalus.fasta, astragalus.nothin.recode.vcf, astragalus.recode.vcf) or population-level (astragalus_dadi.txt) information on genetic variation. One file contains sampling site-specific data for various soil and climatic variables (astragalus_env.txt). The .vcf files contain the full sequence information that is contained in the other files, but the file structures vary based on the programs used for analysis. These files may be opened and edited in a text editor program, such as Notepad ++ (PC) or BBEdit (Mac). The .vcf file can be loaded into the Stacks population program (Catchen et al. 2013) or VCFtools (Danecek et al. 2011) to calculate genetic diversity statistics (astragalus.nothin.recode.vcf contains multiple genetic variants per locus, whereas astragalus.recode.vcf contains only one genetic variant per locus).The astragalus_dadi.txt file can be used as an input file with the program dadi (Gutenkunst et al. 2009) to infer population history. Finally, astragalus.fasta can be loaded as an input file into RAxML (Stamatakis 2008) to perform phylogenetic analyses.
Genetic analyses of Astragalus sect. Humillimi data in the southwest USA
공공데이터포털
These data were compiled to support effective conservation for species within Astragalus sect. Humillimi. Two species, A. cremnophylax var. cremnophylax and A. humillimus are federally listed as endangered taxa. The data provided herein were used to resolve the relationaships among taxa, the population structure within taxa, and genetic diversity within taxa. The data are provided in a STRUCTURE-formatted file that includes taxa, individuals, sampling localities, and the genotype scores per individual for 690 Amplified Fragment Length Polymorphisms (AFLPs).
Genetic analyses of Astragalus sect. Humillimi data in the southwest USA
공공데이터포털
These data were compiled to support effective conservation for species within Astragalus sect. Humillimi. Two species, A. cremnophylax var. cremnophylax and A. humillimus are federally listed as endangered taxa. The data provided herein were used to resolve the relationaships among taxa, the population structure within taxa, and genetic diversity within taxa. The data are provided in a STRUCTURE-formatted file that includes taxa, individuals, sampling localities, and the genotype scores per individual for 690 Amplified Fragment Length Polymorphisms (AFLPs).
Penstemon grahamii genetic data from a dryland region of the western United States
공공데이터포털
These data were compiled to investigate the evolutionary history of Graham's beardtongue (Penstemon grahamii). Objective(s) of our study were to determine the evolutionary history of P. grahamii, including ancestral population sizes, the history of population divergences, and historical connectivity. In addition, we characterized population structure, genetic diversity summary statistics, and landscape factors influencing differentiation. These data represent anonymous loci sequenced from throughout the P. grahamii genome (specifically, .vcf and .structure files). Data in these files were manipulated to represent site frequency spectra between population pairs (.data files). These data were collected in 2019 from across the P. grahamii distribution, which is located in northeastern Utah and western Colorado. Specifically, plants are located at lower elevations on the northern slope of the Book Cliffs, which forms the southern side of the Uinta Basin. These data were collected by employees of the U.S. Geological Survey. Known occurrences of P. grahamii were visited and leaf samples from individual plants were collected from across the species’ distribution. These data can be used to further investigate the genetic differentiation, genetic diversity, and evolutionary history of P. grahamii.
Penstemon grahamii genetic data from a dryland region of the western United States
공공데이터포털
These data were compiled to investigate the evolutionary history of Graham's beardtongue (Penstemon grahamii). Objective(s) of our study were to determine the evolutionary history of P. grahamii, including ancestral population sizes, the history of population divergences, and historical connectivity. In addition, we characterized population structure, genetic diversity summary statistics, and landscape factors influencing differentiation. These data represent anonymous loci sequenced from throughout the P. grahamii genome (specifically, .vcf and .structure files). Data in these files were manipulated to represent site frequency spectra between population pairs (.data files). These data were collected in 2019 from across the P. grahamii distribution, which is located in northeastern Utah and western Colorado. Specifically, plants are located at lower elevations on the northern slope of the Book Cliffs, which forms the southern side of the Uinta Basin. These data were collected by employees of the U.S. Geological Survey. Known occurrences of P. grahamii were visited and leaf samples from individual plants were collected from across the species’ distribution. These data can be used to further investigate the genetic differentiation, genetic diversity, and evolutionary history of P. grahamii.
Plant genetic structure data from riparian areas within the Grand Canyon region in northern Arizona
공공데이터포털
These data represent nuclear microsatellite data collected from four riparian plant species that occur in and around Grand Canyon National Park: Populus fremontii (POFR), Salix gooddingii (SAGO), Salix exigua (SAEX), and Prosopis glandulosa (PRGL). These data were collected for population genetic analysis to help inform native plant material development in Grand Canyon National Park. Leaf samples were collected at sites spanning 470 km of the Colorado River between Glen Canyon Dam and Lake Mead and in tributaries. Known revegetation areas were not sampled. We aimed to collect leaf tissue from at least 15 individuals at each sample site. If there were fewer than 15 individuals per species at a site, leaf tissue was collected from every individual. Leaves were immediately dried and stored in silica gel. For P. fremontii, S. gooddingii, and P. glandulosa, total genomic DNA was extracted from dried leaf tissue using a high-molecular weight protocol with modifications. For S. exigua, DNeasy Plant Minikits were used. We amplified 9, 9, 11, and 13 loci for P. fremontii, S. gooddingii, S. exigua, and P. glandulosa, respectively. All loci were amplified using polymerase chain reaction (PCR) and fragment analysis processed on an ABI 3730xl Genetic Analyzer with GeneScan LIZ500 internal size standard. Allele fragment sizes were scored using GeneMarker v2.2.0. Loci that were missing more than 5% of values, were not polymorphic, or could not be reliably scored were omitted. Nine loci were included for P. fremontii, six for S. gooddingii, eight for P. glandulosa, and nine for S. exigua. All species are diploid, so each microsatellite locus has two values. Associated with the microsatellite data are labels delineating what collection site each individual came from, which genetic group each individual was assigned to during statistical analyses, and information about those sites. Specifically, sites are noted as being on the Colorado River or on a tributary to it, and inside or outside of the canyon rims.
Plant genetic structure data from riparian areas within the Grand Canyon region in northern Arizona
공공데이터포털
These data represent nuclear microsatellite data collected from four riparian plant species that occur in and around Grand Canyon National Park: Populus fremontii (POFR), Salix gooddingii (SAGO), Salix exigua (SAEX), and Prosopis glandulosa (PRGL). These data were collected for population genetic analysis to help inform native plant material development in Grand Canyon National Park. Leaf samples were collected at sites spanning 470 km of the Colorado River between Glen Canyon Dam and Lake Mead and in tributaries. Known revegetation areas were not sampled. We aimed to collect leaf tissue from at least 15 individuals at each sample site. If there were fewer than 15 individuals per species at a site, leaf tissue was collected from every individual. Leaves were immediately dried and stored in silica gel. For P. fremontii, S. gooddingii, and P. glandulosa, total genomic DNA was extracted from dried leaf tissue using a high-molecular weight protocol with modifications. For S. exigua, DNeasy Plant Minikits were used. We amplified 9, 9, 11, and 13 loci for P. fremontii, S. gooddingii, S. exigua, and P. glandulosa, respectively. All loci were amplified using polymerase chain reaction (PCR) and fragment analysis processed on an ABI 3730xl Genetic Analyzer with GeneScan LIZ500 internal size standard. Allele fragment sizes were scored using GeneMarker v2.2.0. Loci that were missing more than 5% of values, were not polymorphic, or could not be reliably scored were omitted. Nine loci were included for P. fremontii, six for S. gooddingii, eight for P. glandulosa, and nine for S. exigua. All species are diploid, so each microsatellite locus has two values. Associated with the microsatellite data are labels delineating what collection site each individual came from, which genetic group each individual was assigned to during statistical analyses, and information about those sites. Specifically, sites are noted as being on the Colorado River or on a tributary to it, and inside or outside of the canyon rims.
Species distribution model (SDM) for Astragalus layneae in the Mojave Desert
공공데이터포털
Preserving native species diversity is fundamental to ecosystem conservation. Selecting appropriate native species for use in restoration is a critical component of project design and may emphasize species attributes such as life history, functional type, pollinator services, and nutritional value for wildlife. Determining which species are likely to establish and persist in a particular environment is a key consideration. Species distribution models (SDMs) characterize relationships between species occurrences and the physical environment (e.g., climate, soil, topographic relief) and provide a mechanism for assessing which species may successfully propagate at a restoration site. In conjunction with information on species attributes, SDMs facilitate holistic ecosystem restoration by enabling practitioners to identify diverse, resilient assemblages of native species. This project develops SDMs for native species of fundamental ecosystem importance in order to guide restoration of Mojave Desert landscapes.The dataset contained herein provides an SDM for Astragalus layneae within its Mojave Desert range based on known occurrences.
Population genetic analysis of three aquatic macroinvertebrate species from samples in Grand Canyon (Arizona, USA) tributaries and nearby reference streams, 2016-2021
공공데이터포털
This dataset contains single nucleotide polymorphism (SNP) information for aquatic insect species collected in tributaries of the Colorado River in Grand Canyon (Arizona, USA), as well as SNP information for individuals collected from reference reaches of the Upper Colorado River Basin in Utah. This dataset focuses specifically on three species that were common and widely distributed throughout tributary streams in Grand Canyon: a mayfly (Fallceon quilleri), a caddisfly (Hydrospyche oslari), and a water strider (Rhagovelia distincta).
Population genetic analysis of three aquatic macroinvertebrate species from samples in Grand Canyon (Arizona, USA) tributaries and nearby reference streams, 2016-2021
공공데이터포털
This dataset contains single nucleotide polymorphism (SNP) information for aquatic insect species collected in tributaries of the Colorado River in Grand Canyon (Arizona, USA), as well as SNP information for individuals collected from reference reaches of the Upper Colorado River Basin in Utah. This dataset focuses specifically on three species that were common and widely distributed throughout tributary streams in Grand Canyon: a mayfly (Fallceon quilleri), a caddisfly (Hydrospyche oslari), and a water strider (Rhagovelia distincta).