Data from: Genome analyses of fungal pathogens Neonectria faginata and Neonectria coccinea
Public Data Portal
,Protein predictions generated with the Augustus web server for the fungi Neonectria coccinea and N. faginata, as well as for the closely related species N. ditissima and Corinectria fuckeliana.,,
Protein predictions for Calonectria pseudonaviculata CBS 139707 (aka cpsCT01)
Public Data Portal
,Boxwood blight disease, caused by the fungi Calonectria henricotiae and C. pseudonaviculata, is an emergent threat to natural and managed landscapes worldwide. This dataset contains protein predictions and identifications generated from the Calonectria pseudonaviculata CBS 139707 (aka cpsCT01) genome dataset (https://doi.org/10.15482/USDA.ADC/1410184).,,
Functional annotation for 15 diverse arthropod genomes
Public Data Portal
,We present functional annotation results for 15 arthropod proteomes, produced with an open-source, open-access, containerized pipeline for genome-scale functional annotation of insect proteomes that we applied to a diverse range of arthropod species. More information about the pipeline is available at our readthedocs site. The files for each genome include GOanna, InterProScan and KOBAS predictions.,Arthropod genomes selected for this study and their assembly and annotation statistics.,,,
Data from: A High-Quality Genome Assembly from a Single, Field-collected Spotted Lanternfly (Lycorma delicatula) using the PacBio Sequel II System
Public Data Portal
,A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies; however, long-read methods have historically had greater input DNA requirements and higher costs than next-generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female Spotted Lanternfly (Lycorma delicatula) using a single PacBio SMRT Cell. The Spotted Lanternfly is an invasive species recently discovered in the northeastern United States, threatening to damage economically important crop plants in the region. The DNA from one individual female specimen collected in Reading, Berks County, Pennsylvania was used to make one standard, size-selected library with an average DNA fragment size of ~20 kb. The library was run on one Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing approximately 38x coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence-level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Further, it was possible to segregate more than half of the diploid genome into the two separate haplotypes. The assembly also recovered two microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig.
We demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species.,Supporting files for the manuscript "A High-Quality Genome Assembly from a Single, Field-collected Spotted Lanternfly (Lycorma delicatula) using the PacBio Sequel II System" include several intermediate versions of the assembly (raw output from Falcon, raw output from Falcon unzip, etc.) as well as the final assembly primary contigs and haplotigs (for the regions of the genome that were phased).,,
Leptinotarsa decemlineata genome annotations v0.5.3
Public Data Portal
,The Leptinotarsa decemlineata genome was recently sequenced and annotated as part of the i5k pilot project by the Baylor College of Medicine. This dataset presents the Leptinotarsa decemlineata gene set BCM_v_0.5.3, which was generated computationally. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Leptinotarsa decemlineata genome assembly 1.0. Further annotation method details will be available in a forthcoming publication.,NOTE: This gene set is an unstable pre-release (v0.5.3), and was provided to facilitate manual curation and analyses before the official gene set is released. Gene identifiers from this gene set will likely not be maintained.,If you wish to use this dataset, please follow the Baylor College of Medicine's conditions for data use: https://www.hgsc.bcm.edu/bcm-hgsc-conditions-use,
Colletotrichum shisoi FDWSRU 21-072 genome annotations
Public Data Portal
,Colletotrichum shisoi is a fungal plant pathogen of Perilla frutescens, a mint species cultivated in some Asian countries but considered invasive in the United States. This dataset contains a new, highly contiguous genome sequence generated from the North American C. shisoi isolate FDWSRU 21-072, which has been proposed for use as a biological control agent of invasive P. frutescens. Long-read PacBio sequencing produced a genome assembly of 48 contigs and 86.9 Mb. Structural and functional gene annotations were generated with the FunAnnotate v1.8.16 pipeline.,