데이터셋 상세
미국
Evolution and phylogeny of insect endogenous retroviruses
Background The genome of invertebrates is rich in retroelements which are structurally reminiscent of the retroviruses of vertebrates. Those containing three open reading frames (ORFs), including an env-like gene, may well be considered as endogenous retroviruses. Further support to this similarity has been provided by the ability of the env-like gene of DmeGypV (the Gypsy endogenous retrovirus of Drosophila melanogaster) to promote infection of Drosophila cells by a pseudotyped vertebrate retrovirus vector. Results To gain insights into their evolutionary story, a sample of thirteen insect endogenous retroviruses, which represents the largest sample analysed until now, was studied by computer-assisted comparison of the translated products of their gag, pol and env genes, as well as their LTR structural features. We found that the three phylogenetic trees based respectively on Gag, Pol and Env common motifs are congruent, which suggest a monophyletic origin for these elements. Conclusions We showed that most of the insect endogenous retroviruses belong to a major clade group which can be further divided into two main subgroups which also differ by the sequence of their primer binding sites (PBS). We propose to name IERV-K and IERV-S these two major subgroups of Insect Endogenous Retro Viruses (or Insect ERrantiVirus, according to the ICTV nomenclature) which respectively use Lys and Ser tRNAs to prime reverse transcription.
데이터 정보
연관 데이터
Evidence for the adaptive significance of an LTR retrotransposon sequence in a
공공데이터포털
Background The potential adaptive significance of transposable elements (TEs) to the host genomes in which they reside is a topic that has been hotly debated by molecular evolutionists for more than two decades. Recent genomic analyses have demonstrated that TE fragments are associated with functional genes in plants and animals. These findings suggest that TEs may contribute significantly to gene evolution. Results We have analyzed two transposable elements associated with genes in the sequenced Drosophila melanogaster y; cn bw sp strain. A fragment of the Antonia long terminal repeat (LTR) retrotransposon is present in the intron of Chitinase 3 (Cht3), a gene located within the constitutive heterochromatin of chromosome 2L. Within the euchromatin of chromosome 2R a full-length Burdock LTR retrotransposon is located immediately 3' to cathD, a gene encoding cathepsin D. We tested for the presence of these two TE/gene associations in strains representing 12 geographically diverse populations of D. melanogaster. While the cathD insertion variant was detected only in the sequenced y; cn bw sp strain, the insertion variant present in the heterochromatic Cht3 gene was found to be fixed throughout twelve D. melanogaster populations and in a D. mauritiana strain suggesting that it maybe of adaptive significance. To further test this hypothesis, we sequenced a 685bp region spanning the LTR fragment in the intron of Cht3 in strains representative of the two sibling species D. melanogaster and D. mauritiana (~2.7 million years divergent). The level of sequence divergence between the two species within this region was significantly lower than expected from the neutral substitution rate and lower than the divergence observed between a randomly selected intron of the Drosophila Alcohol dehydrogenase gene (Adh). Conclusions Our results suggest that a 359 bp fragment of an Antonia retrotransposon (complete LTR is 659 bp) located within the intron of the Drosophila melanogaster Cht3 gene is of adaptive evolutionary significance. Our results are consistent with previous suggestions that the presence of TEs in constitutive heterochromatin may be of significance to the expression of heterochromatic genes.
An algorithm for mapping positively selected members of quasispecies-type viruses
공공데이터포털
Background Many RNA viruses do not have a single, representative genome but instead form a set of related variants that has been called a quasispecies. The sequence variability of such viruses presents a significant bioinformatics challenge. In order for the sequence information to be understood, the complete mutational spectrum needs to be distilled to a biologically relevant and analyzable representation. Results Here, we develop a "selection mapping" algorithm--QUASI--that identifies the positively selected variants of viral proteins. The key to the selection mapping algorithm is the identification of particular replacement mutations that are overabundant relative to silent mutations at each codon (e.g., threonine at hemagglutinin position 262). Selection mapping identifies such replacement mutations as positively selected. Conversely, selection mapping recognizes negatively selected variants as mutational "noise" (e.g., serine at hemagglutinin position 262). Conclusion Selection mapping is a fundamental improvement over earlier methods (e.g., dN/dS) that identify positive selection at codons but do not identify which amino acids at these codons confer selective advantage. Using QUASI's selection maps, we characterize the selected mutational landscapes of influenza A H3 hemagglutinin, HIV-1 reverse transcriptase, and HIV-1 gp120.
Data from: The reference transcriptome of the adult female biting midge (Culicoides sonorensis) and differential gene expression profiling during teneral, blood, and sucrose feeding conditions
공공데이터포털
,Deep sequencing technologies were used to construct the first adult female Culicoides sonorensis reference transcriptome.,Genetic and genomic tools for Culicoides biting midges are lacking, despite the fact that they vector a large number of arboviruses and other pathogens impacting humans and domestic animals world-wide. Libraries of tissue-specific transcripts expressed in response to feeding and oral virus challenge in C. sonorensis have previously been reported, but extensive genome-wide expression profiling in the midge has not. Here, we successfully used deep sequencing technologies to construct the first adult female C. sonorensis reference transcriptome, and utilized genome-wide expression profiling to elucidate the genetic response to blood and sucrose feeding over time. The adult female midge unigene consists of 19,041 genes, of which less than 7% are differentially expressed during the course of a sucrose meal, while up to 52% of the genes respond significantly in blood-fed midges, indicating hematophagy induces complex physiological processes. Many genes that were differentially expressed during blood feeding were associated with digestion (e.g. proteases, lipases), hematophagy (e.g., salivary proteins), and vitellogenesis, revealing many major metabolic and biological factors underlying these critical processes. Additionally, key genes in the vitellogenesis pathway were identified, which provides the first glimpse into the molecular basis of anautogeny for C. sonorensis. This is the first extensive transcriptome for this genus, which will serve as a framework for future expression studies and in informing a reference genome assembly and annotation. Moreover, this study will serve as a foundation for subsequent studies of genome-wide expression analyses during early virus infection and dissecting the molecular mechanisms behind vector competence in midges.,
Provenance, classification, and abundance of RNA sequence fragments used to assess virus infections in honey bees, Apis mellifera
공공데이터포털
Deformed wing virus (DWV) is a major pathogen of concern to apiculture, and recent reports have indicated the local predominance and potential virulence of recombinants between DWV and a related virus, Varroa destructor virus 1 (VDV). However, little is known about the frequency and titer of VDV and recombinants relative to DWV generally. In this study, I assessed the relative occurrence and titer of DWV and VDV in public RNA-seq accessions of honey bee using a rapid, kmer-based approach. Three recombinant types were detectable graphically and corroborated by de novo assembly. Recombination breakpoints did not disrupt the capsid-encoding region, consistent with previous reports, and both VDV- and DWV-derived capsids were observed in recombinant backgrounds. High abundance of VDV kmers was largely restricted to recombinant forms. Non-metric multidimensional scaling identified genotypic clusters among DWV isolates, which was corroborated by read mapping and consensus generation. The recently described DWV-C lineage was not detected in the searched accessions. The data further highlight the utility of high-throughput sequencing to monitor viral polymorphisms and statistically test biological predictors of titer, and point to the need for consistent methodologies and sampling schemes.
Discontinuous and non-discontinuous subgenomic RNA transcription in a nidovirus
공공데이터포털
Arteri-, corona-, toro- and roniviruses are evolutionarily related positive-strand RNA viruses, united in the order Nidovirales. The best studied nidoviruses, the corona- and arteriviruses, employ a unique transcription mechanism, which involves discontinuous RNA synthesis, a process resembling similarity-assisted copy-choice RNA recombination. During infection, multiple subgenomic (sg) mRNAs are transcribed from a mirror set of sg negative-strand RNA templates. The sg mRNAs all possess a short 5′ common leader sequence, derived from the 5′ end of the genomic RNA. The joining of the non-contiguous ‘leader’ and ‘body’ sequences presumably occurs during minus-strand synthesis. To study whether toroviruses use a similar transcription mechanism, we characterized the 5′ termini of the genome and the four sg mRNAs of Berne virus (BEV). We show that BEV mRNAs 3–5 lack a leader sequence. Surprisingly, however, RNA 2 does contain a leader, identical to the 5′-terminal 18 residues of the genome. Apparently, BEV combines discontinuous and non-discontinous RNA synthesis to produce its sg mRNAs. Our findings have important implications for the understanding of the mechanism and evolution of nidovirus transcription.
A sensitive and rapid assay for homologous recombination in mosquito cells: impact of vector topology and implications for gene targeting
공공데이터포털
Background Recent progress in insect transgenesis has been dramatic but existing transposon-based approaches are constrained by position effects and potential instability. Gene targeting would bring a number of benefits, however progress requires a better understanding of the mechanisms involved. Much can be learned in vitro since extrachromosomal recombination occurs at high frequency, facilitating the study of multiple events and the impact of structural changes among the recombining molecules. We have investigated homologous recombination in mosquito cells through restoration of luciferase activity from deleted substrates. The implications of this work for the construction of insect gene targeting vectors are discussed. Results We show that linear targeting vectors are significantly more efficient than circular ones and that recombination is stimulated by introducing double-strand breaks into, or near, the region of homology. Single-strand annealing represents a very efficient pathway but may not be feasible for targeting unbroken chromosomes. Using circular plasmids to mimic chromosomal targets, one-sided invasion appears to be the predominant pathway for homologous recombination. Non-homologous end joining reactions also occur and may be utilised in gene targeting if double-strand breaks are first introduced into the target site. Conclusions We describe a rapid, sensitive assay for extrachromosomal homologous recombination in mosquito cells. Variations in substrate topology suggest that single-strand annealing and one-sided invasion represent the predominant pathways, although non-homologous end joining reactions also occur. One-sided invasion of circular chromosomal mimics by linear vectors might therefore be used in vitro to investigate the design and efficiency of gene targeting strategies.
Annotated reference transcriptome for female Culicoides sonorensis biting midges
공공데이터포털
,Unigene sequences were annotated by BlastX alignment to the non-redundant protein database (National Center for Biotechnology Information/GenBank) and the Aedes aegypti and Culex quinquefasciatus gene annotations (Vectorbase). This was done with a 1e-05 expectation value. Top hits are shown including accession numbers and description, if available. Unigene number and corresponding GenBank accession numbers are provided for all C. sonorensis genes. Both tables are modified from supplementary information tables at http://dx.doi.org/10.1371/journal.pone.0098123.s003 and numbered accordingly.,
Avian papillomaviruses: the parrot
공공데이터포털
Background An avian papillomavirus genome has been cloned from a cutaneous exophytic papilloma from an African grey parrot (Psittacus erithacus). The nucleotide sequence, genome organization, and phylogenetic position of the Psittacus erithacus papillomavirus (PePV) were determined. This PePV sequence represents the first complete avian papillomavirus genome defined. Results The PePV genome (7304 basepairs) differs from other papillomaviruses, in that it has a unique organization of the early protein region lacking classical E6 and E7 open reading frames. Phylogenetic comparison of the PePV sequence with partial E1 and L1 sequences of the chaffinch (Fringilla coelebs) papillomavirus (FPV) reveals that these two avian papillomaviruses form a monophyletic cluster with a common branch that originates near the unresolved center of the papillomavirus evolutionary tree. Conclusions The PePV genome has a unique layout of the early protein region which represents a novel prototypic genomic organization for avian papillomaviruses. The close relationship between PePV and FPV, and between their Psittaciformes and Passeriformes hosts, supports the hypothesis that papillomaviruses have co-evolved and speciated together with their host species throughout evolution.
A search for reverse transcriptase-coding sequences reveals new non-LTR retrotransposons in the genome of
공공데이터포털
Background: Non-long terminal repeat (non-LTR) retrotransposons are eukaryotic mobile genetic elements that transpose by reverse transcription of an RNA intermediate. We have performed a systematic search for sequences matching the characteristic reverse transcriptase domain of non-LTR retrotransposons in the sequenced regions of the Drosophila melanogaster genome. Results: In addition to previously characterized BS, Doc, F, G, I and Jockey elements, we have identified new non-LTR retrotransposons: Waldo, You and JuanDm. Waldo elements are related to mosquito RTI elements. You to the Drosophila I factor, and JuanDm to mosquito Juan-A and Juan-C. Interestingly, all JuanDm elements are highly homogeneous in sequence, suggesting that they are recent components of the Drosophila genome. Conclusions: The genome of D. melanogaster contains at least ten families of non-site-specific non-LTR retrotransposons representing three distinct clades. Many of these families contain potentially active members. Fine evolutionary analyses must await the more accurate sequences that are expected in the next future.
VIDA: a virus database system for the organization of animal virus genome open reading frames
공공데이터포털
VIDA is a new virus database that organizes open reading frames (ORFs) from partial and complete genomic sequences from animal viruses. Currently VIDA includes all sequences from GenBank for Herpesviridae, Coronaviridae and Arteriviridae. The ORFs are organized into homologous protein families, which are identified on the basis of sequence similarity relationships. Conserved sequence regions of potential functional importance are identified and can be retrieved as sequence alignments. We use a controlled taxonomical and functional classification for all the proteins and protein families in the database. When available, protein structures that are related to the families have also been included. The database is available for online search and sequence information retrieval at http://www.biochem.ucl.ac.uk/bsm/virus_database/VIDA.html.