Evaluation of thresholds for the detection of binding sites for regulatory proteins in
공공데이터포털
Background Sites in DNA that bind regulatory proteins can be detected computationally in various ways. Pattern discovery methods analyze collections of genes suspected to be co-regulated on the evidence, for example, of clustering of transcriptome data. Pattern searching methods use sequences with known binding sites to find other genes regulated by a given protein. Such computational methods are important strategies in the discovery and elaboration of regulatory networks and can provide the experimental biologist with a precise prediction of a binding site or identify a gene as a member of a set of co-regulated genes (a regulon). As more variations on such methods are published, however, thorough evaluation is necessary, as performance may differ depending on the conditions of use. Detailed evaluation also helps to improve and understand the behavior of the different methods and computational strategies. Results We used a collection of 86 regulons from Escherichia coli as datasets to evaluate two methods for pattern discovery and pattern searching: dyad analysis/dyad sweeping using the program Dyad-analysis, and multiple alignment using the programs Consensus/Patser. Clearly defined statistical parameters are used to evaluate the two methods in different situations. We placed particular emphasis on minimizing the rate of false positives. Conclusions As a general rule, sensors obtained from experimentally reported binding sites in DNA frequently locate true sites as the highest-scoring sequences within a given upstream region, especially using Consensus/Patser. Pattern discovery is still an unsolved problem, although in the cases where Dyad-analysis finds significant dyads (around 50%), these frequently correspond to true binding sites. With more robust methods, regulatory predictions could help identify the function of unknown genes.
Evidence for large domains of similarly expressed genes in the
공공데이터포털
Background Transcriptional regulation in eukaryotes generally operates at the level of individual genes. Regulation of sets of adjacent genes by mechanisms operating at the level of chromosomal domains has been demonstrated in a number of cases, but the fraction of genes in the genome subject to regulation at this level is unknown. Results Drosophila gene-expression profiles that were determined from over 80 experimental conditions using high-density oligonucleotide microarrays were searched for groups of adjacent genes that show similar expression profiles. We found about 200 groups of adjacent and similarly expressed genes, each having between 10 and 30 members; together these groups account for over 20% of assayed genes. Each group covers between 20 and 200 kilobase pairs of genomic sequence, with a mean group size of about 100 kilobase pairs. Groups do not appear to show any correlation with polytene banding patterns or other known chromosomal structures, nor were genes within groups functionally related to one another. Conclusions Groups of adjacent and co-regulated genes that are not otherwise functionally related in any obvious way can be identified by expression profiling in Drosophila. The mechanism underlying this phenomenon is not yet known.
Transcriptional transactivation by selected short random peptides attached to lexA-GFP fusion proteins
공공데이터포털
Background Transcriptional transactivation is a process with remarkable tolerance for sequence diversity and structural geometry. In studies of the features that constitute transactivating functions, acidity has remained one of the most common characteristics observed among native activation domains and activator peptides. Results We performed a deliberate search of random peptide libraries for peptides capable of conferring transcriptional transactivation on the lexA DNA binding domain. Two libraries, one composed of C-terminal fusions, the other of peptide insertions within the green fluorescent protein structure, were used. We show that (i) peptide sequences other than C-terminal fusions can confer transactivation; (ii) though acidic activator peptides are more common, charge neutral and basic peptides can function as activators; and (iii) peptides as short as 11 amino acids behave in a modular fashion. Conclusions These results support the recruitment model of transcriptional activation and, combined with other studies, suggest the possibility of using activator peptides in a variety of applications, including drug development work.
Research Article: Genome Biology
공공데이터포털
Background Computational predictions are critical for directing the experimental study of protein functions. Therefore it is paradoxical when an apparently erroneous computational prediction seems to be supported by experiment. Results We analyzed six cases where application of novel or conventional computational methods for protein sequence and structure analysis led to non-trivial predictions that were subsequently supported by direct experiments. We show that, on all six occasions, the original prediction was unjustified, and in at least three cases, an alternative, well-supported computational prediction, incompatible with the original one, could be derived. The most unusual cases involved the identification of an archaeal cysteinyl-tRNA synthetase, a dihydropteroate synthase and a thymidylate synthase, for which experimental verifications of apparently erroneous computational predictions were reported. Using sequence-profile analysis, multiple alignment and secondary-structure prediction, we have identified the unique archaeal 'cysteinyl-tRNA synthetase' as a homolog of extracellular polygalactosaminidases, and the 'dihydropteroate synthase' as a member of the β-lactamase-like superfamily of metal-dependent hydrolases. Conclusions In each of the analyzed cases, the original computational predictions could be refuted and, in some instances, alternative strongly supported predictions were obtained. The nature of the experimental evidence that appears to support these predictions remains an open question. Some of these experiments might signify discovery of extremely unusual forms of the respective enzymes, whereas the results of others could be due to artifacts.
Phenotypic silencing of cytoplasmic genes using sequence-specific double-stranded short interfering RNA and its application in the reverse genetics of wild type negative-strand RNA viruses
공공데이터포털
Background Post-transcriptional gene silencing (PTGS) by short interfering RNA has opened up new directions in the phenotypic mutation of cellular genes. However, its efficacy on non-nuclear genes and its effect on the interferon pathway remain unexplored. Since directed mutation of RNA genomes is not possible through conventional mutagenesis, we have tested sequence-specific 21-nucleotide long double-stranded RNAs (dsRNAs) for their ability to silence cytoplasmic RNA genomes. Results Short dsRNAs were generated against specific mRNAs of respiratory syncytial virus, a nonsegmented negative-stranded RNA virus with a cytoplasmic life cycle. At nanomolar concentrations, the dsRNAs specifically abrogated expression of the corresponding viral proteins, and produced the expected mutant phenotype ex vivo. The dsRNAs did not induce an interferon response, and did not inhibit cellular gene expression. The ablation of the viral proteins correlated with the loss of the specific mRNAs. In contrast, viral genomic and antigenomic RNA, which are encapsidated, were not directly affected. Conclusions Synthetic inhibitory dsRNAs are effective in specific silencing of RNA genomes that are exclusively cytoplasmic and transcribed by RNA-dependent RNA polymerases. RNA-directed RNA gene silencing does not require cloning, expression, and mutagenesis of viral cDNA, and thus, will allow the generation of phenotypic null mutants of specific RNA viral genes under normal infection conditions and at any point in the infection cycle. This will, for the first time, permit functional genomic studies, attenuated infections, reverse genetic analysis, and studies of host-virus signaling pathways using a wild type RNA virus, unencumbered by any superinfecting virus.
Doxycycline-induced expression of sense and inverted-repeat constructs modulates
공공데이터포털
Background A tetracycline-regulated (conditional) system for RNA interference (RNAi) would have many practical applications. Such a strategy was developed using RNAi of the gene for phosphogluconate mutase (Pgm). Pgm is a candidate lifespan regulator: PgmS allele frequency is increased by selection for increased lifespan, whereas PgmM and PgmF allele frequencies are decreased. Results The Pgm alleles were cloned and sequenced and were found to differ by amino-acid substitutions consistent with the relative electrophoretic mobilities of the proteins. The 'tet-on' doxycycline-regulated promoter system was used to overexpress PgmS in a wild-type (PgmM) background. Enzyme activity increases of two- to five-fold were observed in five independent transgenic lines. Tet-on was also used to drive expression of an inverted-repeat fragment of Pgm coding region. The inverted-repeat transcript was expected to form a dsRNA hairpin, induce RNAi, and thereby reduce endogenous Pgm gene expression at the RNA level. Endogenous Pgm RNA levels in adult flies were found to be reduced or eliminated by doxycycline treatment in five independent inverted-repeat transgenic lines. Our results show that doxycycline-regulated expression of inverted-repeat constructs can cause a conditional reduction in specific gene expression. The effect of sense and inverted-repeat construct expression on lifespan was assayed in multiple transgenic lines. Under the conditions tested, altered Pgm gene expression had no detectable effect on adult Drosophila lifespan. Conclusions A system for conditional RNAi in Drosophila adults shows promise for assay of gene functions during aging. Our results indicate that Pgm does not have a simple strong effect on longevity.
Research Article: BMC Genetics
공공데이터포털
Background To date, in eukaryotes, ribosomal protein expression is known to be regulated at the transcriptional and/or translational levels. But other forms of regulation may be possible. Results Here, we report the successful tagging of functional ribosomal particles with a S7-GFP chimaeric protein, making it possible to observe in vivo ribosome dynamics in the filamentous fungus Podospora anserina. Microscopic observations revealed a novel kind of ribosomal protein regulation during the passage between cell growth and stationary phases, with a transient accumulation of ribosomal proteins and/or ribosome subunits in the nucleus, possibly the nucleolus, being observed at the beginning of stationary phase. Conclusion Nuclear sequestration can be another level of ribosomal protein regulation in eukaryotic cells.This may contribute to the regulation of cell growth and division.
Screening for sequence-specific RNA-BPs by comprehensive UV crosslinking
공공데이터포털
Background Specific cis-elements and the associated trans-acting factors have been implicated in the post-transcriptional regulation of gene expression. In the era of genome wide analyses identifying novel trans-acting factors and cis-regulatory elements is a step towards understanding coordinated gene expression. UV-crosslink analysis is a standard method used to identify RNA-binding proteins. Uridine is traditionally used to radiolabel substrate RNAs, however, proteins binding to cis-elments particularly uridine poor will be weakly or not detected. We evaluate here the possibility of using UV-crosslinking with RNA substrates radiolabeled with each of the four ribonucleotides as an approach for screening for novel sequence specific RNA-binding proteins. Results The radiolabeled RNA substrates were derived from the 3'UTRs of the cloned Eg and c-mos Xenopus laevis maternal mRNAs. Specific, but not identical, uv-crosslinking signals were obtained, some of which corresponded to already identified proteins. A signal for a novel 90 kDa protein was observed with the c-mos 3'UTR radiolabeled with both CTP and GTP but not with UTP. The binding site of the 90 kDa RNA-binding protein was localised to a 59-nucleotide portion of the c-mos 3'UTR. Conclusion That the 90 kDa signal was detected with RNAs radiolabeled with CTP or GTP but not UTP illustrates the advantage of radiolabeling all four nucleotides in a UV-crosslink based screen. This method can be used for both long and short RNAs and does not require knowledge of the cis-acting sequence. It should be amenable to high throughput screening for RNA binding proteins.
Prediction of co-regulated genes in
공공데이터포털
Background Identification of co-regulated genes is essential for elucidating transcriptional regulatory networks and the function of uncharacterized genes. Although co-regulated genes should have at least one common sequence element, it is generally difficult to identify these genes from the presence of this element because it is very easily obscured by noise. To overcome this problem, we used conserved information from three closely related species: Bacillus subtilis, B. halodurans and B. stearothermophilus. Results Even though such species have a limited number of clearly orthologous genes, we obtained 1,884 phylogenetically conserved elements from the upstream intergenic regions of 1,568 B. subtilis genes. Similarity between these elements was used to cluster these genes. No other a priori knowledge on genes and elements was used. We could identify some genes known or suggested to be regulated by a common transcription factor as well as genes regulated by a common attenuation effector. Conclusions We confirmed that our method generates relatively few false positives in clusters with higher scores and that general elements such as -35/-10 boxes and Shine-Dalgarno sequence are not major obstacles. Moreover, we identified some plausible additional members of groups of known co-regulated genes. Thus, our approach is promising for exploring potentially co-regulated genes.