Evaluation of thresholds for the detection of binding sites for regulatory proteins in
공공데이터포털
Background Sites in DNA that bind regulatory proteins can be detected computationally in various ways. Pattern discovery methods analyze collections of genes suspected to be co-regulated on the evidence, for example, of clustering of transcriptome data. Pattern searching methods use sequences with known binding sites to find other genes regulated by a given protein. Such computational methods are important strategies in the discovery and elaboration of regulatory networks and can provide the experimental biologist with a precise prediction of a binding site or identify a gene as a member of a set of co-regulated genes (a regulon). As more variations on such methods are published, however, thorough evaluation is necessary, as performance may differ depending on the conditions of use. Detailed evaluation also helps to improve and understand the behavior of the different methods and computational strategies. Results We used a collection of 86 regulons from Escherichia coli as datasets to evaluate two methods for pattern discovery and pattern searching: dyad analysis/dyad sweeping using the program Dyad-analysis, and multiple alignment using the programs Consensus/Patser. Clearly defined statistical parameters are used to evaluate the two methods in different situations. We placed particular emphasis on minimizing the rate of false positives. Conclusions As a general rule, sensors obtained from experimentally reported binding sites in DNA frequently locate true sites as the highest-scoring sequences within a given upstream region, especially using Consensus/Patser. Pattern discovery is still an unsolved problem, although in the cases where Dyad-analysis finds significant dyads (around 50%), these frequently correspond to true binding sites. With more robust methods, regulatory predictions could help identify the function of unknown genes.
Noncoding RNA gene detection using comparative sequence analysis
공공데이터포털
Background Noncoding RNA genes produce transcripts that exert their function without ever producing proteins. Noncoding RNA gene sequences do not have strong statistical signals, unlike protein coding genes. A reliable general purpose computational genefinder for noncoding RNA genes has been elusive. Results We describe a comparative sequence analysis algorithm for detecting novel structural RNA genes. The key idea is to test the pattern of substitutions observed in a pairwise alignment of two homologous sequences. A conserved coding region tends to show a pattern of synonymous substitutions, whereas a conserved structural RNA tends to show a pattern of compensatory mutations consistent with some base-paired secondary structure. We formalize this intuition using three probabilistic "pair-grammars": a pair stochastic context free grammar modeling alignments constrained by structural RNA evolution, a pair hidden Markov model modeling alignments constrained by coding sequence evolution, and a pair hidden Markov model modeling a null hypothesis of position-independent evolution. Given an input pairwise sequence alignment (e.g. from a BLASTN comparison of two related genomes) we classify the alignment into the coding, RNA, or null class according to the posterior probability of each class. Conclusions We have implemented this approach as a program, QRNA, which we consider to be a prototype structural noncoding RNA genefinder. Tests suggest that this approach detects noncoding RNA genes with a fair degree of reliability.
SNP analysis of the inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) gene by a fluorescence-adapted SSCP method
공공데이터포털
Background Single-nucleotide polymorphisms (SNPs) are considered to be useful polymorphic markers for genetic studies of polygenic traits. Single-stranded conformational polymorphism (SSCP) analysis has been widely applied to detect SNPs, including point mutations in cancer and congenital diseases. In this study, we describe an application of the fluorescent labeling of PCR fragments using a fluorescent-adapted primer for SSCP analysis as a novel method. Methods Single-nucleotide polymorphisms (SNPs) of the inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) gene were analyzed using a fluorescence-adapted SSCP method. The method was constructed from two procedures: 1) a fluorescent labeling reaction of PCR fragments using fluorescence-adapted primers in a single tube, and 2) electrophoresis on a non-denaturing polyacrylamide gel. Results This method was more economical and convenient than the single-stranded conformational polymorphism (SSCP) methods previously reported in the detection of the labeled fragments obtained. In this study, eight SNPs of the IHRP gene were detected by the fluorescence-adapted SSCP. One of the SNPs was a new SNP resulting in an amino acid substitution, while the other SNPs have already been reported in the public databases. Six SNPs of the IHRP were associated with two haplotypes. Conclusions The fluorescence-adapted SSCP was useful for detecting and genotyping SNPs.
Quantitative assessment of the use of modified nucleoside triphosphates in expression profiling: differential effects on signal intensities and impacts on expression ratios
공공데이터포털
Background The power of DNA microarrays derives from their ability to monitor the expression levels of many genes in parallel. One of the limitations of such powerful analytical tools is the inability to detect certain transcripts in the target sample because of artifacts caused by background noise or poor hybridization kinetics. The use of base-modified analogs of nucleoside triphosphates has been shown to increase complementary duplex stability in other applications, and here we attempted to enhance microarray hybridization signal across a wide range of sequences and expression levels by incorporating these nucleotides into labeled cRNA targets. Results RNA samples containing 2-aminoadenosine showed increases in signal intensity for a majority of the sequences. These results were similar, and additive, to those seen with an increase in the hybridization time. In contrast, 5-methyluridine and 5-methylcytidine decreased signal intensities. Hybridization specificity, as assessed by mismatch controls, was dependent on both target sequence and extent of substitution with the modified nucleotide. Concurrent incorporation of modified and unmodified ATP in a 1:1 ratio resulted in significantly greater numbers of above-threshold ratio calls across tissues, while preserving ratio integrity and reproducibility. Conclusions Incorporation of 2-aminoadenosine triphosphate into cRNA targets is a promising method for increasing signal detection in microarrays. Furthermore, this approach can be optimized to minimize impact on yield of amplified material and to increase the number of expression changes that can be detected.
Transcriptional transactivation by selected short random peptides attached to lexA-GFP fusion proteins
공공데이터포털
Background Transcriptional transactivation is a process with remarkable tolerance for sequence diversity and structural geometry. In studies of the features that constitute transactivating functions, acidity has remained one of the most common characteristics observed among native activation domains and activator peptides. Results We performed a deliberate search of random peptide libraries for peptides capable of conferring transcriptional transactivation on the lexA DNA binding domain. Two libraries, one composed of C-terminal fusions, the other of peptide insertions within the green fluorescent protein structure, were used. We show that (i) peptide sequences other than C-terminal fusions can confer transactivation; (ii) though acidic activator peptides are more common, charge neutral and basic peptides can function as activators; and (iii) peptides as short as 11 amino acids behave in a modular fashion. Conclusions These results support the recruitment model of transcriptional activation and, combined with other studies, suggest the possibility of using activator peptides in a variety of applications, including drug development work.
Research Article: BMC Medical Genetics
공공데이터포털
Background SFHR (small fragment homologous replacement)-mediated targeting is a process that has been used to correct specific mutations in mammalian cells. This process involves both chemical and cellular factors that are not yet defined. To evaluate potential of this technique for gene therapy it is necessary to characterize gene transfer efficacy in terms of the transfection vehicle, the genetic target, and the cellular processing of the DNA and DNA-vehicle complex. Methods In this study, small fragments of genomic cystic fibrosis (CF) transmembrane conductance regulator (CFTR) DNA, that comprise the wild-type and ΔF508 sequences, were transfected into immortalized CF and normal airway epithelial cells, respectively. Homologous replacement was evaluated using PCR and sequence-based analyses of cellular DNA and RNA. Individual stages of cationic lipid-facilitated SFHR in cultured cell lines were also examined using transmission electron microscopy (TEM). Results We demonstrated that the lipid/DNA (+/-) ratio influences the mode of entry into the cell and therefore affects the efficacy of SFHR-mediated gene targeting. Lipid/DNA complexes with more negative ratios entered the cell via a plasma membrane fusion pathway. Transfer of the DNA that relies on an endocytic pathway appeared more effective at mediating SFHR. In addition, it was also clear that there is a correlation between the specific cell line transfected and the optimal lipid/DNA ratio. Conclusions These studies provide new insights into factors that underlie SFHR-mediated gene targeting efficacy and into the parameters that can be modulated for its optimization.
Research Article: BMC Genetics
공공데이터포털
Background Blood group genotyping is increasingly utilized for prenatal diagnosis and after recent transfusions, but still lacks the specificity of serology. In whites, the presence of antigen D is predicted, if two or more properly selected RHD-specific polymorphism are detected. This prediction must fail, if an antigen D negative RHD positive allele is encountered. Excluding RHDψ and CdeS frequent only in individuals of African descent, most of these alleles are unknown and the population frequency of any such allele has not been determined. Methods We screened 8,442 antigen D negative blood donations by RHD PCR-SSP. RHD PCR positive samples were further characterized by RHD exon specific PCR-SSP or sequencing. The phenotype of the identified alleles was checked and their frequencies in Germans were determined. Results We detected 50 RHD positive samples. Fifteen samples harbored one of three new Del alleles. Thirty samples were due to 14 different D negative alleles, only 5 of which were previously known. Nine of the 14 alleles may have been generated by gene conversion in cis, for which we proposed a mechanism triggered by hairpin formation of chromosomal DNA. The cumulative population frequency of the 14 D negative alleles was 1:1,500. Five samples represented a D+/- chimera, a weak D and three partial D, which had been missed by routine serology; two recipients transfused with blood of the D+/- chimera donor became anti-D immunized. Conclusion The results of this study allowed to devise an improved RHD genotyping strategy, the false-positive rate of which was lower than 1:10,000. The number of characterized RHD positive antigen D negative and Del alleles was more than doubled and their population frequencies in Europe were defined.