author:"Agostini, Federico" | Pollux - Fachinformationsdienst Politikwissenschaft

Open Access#1

SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences

Agostini, Federico, 1985-; Cirillo, Davide; Delli Ponti, Riccardo, 1987-; Tartaglia, Gian Gaetano

BACKGROUND: The large amount of data produced by high-throughput sequencing poses new computational challenges. In the last decade, several tools have been developed for the identification of transcription and splicing factor binding sites. RESULTS: Here, we introduce the SeAMotE (Sequence Analysis of Motifs Enrichment) algorithm for discovery of regulatory regions in nucleic acid sequences. SeAMotE provides (i) a robust analysis of high-throughput sequence sets, (ii) a motif search based on pattern occurrences and (iii) an easy-to-use web-server interface. We applied our method to recently published data including 351 chromatin immunoprecipitation (ChIP) and 13 crosslinking immunoprecipitation (CLIP) experiments and compared our results with those of other well-established motif discovery tools. SeAMotE shows an average accuracy of 80% in finding discriminative motifs and outperforms other methods available in literature. CONCLUSIONS: /nSeAMotE is a fast, accurate and flexible algorithm for the identification of sequence patterns involved in protein-DNA and protein-RNA recognition. The server can be freely accessed at http://s.tartaglialab.com/new_submission/seamote. ; The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013), through the European Research Council, under grant agreement RIBOMYLOME_309545, and from the Spanish Ministry of Economy and Competitiveness/n(SAF2011-26211). We also acknowledge support from the Spanish Ministry of Economy and Competitiveness, 'Centro de Excelencia Severo Ochoa 2013–2017' (SEV-2012-0208)

Zugriff(Open Access)

BASE

Exportieren

Open Access#2

ccSOL omics: a webserver for solubility prediction of endogenous and heterologous expression in Escherichia coli

Agostini, Federico, 1985-; Cirillo, Davide; Livi, Carmen Maria; Ponti, Riccardo delli; Tartaglia, Gian Gaetano

SUMMARY: Here we introduce ccSOL omics, a webserver for large-scale calculations of protein solubility. Our method allows (i) proteome-wide predictions; (ii) identification of soluble fragments within each sequences; (iii) exhaustive single-point mutation analysis. RESULTS: Using coil/disorder, hydrophobicity, hydrophilicity, β-sheet and α-helix propensities, we built a predictor of protein solubility. Our approach shows an accuracy of 79% on the training set (36 990 Target Track entries). Validation on three independent sets indicates that ccSOL omics discriminates soluble and insoluble proteins with an accuracy of 74% on 31 760 proteins sharing <30% sequence similarity. AVAILABILITY AND IMPLEMENTATION: ccSOL omics can be freely accessed on the web at http://s.tartaglialab.com/page/ccsol_group. Documentation and tutorial are available at http://s.tartaglialab.com/static_files/shared/tutorial_ccsol_omics.html. CONTACT: gian.tartaglia@crg.es/nSUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. ; The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007–2013), through the European Research Council,under grant agreement RIBOMYLOME 309545, and from the Spanish Ministry of Economy and Competitiveness (SAF2011-26211). We also acknowledge support from the Spanish Ministry of Economy and Competitiveness, 'Centro de Excelencia Severo Ochoa 2013–2017' (SEV-2012-0208)

Zugriff(Open Access)

BASE

Exportieren

Open Access#32021

Comparative Genomics, Evolution, and Drought-Induced Expression of Dehydrin Genes in Model Brachypodium Grasses

Decena, María Angeles; Galvez-Rojas, Sergio; Agostini, Federico; Sancho, Rubén; Contreras-Moreira, Bruno; Des Marais, David L; Hernández Molina, Pilar; Catalán, Pilar

29 Pags.- 6 Figs. © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. ; Dehydration proteins (dehydrins, DHNs) confer tolerance to water-stress deficit in plants. We performed a comparative genomics and evolutionary study of DHN genes in four model Brachypodium grass species. Due to limited knowledge on dehydrin expression under water deprivation stress in Brachypodium, we also performed a drought-induced gene expression analysis in 32 ecotypes of the genus' flagship species B. distachyon showing different hydric requirements. Genomic sequence analysis detected 10 types of dehydrin genes (Bdhn) across the Brachypodium species. Domain and conserved motif contents of peptides encoded by Bdhn genes revealed eight protein architectures. Bdhn genes were spread across several chromosomes. Selection analysis indicated that all the Bdhn genes were constrained by purifying selection. Three upstream cis-regulatory motifs (BES1, MYB124, ZAT) were detected in several Bdhn genes. Gene expression analysis demonstrated that only four Bdhn1-Bdhn2, Bdhn3, and Bdhn7 genes, orthologs of wheat, barley, rice, sorghum, and maize genes, were expressed in mature leaves of B. distachyon and that all of them were more highly expressed in plants under drought conditions. Brachypodium dehydrin expression was significantly correlated with drought-response phenotypic traits (plant biomass, leaf carbon and proline contents and water use efficiency increases, and leaf water and nitrogen content decreases) being more pronounced in drought-tolerant ecotypes. Our results indicate that dehydrin type and regulation could be a key factor determining the acquisition of water-stress tolerance in grasses. ; This research was funded by Spanish Ministry of Science and Innovation grant number PID2019-108195GB-I00, European Social Fund/Spanish Aragón Government grant number A01-20R, Spanish Junta de Andalucía grant number P18-RT-992, USDA grant number NIFA-2011-67012- 30663. MD was funded by a Spanish Mineco FPI PhD fellowship. BCM was funded by Spanish Fundación ARAID. ; Peer reviewed

Zugriff(Open Access)

BASE

Exportieren

Open Access#42020

RADICL-seq identifies general and cell type–specific principles of genome-wide RNA-chromatin interactions

Mammalian genomes encode tens of thousands of noncoding RNAs. Most noncoding transcripts exhibit nuclear localization and several have been shown to play a role in the regulation of gene expression and chromatin remodeling. To investigate the function of such RNAs, methods to massively map the genomic interacting sites of multiple transcripts have been developed; however, these methods have some limitations. Here, we introduce RNA And DNA Interacting Complexes Ligated and sequenced (RADICL-seq), a technology that maps genome-wide RNA–chromatin interactions in intact nuclei. RADICL-seq is a proximity ligation-based methodology that reduces the bias for nascent transcription, while increasing genomic coverage and unique mapping rate efficiency compared with existing methods. RADICL-seq identifies distinct patterns of genome occupancy for different classes of transcripts as well as cell type–specific RNA-chromatin interactions, and highlights the role of transcription in the establishment of chromatin structure. ; This work was funded by a Research Grant from the Ministry of Education, Culture, Sports, Science and Technology (MEXT), Japan, to the RIKEN Center for Life Science Technologies (http://www.mext.go.jp/en/). This work was also supported by the Francis Crick Institute, UK, which receives its core funding from Cancer Research UK (FC010110), the UK Medical Research Council (FC010110), and the Wellcome Trust (FC010110). N.M.L. is a Winton Group Leader in recognition of the Winton Charitable Foundation's support towards the establishment of the Francis Crick Institute. N.M.L. isadditionally funded by a Wellcome Trust Joint Investigator Award (103760/Z/14/Z) and the MRC eMedLab Medical Bioinformatics Infrastructure Award (MR/L016311/1). Work in G.C.-B.'s laboratory was supported by the European Union (Horizon 2020 European Research Council Consolidator Grant EPIScOPE), Swedish Research Council (no. 2015-03558), Swedish Brain Foundation (no. FO2017-0075), and Ming Wai Lau Centre for Reparative Medicine, Hong Kong. E.A. was supported by European Union, Horizon 2020, Marie-Skłodowska Curie Actions, grant SOLO no. 794689. Y.A.M. was partially supported by RSF grant 18-14-00240 and the Russian Ministry for Science and Higher Education. Work in V.O.'s laboratory (J.G. and V.O.) was supported by grants from the European Union FP7 (InteGeR Marie Curie Initial Training Network and MODHEP), the Italian Ministry of Education, University and Research MIUR and the National Research Center CNR (Epigen), and grant from KAUST BAS01-01-37. Open access funding provided by Karolinska Institute.

Zugriff(Open Access)

BASE

Exportieren

Filter

Format

Medientyp

Sprache

Jahre

SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences

ccSOL omics: a webserver for solubility prediction of endogenous and heterologous expression in Escherichia coli

Comparative Genomics, Evolution, and Drought-Induced Expression of Dehydrin Genes in Model Brachypodium Grasses

RADICL-seq identifies general and cell type–specific principles of genome-wide RNA-chromatin interactions

Suchergebnisse

Filter

Format

Medientyp

Sprache

Jahre

SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences

ccSOL omics: a webserver for solubility prediction of endogenous and heterologous expression in Escherichia coli

Comparative Genomics, Evolution, and Drought-Induced Expression of Dehydrin Genes in Model Brachypodium Grasses

RADICL-seq identifies general and cell type–specific principles of genome-wide RNA-chromatin interactions

Kontakt

Hilfe