Bitte benutzen Sie diese Referenz, um auf diese Ressource zu verweisen:
Volltext verfügbar? / Dokumentlieferung
doi:10.22028/D291-43376
Titel: | Optimized Data Set and Feature Construction for Substrate Prediction of Membrane Transporters |
VerfasserIn: | Denger, Andreas Helms, Volkhard |
Sprache: | Englisch |
Titel: | Journal of Chemical Information and Modeling |
Bandnummer: | 62 |
Heft: | 23 |
Seiten: | 6242-6257 |
Verlag/Plattform: | ACS |
Erscheinungsjahr: | 2022 |
Freie Schlagwörter: | Biological Transport Carbohydrates Membranes Monomers Peptides And Proteins |
DDC-Sachgruppe: | 500 Naturwissenschaften |
Dokumenttyp: | Journalartikel / Zeitschriftenartikel |
Abstract: | α-Helical transmembrane proteins termed membrane transporters mediate the passage of small hydrophilic substrate molecules across biological lipid bilayer membranes. Annotating the specific substrates of the dozens to hundreds of individual transporters of an organism is an important task. In the past, machine learning classifiers have been successfully trained on pan-organism data sets to predict putative substrates of transporters. Here, we critically examine the selection of an optimal data set of protein sequence features for the classification task. We focus on membrane transporters of the three model organisms Escherichia coli, Arabidopsis thaliana, and Saccharomyces cerevisiae, as well as human. We show that organism-specific classifiers can be robustly trained if at least 20 samples are available for each substrate class. If information from position-specific scoring matrices is included, such classifiers have F1 scores between 0.85 and 1.00. For the largest data set (A. thaliana), a 4-class classifier yielded an Fscore of 0.97. On a pan-organism data set composed of transporters of all four organisms, amino acid and sugar transporters were predicted with an F1 score of 0.91. |
DOI der Erstveröffentlichung: | 10.1021/acs.jcim.2c00850 |
URL der Erstveröffentlichung: | https://pubs.acs.org/doi/10.1021/acs.jcim.2c00850 |
Link zu diesem Datensatz: | urn:nbn:de:bsz:291--ds-433767 hdl:20.500.11880/38902 http://dx.doi.org/10.22028/D291-43376 |
ISSN: | 1549-960X 1549-9596 |
Datum des Eintrags: | 6-Nov-2024 |
Fakultät: | NT - Naturwissenschaftlich- Technische Fakultät |
Fachrichtung: | NT - Biowissenschaften |
Professur: | NT - Prof. Dr. Volkhard Helms |
Sammlung: | SciDok - Der Wissenschaftsserver der Universität des Saarlandes |
Dateien zu diesem Datensatz:
Es gibt keine Dateien zu dieser Ressource.
Alle Ressourcen in diesem Repository sind urheberrechtlich geschützt.