Bitte benutzen Sie diese Referenz, um auf diese Ressource zu verweisen:
doi:10.22028/D291-46387
Titel: | A comprehensive review and evaluation of species richness estimation |
VerfasserIn: | Elena Schmitz, Johanna Rahmann, Sven |
Sprache: | Englisch |
Titel: | Briefings in Bioinformatics |
Bandnummer: | 26 |
Heft: | 2 |
Verlag/Plattform: | Oxford University Press |
Erscheinungsjahr: | 2025 |
Freie Schlagwörter: | species richness diversity estimation upsampling immune repertoire microbiome comparative evaluation |
DDC-Sachgruppe: | 004 Informatik |
Dokumenttyp: | Journalartikel / Zeitschriftenartikel |
Abstract: | Motivation: The statistical problem of estimating the total number of distinct species in a population (or distinct elements in a multiset), given only a small sample, occurs in various areas, ranging from the unseen species problem in ecology to estimating the diversity of immune repertoires. Accurately estimating the true richness from very small samples is challenging, in particular for highly diverse populations with many rare species. Depending on the application, different estimation strategies have been proposed that incorporate explicit or implicit assumptions about either the species distribution or about the sampling process. These methods are scattered across the literature, and an extensive overview of their assumptions, methodology, and performance is currently lacking. Results: We comprehensively review and evaluate a variety of existing methods on real and simulated data with different compositions of rare and abundant species. Our evaluation shows that, depending on species composition, different methods provide the most accurate richness estimates. Simple methods based on the observed number of singletons yield accurate asymptotic lower bounds for several of the tested simulated species compositions, but tend to underestimate the true richness for heterogeneous populations and small samples containing 1% to 5% of the population. When the population size is known, upsampling (extrapolating) estimators such as PreSeq and RichnEst yield accurate estimates of the total species richness in a sample that is up to 10 times larger than the observed sample. Availability: Source code for data simulation and richness estimation is available at https://gitlab.com/rahmannlab/speciesrichness. |
DOI der Erstveröffentlichung: | 10.1093/bib/bbaf158 |
URL der Erstveröffentlichung: | https://doi.org/10.1093/bib/bbaf158 |
Link zu diesem Datensatz: | urn:nbn:de:bsz:291--ds-463875 hdl:20.500.11880/40661 http://dx.doi.org/10.22028/D291-46387 |
ISSN: | 1477-4054 1467-5463 |
Datum des Eintrags: | 7-Okt-2025 |
Bezeichnung des in Beziehung stehenden Objekts: | Supplementary data |
In Beziehung stehendes Objekt: | https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/bib/26/2/10.1093_bib_bbaf158/1/supplement_bbaf158.pdf?Expires=1762845776&Signature=3eKPryW68XKubVNhM8~iZ1OdjyLr4RARRX~YkMjVKRlHIqH6~4185B7ZWBn~DZv653LoIj8BRfNqXMsJrTxQRq97DdFbqc~kvH4mpvzBgsUlx5XDSdTAxlq6CQhqIuim0K2fa0Jww09xuCvHUcq7USXeNv9m4~5ZisFvLnp~e6AxsurSQUDZSKRkfDXzrBBWxUGqiZtnbkmQEwBcEwXkRiLrRiDgWHET8lC~UG5zdts5A-~rzVraAdhXH4tMYd-vAwSH8xg0MXyhRnADAn4t3LXPgNdzwJmoLIw4g7sjM-xntTpoFADuyjkv6m6~S5wvOa28JfUHkS~6OIiE26BT9A__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA |
Fakultät: | MI - Fakultät für Mathematik und Informatik |
Fachrichtung: | MI - Informatik |
Professur: | MI - Prof. Dr. Sven Rahmann |
Sammlung: | SciDok - Der Wissenschaftsserver der Universität des Saarlandes |
Dateien zu diesem Datensatz:
Datei | Beschreibung | Größe | Format | |
---|---|---|---|---|
bbaf158.pdf | 1,51 MB | Adobe PDF | Öffnen/Anzeigen |
Diese Ressource wurde unter folgender Copyright-Bestimmung veröffentlicht: Lizenz von Creative Commons