Biodiversity Data Journal :
Data Paper (Biosciences)
|
Corresponding author: Franziska Baden-Böhm (franziska.baden-boehm@thuenen.de), Mario App (mario.app@thuenen.de)
Academic editor: Paolo Biella
Received: 09 Mar 2022 | Accepted: 28 Jul 2022 | Published: 14 Sep 2022
© 2022 Franziska Baden-Böhm, Mario App, Jan Thiele
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Baden-Böhm F, App M, Thiele J (2022) The FloRes Database: A floral resources trait database for pollinator habitat-assessment generated by a multistep workflow. Biodiversity Data Journal 10: e83523. https://doi.org/10.3897/BDJ.10.e83523
|
The decline of pollinating insects in agricultural landscapes proceeds due to intensive land use and the associated loss of habitat and food sources. The feeding of those insects depends on the spatial and temporal distribution of nectar and pollen as food resource. Hence, to protect insect biodiversity, a spatio-temporal assessment of food quantity of their habitats is necessary. Therefore, sufficient data on traits of floral resources are required.
As floral resources’ traits of plants are important to quantify food availability, we present two databases, the FloRes Database (Floral Resources Database) and the raw database, from where FloRes was derived. Both databases contain the plant traits: (1) flowering period, (2) floral-unit density per day, (3) nectar volume per floral unit per day, (4) sugar content per floral unit, (5) sugar concentration in nectar, (6) pollen mass or volume per floral unit and per day, (7) protein content of pollen and (8) corolla depth. All traits were sampled from literature and online databases. The raw database consists of 702 specified plant species, 138 unspecified species 37 species (spec., sp), 22 species pluralis (spp) and, for 79, only the genus was identified) and two species complexes (agg.). Those 842 taxa belong to 488 genera and 102 families. Finally, only 27 taxa have a complete set of traits, too few for a sufficient assessment of spatio-temporal availability of floral food-resources.
As information on floral resources is scattered throughout many publications with different units, we also present our multistep workflow implemented in five consecutive R-scripts. The multistep workflow standardises the trait units of the raw database to comparable entities with identical units and aggregates them on a reasonable taxonomic level into the second application database, the FloRes Database. Finally, the FloRes Database contains aggregated information of traits for 42 taxa and, when corolla depth is excluded, for 72 taxa.
This is the first attempt to gather these eight traits from different literature sources into one database with a multistep workflow. The publication of the multistep workflow enables the users to extend the FloRes Database on their own demands with other literature data or newly-gathered data to improve quantification of food resources. Especially, the combination of pollen, nectar and the open flowers per square metre is, as far as we know, a novelty.
The FloRes Database can be used to evaluate the quantity of food-resource habitats available for pollinators, for example, to compare seed mixtures of agri-environmental measures, such as flower strips, considering flower phenology on a daily basis.
pollinators, bumblebees, hoverflies, floral resource, pollen, nectar, corolla, phenology, sugar concentration, protein, habitat assessment
The intensive management of land, the associated loss of feeding, shelter and nesting habitats (
Pollinators, such as bees and hoverflies, rely on nectar as an energy source for movement and vital processes, as well as on pollen for reproduction (
Consequently, the spatio-temporal quantification of nectar and pollen supply for pollinators demands knowledge of:
Thus, information of phenology, floral unit density, nectar volumes or sugar amount of floral unit, sugar concentration, pollen per floral unit, protein content of pollen and corolla depth, based on literature and existing databases of plant traits, is required.
Already existing data (e.g.
Hence, we compiled a raw database composed of the demanded floral resource traits, based on literature and existing databases of plant traits, to increase the amount of plant taxa. Data on phenology and abundance are comparatively easy to acquire and, therefore, well available, but data on floral resources and corolla depth are scattered. As the raw database contains very few species with a complete set of traits and often traits do not refer to same level of the inflorescence (e.g. single flower or capitula, umbel etc.), we generated a second application database, the FloRes Database through a multistep workflow. This multistep workflow sets nectar and pollen in relation to the floral unit and then aggregates them on a reasonable taxonomic level. A floral unit is defined from the perspective of the insects as the number of flowers that can be visited without flying, ranging from a single flower up to thousands (
Thus, with FloRes Database, we start filling a knowledge gap about the floral resources provided to pollinators and their spatio-temporal provisioning to pollinators. Our database enables us to quantify nectar and pollen per habitat area and day throughout the seasons, allowing us to find temporal provisioning gaps. Due to the spatial connection of pollen and nectar with floral units per area, we are also able to estimate the floral resources in whole habitats and landscapes. The floral units per area can be easily divided through the specific habitat cover percentage of a plant to achieve this. In this way, food sources in habitats, such as semi-natural habitats and agri-environment measures, can better be described and assessed for pollinators. Such data can also be useful to compare flowering habitats or seed mixtures (
We collected data for eight floral traits (Table
Trait |
Coded in database |
Description |
Units raw database |
Units FloRes database |
Physiological traits |
||||
Phenology |
flowering flowering_start flowering_end |
Flower life span and flowering start and end given as day of the year |
d |
d |
Flower unit density |
flowers_m2 |
The number of open single flowers or inflorescences per square metre [m²] |
single flowers/m² or inflorescences/m² |
floral units /m² |
Floral resources |
||||
Nectar volume |
nectar_volume |
The nectar volume per single flower or inflorescences |
µl/d, ml/d, l/(m²d) |
ml/d |
Sugar concentration in nectar |
sugar_conc |
The concentration of sugar in nectar in mol per litre and percentage |
mol l-1, % |
mol l-1, % |
Sugar per flower |
nectar_sugar_cont |
The absolute sugar content of nectar per single flower or inflorescences |
µg/d, mg/d |
mg/d |
Pollen quantity |
pollen |
The pollen per single flower or inflorescences |
µg, mg, g, g/m², µl, |
mg |
Protein content of pollen |
protein |
The amount of protein in pollen |
%, g/100g dry mass |
% |
Resource availability |
||||
Corolla depth |
corolla |
The depth of the corolla tube |
mm |
mm |
For the raw database, some input data of the traits had to be adapted. We calculated the average molar sugar concentration per species from the data of
For the quantitative traits, we gathered minimum, maximum and mean values, if available. With the traits 'pollen', 'nectar volume', 'sugar per flower' and 'flower' or 'inflorescence density', we recorded the flower unit they referred to, i.e. either per single flower or per inflorescence. The reference flower unit is very important for scaling nectar volume, nectar sugar content and pollen to the same flower unit, enabling merging and aggregation of trait data from different sources. Furthermore, the nomenclature of species varied in literature. Therefore, we equalised the species names in our database in column 'species' in our database, but we also included the names used in the original publications to facilitate joins and backtrackings with the data source (column 'species_name_reference' in our database).
Data preparation and multistep workflow
We compiled the FloRes Database so as to include as many species/taxa with a complete set of traits as possible through a multistep workflow in R 4.1.0 (
The first and second script were used to convert the data to equal units, whereas scripts three to five were used to aggregate and combine the trait data on the most suitable taxonomic level, preferably on species level (Column 'taxon' in our database). However, we could frequently aggregate only on genus level.
Conversions of units
Throughout the literature, flowers were defined in different units and were given in different units. Therefore, the flower units needed to be transformed to the same definition of floral units (Carvalheiro et al. 2008) per area. This enabled us to scale pollen and nectar availabilty correctly per area.
For assessing the quantity of floral resources on habitat scale, the quantity of pollen or nectar (sugar) per standardised area is needed. In the raw data, the floral reference unit, either single flower or inflorescence, sometimes varied between flower-unit density, nectar and pollen data. For simplification the terms raceme, panicle, corymb, globular raceme, umbel or catkin in the database of
\(fu_A=f_A/f_I \) (1)
Floral resources Rf, i.e. pollen or nectar, were multiplied with the number of open flowers per floral unit fi (
\(R_{fu}=P_f⋅f_i\) (2)
For open flowers per inflorescence of Helianthus annuus, we used data from
For floral units per area, we used the maximum value, not the mean, when there were multiple values per species. Here, we used the maximum density as an approximation for 100% cover of the plant species. This allowed us to scale the floral resource per square metre in a given habitat, when the habitat specific cover percentages of plant species were available. The floral unit density per area needed to be divided through the cover percentage.
Mostly, nectar was measured as secretion of liquid per flower and day [volume flower-1 d-1] (e.g.
To receive sugar in mass per flower mz [mg], we multiplied nectar volume Vnec [ml] by density of saccharose ρz [1570 mg ml-1] and sugar concentration cperc [%]:
\(m_z = \rho_z V_{nec} \frac{c_{perc}}{100}\) (3)
When only molar concentration of sugar cmol [mol l-1] was given, we used the following equation with molar mass of saccharose Mz [342.3 mg mol-1]:
\(m_z = V_{nec} M_z \frac{c_{mol}}{1000}\) (4)
To calculate the nectar volume Vnec [ml] from sugar content per floral unit mz [mg], we used:
\(V_{nec} = \frac{m_z} {\rho_z⋅c_{perc}} 100\) (5)
When cperc was not given, we used cmol to calculate Vnec:
\(V_{nec} = \frac{m_z} {M_z c_{mol}} 1000\) (6)
For our application, we needed the sugar concentration in molar concentration, so we transformed cperc to molar concentration as:
\(c_{mol} = c_{perc} \frac{\rho_z} {M_z} 10\) (7)
For pollen, the physical units differed due to extraction methods. Mostly, the mass of pollen was given and the values only needed to be scaled to mg, if given in g or µg. However, sometimes it was given as estimated volume of pollen grains (
\(m_p=V_p * \rho_p\) (8)
to calculate pollen mass mp [mg] from the pollen volume Vp [ml] and the pollen density ρp [mg ml-1]. Since for most species, the density of fresh pollen ρp is not known, we used:
\(\rho_p = \rho_{prot} P_{prot} + \frac{(1 - P_{prot})}{3} (\rho_{starch} + \rho_{fat} + \rho_{water})\) (9)
with mean densities of protein ρprot [1300 mg ml-1] (
Aggregation of data, replacement of synonyms and FloRes Database
After equalising floral, physical and chemical units (script: 2_units.R), we aggregated the traits using the mean of multiple entries per taxon, except for the case of floral units per area, where we used the maximum value to receive an approximate density of floral units at 100% coverage of the species (script: 3_Aggregate_species.R). Subsequently, we checked the species for completeness of traits and grouped closely-related taxa with incomplete, but complementary, trait information in a table (Taxa_to_aggregate.csv) for further aggregation on genus level or a reasonable higher taxonomic level. We used this table to add how plant species should be automatically aggregated in the script 4_Selecting_taxa.R. Moreover, we used this step to aggregate the synonymous names of a species with their common name or on a higher taxonomic level. Finally, we aggregated the traits a second time by the selected taxon (species, genus or higher level). In cases where values for molar sugar concentration were still lacking after Step 5, we inserted the average value of 40% of sugar concentration as an estimate for wildflowers, as given in
After the final aggregation, we got three different output tables for the FloRes Database. “5_FloRes_raw” contains the mean values of all taxa for which at least some trait data were available. “5_FloRes_complete_trait” is the dataset of taxa without any gaps. “5_Selected_taxa_no_corolla” contains taxa where all traits, except for corolla depths, were complete. Those datasets can finally be used to calculate the amount of nectar and pollen of habitats within any defined time period, given that the plant species of the habitats are included in the database.
The database is a collection of data from the Northern Hemisphere, focused on Central Europe. The details about the geographical information of the raw database references are listed in https://datadryad.org/stash/share/pYjuf_kRaA0N9Lw25svZa_rnQ_mENIIyQAC2rkXicEI.
The raw data, all intermediate and auxillary datasets and the final FloRes Database are published in the Dryad repository.
In the raw database, the same traits are covered, but the units and the dependent flower units are given in extra columns ending on the ”_unit” and “_regarding_flowering_unit”. Further, the literature citation is given in the column ending with “_references”.
Column label | Column description |
---|---|
Phenology | Flower life span and flowering start and end given as day of the year [d]. |
Flower unit density | The number of single flowers or inflorescences per square metre [m²]. |
Nectar volume | The nectar volume per single flower or inflorescences [ml]. |
Sugar concentration in nectar | The concentration of sugar in nectar in mol per litre [mol l-1] or percentage [%]. Sugar is assumed to be pure saccharose. |
Sugar per flower | The absolute sugar content of nectar per single flower or inflorescences [mg]. |
Pollen | The pollen per single flower or inflorescences [mg or ml]. |
Protein content of pollen | The amount of protein in pollen [%]. |
Corolla depth | The depth of the corolla tube [mm]. |
All scripts used for generating the FloRes Database from the raw data.
Column label | Column description |
---|---|
none | none |
Raw database
The raw database consists of 702 specified plant species, 138 unspecified species (37 species (spec., sp), 22 species pluralis (spp) and, for 79, only the genus was identified) and two species complexes (agg.). Synonyms of species names are not counted as the same species. All 843 taxa belong to 448 genera and 102 families.
Most of the collected species had either data for one or few traits (Fig.
Number of species in the raw database which had at least one entry per trait. The traits comprised phenology, floral unit density per area, corolla depth, nectar volume per floral unit and day, sugar concentration in nectar, pollen per floral unit and day, as well as protein content of pollen.
Hence, most of the species were insufficiently provided with trait data. Therefore, it was necessary to combine and aggregate species on a reasonable taxonomic level for a comprehensive habitat assessment.
FloRes Database
After aggregating and combining the traits of the same species or closely-related taxa, 42 taxa with a complete set of traits remained in the FloRes Database. Those taxa belonged to 38 genera and 17 families. When excluding corolla depth, the numbers increased to 72 taxa from 63 genera and 22 families.
All traits varied strongly amongst the taxa (Fig.
The distribution of magnitude of each floral trait which is contained in FloRes Database (Floral Resources Database) using only the 42 plant taxa with entries in all traits. The lower/upper boundary of the boxes shows the 25%/75% quantile and the line dividing the box represents the median. All original values per floral traits are depicted as point scattering over the boxplot.
Pearson coefficient correlations between the species’ traits. Red colour scale (-1 to 0) indicates negative linear correlation and blue colour scale (0 to 1) positive linear correlation between two traits. White indicates no linear correlation (0) between two traits. The stronger the relationship, the darker the colour and the circle shape becomes more and more elliptical.
We did not collect our own data in the field or laboratory, but we gathered trait values from different sources. Thus, we often did not know if the density of the floral units referred to 100% cover of the plant species in its habitat. When not specified, we assumed the highest given density as 100% cover, which is only a rough estimation. Additionally, it was unknown in which habitats the flowers per area were counted. Therefore, an accurate estimation of nectar and pollen supply on habitat levels is hampered. In addition, the volume of nectar per flower varies per day and also within the day. The diurnal rhythm was not considered. Further, the sugar content in nectar depends on the soil moisture and air humidity (
Frequently, pollen is given in grains or volume and without exact measurements of pollen densities. Therefore, the values of pollen mass derived from volume are rough estimates, because the fat-carbohydrate-protein composition of pollen is mostly unknown. As well, there was very little information about anther position, which may limit the physiological accessibility of the pollen (
Hence, in its current state, the FloRes Database can provide a rough estimation on quantity of species-specific floral food resources.
Potential application of the database is the description and evaluation of the quantity of available food resources plant species provide on a habitat scale.
This allows us evaluate the temporally available floral resources in a given time period of, for example, days, weeks or months of existing seed mixtures for flower strips or other agri-environmental measures as similary is done in
In our own research, we applied the FloRes Database to generate input data of nectar and pollen supply of habitats for spatial and temporal explicit simulation models of bumblebee and hoverfly populations, to evaluate the effects of landscape composition and configuration on both species. For bumblebees, we used the agent-based model (ABM) BumbleBEEHAVE with the model BEESCOUT_2.0 (
Finally, the described workflow and the published scripts allow us and other users to easily expand and improve the FloRes Database by simply adding new lines to the raw database. This will facilitate a steady increase of bundled information of floral resources to improve the assessments of spatio-temporal food availability in habitats for pollinators.
We thank Jens Dauber for the suggestion of creating a database about pollen and nectar resources of plant species. We also are grateful to Niels Hellwig for checking the R scripts regarding comprehensibility and consistency.
The database was created in collaboration as part of the project 'Future Resources, Agriculture & Nature Conservation' (Für Ressourcen, Agrarwirtschaft & Naturschutz mit Zukunft, F.R.A.N.Z.) and the joint project 'Monitoring of biological diversity in agricultural landscapes' (Monitoring der biologische Vielfalt in Agrarlandschaften, MonViA). F.R.A.N.Z. has been funded by the Landwirtschaftliche Rentenbank (817759) with support from the Federal Ministry of Food and Agriculture (Bundesministerium für Landwirtschaft und Ernährung, BMEL) and the Federal Office for Agriculture and Food (Bundesanstalt für Landwirtschaft und Ernährung, BLE). MonViA has been funded by the BMEL.
FBB and MA share the first authorship. The conception of both databases and the multistep workflow is from FBB and MA. FBB researched for the data of floral resource and compiled it into the raw database. MA contributed the R scripts and calculations for the multistep workflow. FBB, MA and JT contributed to writing and editing of the final manuscript. All authors have read and agreed to the published version of the manuscript.