Biodiversity Data Journal :
Data Paper (Biosciences)
|
Corresponding author: Michael Rudolf Wohlwend (m.wohlwend@gmx.de)
Academic editor: Anatoliy Khapugin
Received: 14 Apr 2021 | Accepted: 14 Jun 2021 | Published: 20 Jul 2021
© 2021 Michael Wohlwend, Dylan Craven, Patrick Weigelt, Hanno Seebens, Marten Winter, Holger Kreft, Wayne Dawson, Franz Essl, Mark van Kleunen, Jan Pergl, Petr Pyšek, James Space, Philip Thomas, Tiffany Knight
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Wohlwend MR, Craven D, Weigelt P, Seebens H, Winter M, Kreft H, Dawson W, Essl F, van Kleunen M, Pergl J, Pyšek P, Space J, Thomas P, Knight T (2021) Data Descriptor: Pacific Introduced Flora (PaciFLora). Biodiversity Data Journal 9: e67318. https://doi.org/10.3897/BDJ.9.e67318
|
The Pacific Region has the highest density of naturalised plant species worldwide, which makes it an important area for research on the ecology, evolution and biogeography of biological invasions. While different data sources on naturalised plant species exist for the Pacific, there is no taxonomically and spatially harmonised database available for different subsets of species and islands. A comprehensive, accessible database containing the distribution of naturalised vascular plant species in the Pacific will enable new basic and applied research for researchers and will be an important information source for practitioners working in the Region.
Here, we present PacIFlora, an updated and taxonomically standardised list of naturalised species, their unified nativeness, cultivation and invasive status and their distribution across the Pacific Ocean, including harmonised location denoination. This list is based on the two largest databases on naturalised plants for the Region, specifically the Pacific Island Ecosystems at Risk (PIER) and the Global Naturalised Alien Flora (GloNAF) databases. We provide an outlook for how this database can contribute to numerous research questions and conservation efforts.
Island Biogeography, naturalised species, Pacific Ocean, plant invasion, species database
The Pacific Ocean covers a large area and contains over 25,000 islands (
To address research questions in an objective and accessible way, databases are required that contain occurrences (presences) of naturalised plant species and harmonised region information that span the whole Pacific Region, while also being interoperable with other databases (e.g. origin, BIEN, TRY and GIFT;
Here, we present PacIFlora, a consolidated database on naturalised plant species on Pacific islands which overcomes the challenges posed by combining two large databases. By merging, harmonising and standardising information on naturalised species on Pacific islands from GloNAF and PIER, we created this new database reporting the presence of naturalised plant species on each island or island group. We also categorised the islands with available data into sociogeographic groups, as this is useful for many invasion science research questions (Fig.
Map showing the boundaries of island groups in the Pacific used in this database. Underlying map: World Coastline for R, based on data from Natural Earth. Polygons surrounding island groups are designed to include all islands in the group using straight lines and, thus, these lines do not correspond to any political border.
We structured our data and R code in a way that makes PacIFlora easy to combine with other databases. We also provide our R code to facilitate the integration of additional data, in case, for example, a user of our database wants to focus on a smaller part of the Pacific Region or integrate additional data.
Accepted plant species scientific names were identified using the recently-published Leipzig Catalogue of Vascular Plants using the original names (LCVP,
Total number of records in categories of native status. Nativeness Score indicates naturalisation certainty, i.e. 0 indicates records that are certainly naturalised and 1 indicates records that are certainly native, respectively. Intermediate values indicate uncertainty in the native status. See main text for detailed clarification.
In addition, we provide a phylogeny of the naturalised plant species in PacIFlora by pruning the comprehensive supertree by
This database can be used to address a wide variety of research questions and for management applications, for example, by combining it with different datasources on environmental drivers (
This dataset can be used for research on a wide variety of questions, including: (1) the study of patterns of richness and composition of naturalised plants in the Pacific and the roles of anthropogenic and biogeographic drivers (
This dataset represents a second step (after PIER and GloNAF) towards the development of a comprehensive list of the presence and status of naturalised plant species in the Pacific Region. Important next steps involve validating and updating these data in strong collaboration with local experts from each island group. For example, naturalised data exist for 488 of the > 25000 islands in the Pacific. It remains to be validated whether the remaining islands in the Pacific really do not have established naturalised plant species (e.g. the numerous tiny atolls) or whether local information about naturalised plant presence was not included in the two region-spanning sources and, thus, in PacIFlora. We hope that PacIFlora can serve as a foundation for local organisations in the Pacific that can be updated and extended in the future. The authors provide their full support for the application, validation and extension of PacIFlora. Main contact persons for this are Michael Wohlwend (application), Mark van Kleunen (validation and extension, GloNAF) and Philip Thomas (validation and extension, PIER).
We note that the results in Wohlwend et al. (2021) used a subset of the records in PacIFlora (e.g. excluding all cultivated records for most analyses) and considered data aggregated by island group. PacIFlora aims to provide more comprehensive resources that can be used for other purposes than those that were the focus of Wohlwend et al. (2021), but information presented in this publication can give insights into the data.
To create a matrix of species presences on islands and island groups, we used raw data from GloNAF version 1.1 and raw data from PIER (updated 2 June 2018). Both PIER and GloNAF list their sources for all records of a naturalised species on an island.
We harmonised species names using the LCVP (
All calculations were performed and graphs were created using R (version 4.0.3,
PacIFlora includes the following columns: ID, Species, Island, Island group, Family, Order, Native, Cultivated, Source, Original_Name, Invasion. ID provides a unique number. Species is the accepted name of the species based on the LCVP. A total of 3963 species were recorded. Island is the location where the species is present. Islands often have many names or different spellings. We used web research and other information in our sources to identify synonyms and chose one name from the sources (usually the most commonly used name). Island refers to the smallest available unit of reference and is, therefore, sometimes used for several unnamed islets of an atoll in close proximity which are not or only sometimes (tidally) connected by land. A total of 488 islands where recorded. IslandGroup is a group name assigned by us. The inclusion of island groupings is useful for many types of research questions, as a complete species list at a broader spatial scale decreases problems of data deficiency for individual islands in an archipelago. Our groups are largely based on political borders, such as municipalities or states. If political borders did not reflect geographic borders, we used distance between islands and ocean trenches to assign each island to one of 50 island groups. Island group aggregation is visualised in Fig.
Table 1: Most frequent sources used in PacIFlora. We note that some of the sources might provide overlapping information on naturalised plant occurrences. Records refers to species × island occurrences. Sources are sorted in decreasing order, based on the number of records each source provide. The names of the three island groups with the most records from each source are displayed.
Source |
Records |
Most recorded island groups |
|
9126 |
New Caledonia, Clipperton |
|
5104 |
Hawaiian, NW Hawaiian |
|
4119 |
Society, Marquesas, Tubuai |
|
2780 |
Mariana, Northern Palau, Yap |
|
2600 |
Hawaiian, NW Hawaiian, Solomon_Bismark |
|
1982 |
Mariana |
|
1601 |
Galapagos, Solomon_Bismark |
|
1600 |
Society, Marquesas, Tubuai |
|
1467 |
Mariana, Northern Palau, Chuuk |
|
1170 |
Southern Cook, Northern Cook, Chuuk |
|
1076 |
Southern Cook, Northern Cook |
|
1054 |
New Caledonia, Loyaute, Hunter and Matthew |
|
1042 |
Marquesas |
|
1020 |
Marquesas |
|
881 |
Northern Palau, Southern Palau |
|
781 |
Society, Bass, Tuamotu |
|
775 |
Galapagos |
|
635 |
Tongatapu, Nauru, Easter |
|
614 |
Samoa, Tokelau |
|
599 |
Society Marquesas, Tuamotu |
|
574 |
Solomon_Bismark, New Caledonia, New Hebrides |
|
519 |
Tongatapu, Vava'u, Ha'apai |
We present our database in three formats:
(1) PacIFlora - Full list format of all records (species x island, including records with no information on island, but just island group level and records that could not be identified by the LCVP), Suppl. material
Additionally, you can find the following files:
(2) An island × species matrix, excluding records that were missing information. Specifically, this list does not include data that have no island information or species that could not be identified by the LCVP (Dryad only).
(3) An aggregated island group × species matrix (Dryad only).
(4) A table to access reference IDs, Suppl. material 3.
(5) List format of PacIFLora on island level, excluding all records with no information on island, but just island group level and records that could not be identified by the LCVP (Dryad only).
(6) List format of PacIFLora on island group level, excluding all records that could not be identified by the LCVP and providing aggregated values for naturalisation, cultivation and invasion status (Dryad only).
Both data matrices (2 and 3) are included to provide an easy-to-use format for research and conservation applications. When using these matrices, be aware that they include all records (e.g. including cultivated species for some records). We provide our full R code used for aggregation, starting from GloNAF and PIER raw data, which allows, for example, for the generation of personalised subsets.
To create a phylogeny for the naturalised plant species in the Pacific, we pruned the supertree by
Our R code allows a complete workflow from the publicly available PIER and GloNAF data to the final species x island matrix. All codes used to unify and aggregate the data are provided in the R programming language and is open access via github (https://github.com/MichaelWohlwend42/PacIFlora.git). We provide code to merge new data with PacIFlora in a standardised manner, using universally applicable harmonisation functions for species and islands. The data underpinning the analysis reported in this paper are deposited in the Dryad Data Repository at https://datadryad.org/stash/dataset/doi: 10.5061/dryad.qfttdz0hd, as well as partly in the supplementary material and on the abovementioned GitHub Page.
PaciFlora covers all islands with available data in the Pacific Ocean. Only oceanic islands between 40°N and 40°S are included as our focus was on (sub-)tropical islands. Larger landmasses, such as Japan, New Zealand the Philippines and Papua New-Guinea, as well as all islands on the Japanese, Pacific American or Australian coasts, were excluded. So, this database focuses on (sub-)tropical islands that are isolated from larger landmasses.
Column label | Column description |
---|---|
Species | Simple species name. |
Island | island name. |
IslandGroup | assigned island group name. |
Native | Standardised native score ranging from 0 (unanimously described as not native) to 1 (unanimously described as native) in given location. |
Cultivated | Standardised cultivation score ranging from 0 (unanimously described as not cultivated) to 1 (unanimously described as cultivated) in given location. |
Family | plant family. |
Order | plant order. |
Database | Origin GloNAF, PIER or both (glonpier). |
Invasive | Standardised invasive score ranging from 0 (unanimously described as not invasive) to 1 (unanimously described as invasive) in given location. |
Orginal_name | pre-harmonisation species name. |
Latitude | latitude of island (mercator). |
Longitude | longitude of island (mercator). |
Source | Literature cited for this entry in the raw data. |
Source_ID | Full list of references provided for this occurrence, which can be referenced using the attached list. |
Funding was provided by the Alexander von Humboldt Foundation in the framework of the Alexander von Humboldt Professorship, by the Helmholtz Recruitment Initiative of the Helmholtz Association (both to TMK) and by the German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, funded by the German Research Foundation (DFG) – FZT 118 – 202548816. This research was co-funded through the 2017-2018 Belmont Forum and BiodivERsA joint call for research proposals, under the BiodivScen ERA-Net COFUND programme and with the funding organisations BMBF (to HS) via the project ‘Alien Scenarios‘ (BMBF grant 01LC1807A). PP and JP were supported by EXPRO grant no. 19-28807X (Czech Science Foundation) and long-term research development project RVO 67985939 (Czech Academy of Sciences). FE appreciates funding by the Austrian Science Foundation (FWF project no I 2086-B16).
Complete Datasbase in list form, as it will be uploaded to zenodo and GitHub.
Allows for linking additional source ids to the actual source