Biodiversity Data Journal :
Software Description
|
Corresponding author: Edward Baker (edwbaker@gmail.com)
Academic editor: Anne Thessen
Received: 03 Oct 2019 | Accepted: 21 Nov 2019 | Published: 27 Nov 2019
© 2019 Edward Baker, Steen Dupont, Vincent Smith
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Baker E, Dupont S, Smith VS (2019) Ecological interactions in the Scratchpads virtual research environment. Biodiversity Data Journal 7: e47043. https://doi.org/10.3897/BDJ.7.e47043
|
The Natural History Museum, London has a number of online databases that describe interactions between species, including the HOSTS database of lepidopteran host plants (
This paper describes the implementation within the Scratchpads VRE of a new ecological interactions module that is capable of handling the needs of these projects, while at the same time is flexible to handle the needs of future projects with different data sources.
ecological informatics, biodiversity informatics, ecological interactions
In order to understand life on Earth, it is essential to understand not only the distribution and traits of species, but how they interact with each other. Biodiversity informatics as a discipline has created global infrastructures for both species distribution (GBIF) and trait (TraitBank) data. Global Biotic Interactions (GloBI:
Ecological interaction datasets are, like natural history specimens, fragmented and widely distributed. They can be found scattered through scientific literature and specimen labels in museums. Numerous publications have synthesised interaction datasets for taxonomic groups (e.g. cockroaches
The need for biodiversity informatics to address species interactions was set as a challenge by
The existing datasets to be migrated were all of a similar format: two species had been documented interacting in a bibliographic reference, often at a specified location. While this model formed the basis of the implementation, we expanded this model to allow for interactions that are recorded from museum specimens (for an example see
The introduction of non-native species into an ecosystem may result in novel interactions behaviour. For example, the accidental introduction of the stick insect Carausius morosus (Sinéty, 1901) into the San Diego area means it has been found on many plants it would not have encountered in its native habitat in southern India (
Organism status (presence status; aligned to DarwinCore establishmentMeans).
Status | Description |
---|---|
Native | The organism either evolved in this region or arrived by non-anthropogenic means. |
Naturalised | The organism reproduces naturally and forms part of the local ecology. |
Introduced | The organism arrived in the region via an anthropogenic mechanism or mechanisms. |
Invasive | The organism is having a deleterious impact on another organism, multiple organisms or the ecosystem as a whole. |
Captivity | The organism is kept in captivity. |
Managed | The organism maintains its presence through intentional cultivation or husbandry. |
Another example of potentially confusing data for phasmids is the numerous food plants that are successfully used to rear these species in captivity (e.g.
Status | Description |
Interaction recorded in the wild | |
Interaction recorded in captivity | Used when the status of the specimens is uncertain |
Interaction recorded in captivity from wild caught specimens | |
Interaction recorded in captivity from captive bred specimens |
In addition, some interactions have significant importance, such as the defoliation of food and timber crops by stick insects (
Status | Description |
Economic | The interaction has financial impact for human society |
Economic (crop pest) | The interaction is damaging to food crops |
Economic (timber pest) | The interaction is damaging to timber |
Economic (pest control) | The interaction helps to control a pest species |
Medical | The interaction has medical important consequences on humans |
Veterinary | The interaction has medical important consequences on animals |
The migrated hosts data (
The above examples are limited to the Phasmida, a small order of insects with less than 4,000 valid species (
The Phthiraptera Scratchpad (http://phthiraptera.info/) documents approximately 12,000 interactions between parasitic lice (Subinfraorder Phthiraptera) and their mammal and bird hosts. Multiple mammal classifications are used, reflecting the fact that different authorities have used different host classifications when compiling checklists for blood sucking lice (superfamily Anoplura) and chewing lice (superfamilies Amblycera, Ischnocera and Rhynchophthirina). This extensive database underpins a significant body of research on parasitic lice, which is used as a model to study co-speciation.
The Scratchpads project was conceived and developed as part of a much wider portfolio of biodiversity informatics platforms and systems, so from the outset, the system described here was designed to operate with the Global Biotic Interactions project (GloBI;
Funding was provided by the Natural History Museum to employ EB during part of 2016.
The code developed for this project is, like the rest of the Scratchpads project, released under the GNU General Public License v2.0. Scratchpad users have fine-grain control over the licence applied to each piece of content on their site, but the project encourages the use of open data licences following
The implementation creates a new content type (Drupal: node type) for ecological interactions. Each interaction (a Drupal node) has a unique URL and identifier (UUID). The UUID uniquely identifies each interaction and should persist if the dataset is aggregated, as a means to trace the provenance of the data in the source dataset (e.g. to correct errors or add annotation). The Scratchpads enivronment has defined content types for bibliographic references, specimens and observations and locations, as well as tools for manipulating biological classification. The new ecological interactions content type links to these existing content types and classifications as shown in Fig.
The Relations Ontology (RO; https://www.ebi.ac.uk/ols/ontologies/ro) provides several ontology terms for describing inter-species interactions, which have been adopted by the module described here. The system presents a human-readable description (e.g. "eats") to the user, but additionally stores the relevant URI from the RO. In addition, the reverse URI is also stored (e.g. "eaten by" is the reverse of "eat"). At present, this functionality is not used, but in the future, this will allow for more advanced searches. These terms are stored in a Drupal taxonomy, allowing terms to be stored in a hierarchy (i.e. "ectoparasite of" is a child term of "parasite of"; Fig.
To maintain visual coherence with other Scratchpads features and for increased search speed, the main ecological interactions page (showing all interactions; Fig.
A text file suitable for ingest by GloBI is found at /interactions.txt on Scratchpad sites where the module is enabled (e.g. http://phthiraptera.info/interactions.txt). This file can be harvested by external aggregators.
The NHM is undertaking work to combine the output of several Scratchpad sites, as well as other sources, to create an institutional 'interactions bank' that will provide a unified entry point for these disparate interactions datasets. The NHM Data Portal is used by NHM staff to publish their research datasets, whereas Scratchpads can be used by both NHM and external researchers. For this reason, it is currently possible to contribute to GloBI directly from an individual Scratchpad and NHM-affiliated researchers, in future, will be able to contribute via the interactions bank.
An example of a Scratchpad hosted project, that is starting to adopt the ecological interactions module, is the BioAcoustica database (
The interactions of species with the human environment is also not yet properly covered.
The data model we have developed is based upon the needs of the initial projects migrated and there is scope for future additions to accommodate additional needs (e.g. recording the date and time of observed interactions using DarwinCore eventTime).
There is great potential for the biodiversity informatics community to adopt or develop controlled vocabularies.
Jorrit Poelen has provided advice at all stages of the project. In addition, we would like to thank the authors and compilers of the projects that have been migrated.
The project to move existing databases into modern infrastructures was overseen by VS. EB researched the requirements and implemented the Scratchpad module. SD has oversight of the HOSTS project and advised on requirements during development.