Biodiversity Data Journal :
Methods
|
Corresponding author: Andrew R Deans (adeans@psu.edu)
Academic editor: Daniel Mietchen
Received: 29 May 2024 | Accepted: 18 Aug 2024 | Published: 26 Aug 2024
© 2024 Andrew Deans, Louis Nastasi, Charles Davis
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Deans A, Nastasi L, Davis C (2024) GallOnt: An ontology for plant gall phenotypes. Biodiversity Data Journal 12: e128585. https://doi.org/10.3897/BDJ.12.e128585
|
|
Galls are novel plant structures that develop in response to select biotic stressors. These structures, extended phenotypes of the inducer, usually serve to protect and feed the inducer or its progeny. This life history strategy has evolved dozens of times, and tens of thousands of species — including many bacteria, fungi, nematodes, mites and insects — are capable of manipulating plants in this way. The variation in gall phenotypes is extraordinary across species but usually predictable for each species of inducer. We introduce here a new ontology, GallOnt, that facilitates consistent descriptions and the semantic representation of and reasoning over plant gall phenotype data. GallOnt was largely developed from ontologies in the Open Biological and Biomedical Ontology (OBO) Foundry and stands to connect plant gall phenotypes to knowledge derived from model plant systems, including genotype-phenotype and agricultural research. We also introduce the idea of a new gall data standard — Minimum Information for the Description of Galls (MIDG version 0.1) — as a starting point for discussions regarding cecidology best practices.
phenotype, plants, zoocecidium, morphology, cecidology, Cecidomyiidae, Cynipidae, Diplolepididae, Cynipoidea
The ability to induce galls on plants has evolved independently many times across the phylogeny of Life, with evidence of this kind of interaction dating back at least to the Middle Devonian (385 million years ago;
Some of the galls formed by plants in response to manipulation by gall wasps (Hymenoptera, Cynipoidea). Clockwise from the spiny gall at the top: Diplolepis bicolor leaf galls on Rosa sp.; Holocynips maxima "root" gall on Quercus montana; Melikaiella tumifica mid-rib gall on Quercus rubra; undescribed mid-rib gall cluster on Quercus montana; Andricus quercusstrobilanus bud gall cluster on Quercus bicolor; Callirhytis quercusoperator catkin gall on Quercus rubra; Callirhytis piperoideas mid-rib gall cluster on Quercus velutina; Andricus chinquapin stalked leaf gall on Quercus alba; (middle) Disholcaspis quercusmamma gall cluster on Quercus bicolor stem. All photos (CC BY 4.0) by Andrew R. Deans.
We provide here another resource, a structured glossary of gall phenotype terms, that facilitates natural language and semantic descriptions of gall forms and can forge connections to knowledge generated through model organism research. As proposed in other systems (
Through this paper, we aim to announce this new ontology, which was developed using Open Biological and Biomedical Ontology (OBO) Foundary principles (
The development process largely transitioned through three main phases: (1) glossary development, i.e. gathering relevant concepts and terms, with community input; (2) structuring the glossary as an ontology, using established best practices; and (3) demonstrating the computational nature of the ontology, by reasoning over phenotype data.
Given the modest size of the lexicon used to describe gall morphology, we opted for a fully manual assembly of the ontology. We extracted gall terms from original descriptions catalogued by
Community recommendations were incorporated into the ontology (see next section) or set aside for future consideration (i.e. documented as issues in the GitHub repository; https://github.com/adeans/gallont/issues).
We used the Ontology Development Kit (ODK) tool set and protocol (
We extracted gall phenotypes from natural language descriptions of galls that are induced on white oak, Quercus alba Linnaeus, 1753, as listed at the Gallformers website (https://gallformers.org). For example,
Examples of gall traits of Zopheroteras cuneatum Weld, 1944 as originally described by the author (column 1), translated into semantic statements that are composed from multiple ontologies (column 2). Weld (1944) describes the spectrum of phenotypes he observed (column 1), but we are generating an example dataset of instances, in this case, mock specimens. See notes (column 3) for explanation.
original description | phenotype | notes |
---|---|---|
red | 'has quality' some red | |
3.0–4.4 mm long | hasHeightInMm 4.4 | We are generating a mock dataset comprised of individuals; therefore, we chose to represent just the maximum value from the range, for the "specimen" that represents this taxon. |
attached to the very base of the petiole | 'part of' some petiole | |
in fall | 'has seasonal maturity' value autumn | 'has seasonal maturity' is an object property created for GallOnt |
conical | 'has quality' some conical | |
1.5 mm broad at base | hasDiameterInMm 1.5 | hasDiameterInMm and hasHeightInMm are data properties created for GallOnt |
Host.—Quercus alba | 'has host' some 'Quercus alba' | Taxon names mostly come from the NCBI Taxonomy |
'host of' some 'Zopheroteras cuneatum (agamic)' | 'Zopheroteras cuneatum (agamic)' was created as a child of Zopheroteras cuneatum and designated as an asexual organism | |
'plant gall' | to declare that this is a gall |
Zopheroteras cuneatum gall on white oak (Quercus alba), from an observation in iNaturalist (https://www.inaturalist.org/observations/96124447) by Jeff Clark (CC BY 4.0)
The final version of the gall glossary yielded 136 concepts (classes) that are directly relevant to gall morphology. We assigned each class a primary label, sometimes called a "preferred term" (
The size and complexity of the resulting ontology, version 2024-04-19, is reported in Table
Axiom | 6,042 |
Logical axiom count | 973 |
Declaration axioms count | 666 |
Class count | 394 |
Object property count | 132 |
Data property count | 3 |
Individual count | 13 |
Annotation Property count | 128 |
The reasoning dataset was created as a separate test ontology (see Suppl. material
Descriptive Logic (DL) Queries. Examples of some biological questions that could be answered through queries of semantic plant gall phenotype data. Rows highlighted in yellow reveal the incompleteness of the original descriptions, in that there are 67 galls in the dataset, but only 33 have data regarding the number of larvae inside (i.e. whether they are mono- vs. polythalamous). *Note that the results in Protégé yield counts and also every individual that matches the query; only counts are listed here. **ELK does not support queries of data properties, but a query using another reasoner would return results; unfortunately, our ontology is too complex for the other reasoners in Protégé at this time.
Question | DL Query | Result* |
---|---|---|
How many plant galls are in this dataset? | 'plant gall' | 67 |
How many of these galls occur on Quercus alba? | 'plant gall' and 'has host' some 'Quercus alba' | 65 |
How many galls occur on an inflorescence? | 'plant gall' and 'part of' some inflorescence | 1 |
How many galls occur on stems and branches? | 'plant gall' and 'part of' some 'shoot axis' | 15 |
How many galls occur on leaves? | 'plant gall' and 'part of' some leaf | 30 |
How many galls occur on leaf veins? | 'plant gall' and 'part of' some 'leaf vein' | 9 |
How many galls occur on leaf mid-veins? | 'plant gall' and 'part of' some 'leaf midvein' | 3 |
How many galls occur on the roots? | 'plant gall' and 'part of' some 'root system' | 1 |
How many galls fall to the ground when mature? | 'plant gall' and 'has quality' some 'deciduous (generic)' | 13 |
How many galls are fully integrated into plant tissue? | 'plant gall' and 'has quality' some non-deciduous | 7 |
How many galls have a kapéllo? | 'plant gall' and 'has part' some kapéllo | 0 |
How many galls are monothalamous? | 'plant gall' and 'has quality' some monothalamous | 22 |
How many galls are polythalamous? | 'plant gall' and 'has quality' some polythalamous | 11 |
How many galls mature in autumn and have an asexual occupant? | 'has seasonal maturity' value autumn and 'host of' some 'asexual organism' | 16 |
How many galls mature in autumn and have a sexual occupant? | 'has seasonal maturity' value autumn and 'host of' some 'gonochoristic organism' | 0 |
How many galls are induced on buds by asexual organisms? | 'induced by' some 'asexual organism' and ('part of' some bud) | 4 |
How many galls are induced on buds by sexual organisms? | 'induced by' some 'gonochoristic organism' and ('part of' some bud) | 0 |
How many galls have a diameter greater than 5 mm? | 'plant gall' and hasDiameterInMm some xsd:integer [>5] | 19** |
We demonstrate above and in the supplementary files how this ontology can be used to compose semantic phenotypes for plant galls. The ontology also stands as a controlled vocabulary for describing gall morphology. We recommend using a character:character state or entity:quality format, for example in a spreadsheet or as rendered in many contemporary taxonomic descriptions (e.g. see species descriptions in
Our small survey of gall descriptions also revealed inconsistencies and incompleteness of the phenotypes that were represented, making it difficult to confidently query published knowledge of galls. For example, only 33 descriptions out of 67 reported whether the gall was mono- or polythalamous (highlighted rows in Table
GallOnt likely is not yet sufficient for describing every type of gall nor all the complex phenotypes exhibited in these structures. For example, the ontology does not yet include many classes needed to represent histological traits (tissue qualities) nor temporal changes in gall morphology (i.e. stages of gall development). During the initial development, we also focused almost exclusively on phenotypes expressed in galls found in North America, as induced by cynipoid wasps. Expanding the ontology to cover galls induced by thrips (Thysanoptera), mites (especially Eriophyidae), gall midges and other flies (Diptera) and non-arthropod inducers, in other parts of the world, will undoubtedly require additional classes and new versions of MIDG. New classes and other modifications to the ontology can be proposed using the GitHub issue tracker linked above and ongoing development in relevant ontologies (see below) will likely facilitate more sophisticated representation of gall phenotypes.
GallOnt can be used in generating Gene Ontology annotation files (GAFs;
We provide here the first ontology designed to represent plant gall phenotypes. The ontology was developed with the latest tools, to be extensible, accessible and persistent, and we demonstrate its utility as a reasoning tool. We invite potential users and contributors to view the documentation at the GallOnt site (https://adeans.github.io/gallont/), to provide new terms and other refinements and to offer additional use cases.
Jim Balhoff, Laurel Cooper, Pankaj Jaiswal and members of the OBO community more broadly assisted with troubleshooting and encouragement. Adam Kranz, Heather Hines, Denise Montelongo, John Tooker, Antoine Guiguet, Antonio Casadei, Cecil Smith and Laura Porturas provided invaluable input regarding terminology, unusual gall phenotypes and future applications of this resource.
This file contains the plant gall ontology and numerous individuals, each of which represents a gall on white oak (Quercus sect. Quercus, mostly on Q. alba). This file was used to demonstrate reasoning over phenotype data, using descriptive language (DL) queries, as implemented with ELK reasoner (version 0.4.3) in the software application Protégé (version 5.6.3).
This spreadsheet contains natural language descriptions of all described galls on white oak (Quercus alba). The descriptions are parsed into individual characters and links (URIs) are provided for each source.