Computable species descriptions and nanopublications: applying ontology-based technologies to dung beetles (Coleoptera, Scarabaeinae)

Giulio Montanaro; James Balhoff; Jennifer Girón; Max Söderholm; Sergei Tarasov

doi:10.3897/BDJ.12.e121562

Biodiversity Data Journal : Taxonomy & Inventories

Giulio Montanaro^‡, James P. Balhoff^§, Jennifer C. Girón^|, Max Söderholm^‡, Sergei Tarasov^‡

‡ Finnish Museum of Natural History, University of Helsinki, Helsinki, Finland

§ RENCI, University of North Carolina, Chapel Hill, North Carolina, United States of America

| Museum of Texas Tech University, Texas, United States of America

Corresponding author: Giulio Montanaro (giuliomontanaro98@gmail.com), Sergei Tarasov (sergei.tarasov@helsinki.fi)

Academic editor: Panakkool Thamban Aneesh

Received: 23 Feb 2024 | Accepted: 22 May 2024 | Published: 13 Jun 2024

This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Citation: Montanaro G, Balhoff JP, Girón JC, Söderholm M, Tarasov S (2024) Computable species descriptions and nanopublications: applying ontology-based technologies to dung beetles (Coleoptera, Scarabaeinae). Biodiversity Data Journal 12: e121562. https://doi.org/10.3897/BDJ.12.e121562

Abstract

Background

Taxonomy has long struggled with analysing vast amounts of phenotypic data due to computational and accessibility challenges. Ontology-based technologies provide a framework for modelling semantic phenotypes that are understandable by computers and compliant with FAIR principles. In this paper, we explore the use of Phenoscript, an emerging language designed for creating semantic phenotypes, to produce computable species descriptions. Our case study centers on the application of this approach to dung beetles (Coleoptera, Scarabaeinae).

New information

We illustrate the effectiveness of Phenoscript for creating semantic phenotypes. We also demonstrate the ability of the Phenospy Python package to automatically translate Phenoscript descriptions into natural language (NL), which eliminates the need for writing traditional NL descriptions. We introduce a computational pipeline that streamlines the generation of semantic descriptions and their conversion to NL. To demonstrate the power of the semantic approach, we apply simple semantic queries to the generated phenotypic descriptions. This paper addresses the current challenges in crafting semantic species descriptions and outlines the path towards future improvements. Furthermore, we discuss the promising integration of semantic phenotypes and nanopublications, as emerging methods for sharing scientific information. Overall, our study highlights the pivotal role of ontology-based technologies in modernising taxonomy and aligning it with the evolving landscape of big data analysis and FAIR principles.

Keywords

Phenoscript, taxonomy, semantic data, phenotypic traits, characters, morphology, Grebennikovius, microCT

Introduction

Taxonomists have produced vast amounts of phenotypic data through species descriptions published in numerous papers and monographs. Yet, scientists outside taxonomy largely under-utilise this resource because it is challenging to comprehend these data and analyse them computationally (Deans et al. 2012). Traditionally written in natural language (NL), species descriptions are largely inaccessible for computer-based analysis, impeding phenomic research in biology and rendering the data non-compliant with FAIR (Findable, Accessible, Interoperable, Reusable) principles (Wilkinson et al. 2016).

Ontology-based technologies have emerged as a promising solution to this challenge (Deans et al. 2015). They can be used to model phenotypic data as computable and interoperable units known as semantic phenotypes (Balhoff et al. 2013), thus unlocking their potential for phenomic-level studies. A remarkable advancement in this area has been made by the Phenoscape project (Edmunds et al. 2015), which has established key standards (e.g. see the Guide to Character Annotation) and protocols for the integration of ontological annotations with evolutionary phenotypes (Dahdul et al. 2010, Mullins et al. 2012, Balhoff et al. 2013, Dahdul et al. 2018). A notable output of this project is the software called Phenex (Balhoff et al. 2010), which facilitates the annotation of character matrices with ontology terms.

However, ontology-driven modelling of species descriptions remains challenging due to their more flexible nature compared with character matrices for phylogenetic analyses. Furthermore, previous semantic approaches to phenotypes were mostly class-based, with phenotypic statements expressed as ontology classes (Balhoff et al. 2013). These approaches are also referred to as TBox (Terminological Box) approaches, since the TBox represents the structural framework of an ontology and includes logical definitions and a hierarchy of classes. The class-based approaches present their own set of challenges, such as complex semantics, limited expressiveness and results that are difficult for humans to interpret (Vogt 2021). An alternative, the individual-based approach, in which phenotypic statements are expressed as ontology individuals, seems to be more intuitive and effective (Vogt 2021). It is also called the ABox (Assertional Box) approach because an ontology's ABox includes the actual data, namely assertions about individuals (instances) of the classes defined in the TBox. The individual-based approach conceptualises the description of phenotypes as the construction of knowledge graphs, where nodes represent anatomical structures and their metadata or characteristics and edges represent the relationships between them.
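The distinction between the two boxes can be made concrete with a minimal Python sketch. It assumes a toy vocabulary rather than real ontology identifiers: the TBox is held as a class hierarchy and the ABox as triples about individuals. All names below (`TBOX_SUBCLASS_OF`, `ABOX`, the `ind:` identifiers and both helper functions) are illustrative assumptions and are not part of Phenoscript or Phenospy.

```python
# TBox: terminological knowledge (class hierarchy); toy labels, not real IRIs.
TBOX_SUBCLASS_OF = {
    "red": "colour",
    "colour": "quality",
    "pronotum": "anatomical structure",
}

# ABox: assertions about individuals, stored as (subject, predicate, object).
ABOX = [
    ("ind:organism_1", "rdf:type", "organism"),
    ("ind:pronotum_1", "rdf:type", "pronotum"),
    ("ind:red_1", "rdf:type", "red"),
    ("ind:organism_1", "has_part", "ind:pronotum_1"),
    ("ind:pronotum_1", "has_characteristic", "ind:red_1"),
]


def superclasses(cls):
    """Return all ancestors of a class by walking the TBox hierarchy."""
    found = []
    while cls in TBOX_SUBCLASS_OF:
        cls = TBOX_SUBCLASS_OF[cls]
        found.append(cls)
    return found


def neighbours(node):
    """Return outgoing (edge, target) pairs of an individual, excluding typing."""
    return [(p, o) for s, p, o in ABOX if s == node and p != "rdf:type"]
```

In this picture, the phenotype "pronotum red" is a small path in the ABox graph (organism, pronotum, red), while the TBox supplies the knowledge that any individual typed as "red" is also a "colour".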

In this paper, we aim to explore the utility of an individual-based approach by semantically describing four species of dung beetles from the genus Grebennikovius as a case study (Montanaro et al. 2024). In order to demonstrate the power of ontology-based technologies, we provide examples of simple queries to automatically retrieve phenotypic data (Balhoff et al. 2013). We also strive to integrate semantic species descriptions with the concept of nanopublications (Groth et al. 2010, Kuhn and Dumontier 2017, Kuhn et al. 2021), which encapsulates discrete pieces of information into a comprehensive knowledge graph. Data in this graph are directly queriable by scientists, making it FAIR through a variety of semantic resources (Shefchek et al. 2019, Kuhn et al. 2021).

To accelerate the creation of semantic species descriptions, we apply Phenoscript, a newly-designed computer language. Phenoscript enables constructing knowledge graphs from textual code in the text editor VS Code, using the respective Phenoscript plugin. The Phenoscript code is then converted into the Web Ontology Language (OWL), a standard format for working with ontologies, allowing for computational comparisons and analyses of semantic data. This conversion is mediated by Phenospy, a Python package that also translates OWL phenotypes into annotated NL descriptions for publication and traditional scientific communication.

Phenoscript and Phenospy, still in development, are assessed in this study for their practicality and effectiveness in managing phenotypic data. This is the second paper in the series that tests Phenoscript (Mikó et al. 2021). We demonstrate that, with the proposed approach, scientists can bypass the need to write NL-based species descriptions entirely. Instead, they can initially code semantic descriptions in Phenoscript and then automatically translate them into NL using Phenospy. In the following sections, we discuss the advancements and challenges of using semantic species descriptions.

Materials and methods

Data availability

The data files and scripts necessary to reproduce the results of this study are available as Supplementary material that can be accessed either through Zenodo or via the Github repository.

Taxon Selection

For this proof-of-concept study, we selected the dung beetle genus Grebennikovius (Coleoptera, Scarabaeinae), recently revised by Montanaro et al. (2024) (Fig. 2a) and endemic to the Eastern Arc Mountains (Tanzania). The genus comprises four species, for which we have generated and analysed semantic descriptions.

Figure 1.

Workflow for processing semantic descriptions.

Figure 2.

Dorsal aspect and description of Grebennikovius species.

a: Clockwise, from top left corner: Grebennikovius basilewskyi, G. lupanganus, G. armiger, G. pafelo;
b: Screenshot of the description of Grebennikovius armiger using the PhenoScript plugin in VS Code.

Micro-CT imaging

To observe and illustrate morphological characters in great detail, we obtained micro-CT images of a specimen of Grebennikovius basilewskyi (Balthasar, 1960). Imaging was conducted at the Finnish Museum of Natural History LUOMUS (University of Helsinki) using a Nikon XT H 225 with the following settings: multi-metal target with molybdenum setting, 70–100 kV beam energy, 70–100 µA beam current, 1420 ms exposure time and 4476 projection images with four frames of averaging per projection. Detector binning was set to 1x1, gain to 24 dB and white target to 60k. The complete scan time was approximately seven hours and the resulting voxel size of the dataset was 2.998 µm. The volumetric dataset was reconstructed from the projection images using Nikon CT pro-3D Version XT 6.9.1 and exported to VGSTUDIO MAX 2023.2 (Volume Graphics GmbH, Heidelberg, Germany) in 16-bit format. Excess material was excluded from the dataset. The dataset was visualised using the volume renderer (Phong) and aligned correctly. Images of the sample were rendered in anterodorsal, dorsal, lateral, posterior and ventral views. The aedeagus of G. basilewskyi in Fig. 3b was modified from Montanaro et al. (2024).

Figure 3.

Visual guide to positional terminology; see the section "Anatomically consistent positional conventions" for details.

a: right hind leg of G. basilewskyi showing the positional terms referring to the dorsoventral axis of the beetle's body;
b: aedeagus of G. basilewskyi in lateral (left) and dorsal (right) views, clarifying the correct left-right (= lateral) axis.

Figure 4.

Grebennikovius basilewskyi in dorsal and ventral views. Numbers next to arrows indicate patterns of phenotype statements explained in the section "Phenoscript: main patterns of phenotype statements". Arrow numbers from T1 to T5 illustrate individual body parts.

a: dorsal view;
b: ventral view. Arrows T1–T5 show mesoventrite (T1), metaventrite (T2), hypomeron (T3), mesanepisternum (T4) and metanepisternum (T5).

Creating semantic phenotypes with Phenoscript

To describe species semantically, we employed the Phenoscript language powered by its dedicated plugin for VS Code (Fig. 2b). The primary purpose of Phenoscript and its plugin is to streamline the process of creating semantic descriptions. Although other tools can be used for this purpose, such as Protégé, a comprehensive, GUI-based ontology software or Turtle syntax for knowledge graph construction, these methods are much slower than Phenoscript. With the Phenoscript plugin, users can benefit from syntax highlighting and snippets. This makes it easier to select terms from predefined biological ontologies and write semantic statements.

For this study, we used the ontologies listed in Table 1. The process of creating semantic descriptions often requires the addition of new terms to existing ontologies, as was the case in our study, which is discussed in a separate section below.

Table 1.

Ontologies used in the species descriptions. For details, see the OBO Foundry repository https://obofoundry.org.

Ontology	URI	Description
Ontology for the Anatomy of the Insect SkeletoMuscular system (AISM)	http://purl.obolibrary.org/obo/aism.owl	General anatomy of insects, includes terms such as “pronotum”, “wing”.
Coleoptera Anatomy Ontology (COLAO)	http://purl.obolibrary.org/obo/colao.owl	Anatomy of Coleoptera, for example, “elytron”, “mesoventrite”.
Phenoscript Ontology (PHS)	Github	Phenoscript metadata, for example, "has trait", "OTU Block".
Phenotype And Trait Ontology (PATO)	http://purl.obolibrary.org/obo/pato.owl	Phenotypic qualities, for example, “red”, “convex”, “length”, “setose”.
Biological Spatial Ontology (BSPO)	http://purl.obolibrary.org/obo/bspo.owl	Spatial regions of anatomical parts, for example, “distal region”, “ventral side”.
Comparative Data Analysis Ontology (CDAO)	http://purl.obolibrary.org/obo/cdao.owl	Taxon metadata, for example, “TU” (taxonomic unit).
Information Artifact Ontology (IAO)	http://purl.obolibrary.org/obo/iao.owl	Information entities, for example, “denotes”.
Relation Ontology (RO)	http://purl.obolibrary.org/obo/ro.owl	Mostly relationships between anatomical parts and qualities, for example, “part of”, “has characteristic”.
Units of measurement ontology (UO)	http://purl.obolibrary.org/obo/uo.owl	Units of measurement, for example, "millimeter".
Biological Collections Ontology (BCO)	http://purl.obolibrary.org/obo/bco.owl	Darwin Core terms, for example, "catalogNumber", "TaxonID".
Uberon multi-species anatomy ontology (UBERON)	http://purl.obolibrary.org/obo/uberon/uberon-base.owl	General anatomy terms, for example, "female organism", "adult organism".
Taxonomic rank vocabulary (TAXRANK)	http://purl.obolibrary.org/obo/taxrank.owl	Taxonomic rank terms, for example, "species".

Writing in Phenoscript resembles composing natural language (NL) descriptions, although the language has its own distinct syntax. The language documentation and tutorials are available on the Phenoscript repository. The initial step typically involves setting up a YAML configuration file to specify author names, project title and the ontologies to be used. As a next step, Phenospy can generate snippets for the necessary ontology terms. Snippets, which are ontology terms or small blocks of Phenoscript code, can be selected from a drop-down menu that appears in the Phenoscript description upon typing the first letters. Once the snippets are ready, the user can begin coding semantic descriptions in VS Code using the Phenoscript plugin. For convenience, we present below an overview of the major character patterns used in describing species of Grebennikovius, both in NL and in Phenoscript (see the section "Phenoscript: main patterns of phenotype statements").

Once the Phenoscript description is complete, it can be processed and analysed as outlined in the pipeline shown in Fig. 1, with technical details provided in subsequent sections.

A pipeline for processing semantic descriptions

The pipeline (Fig. 1) consists of six steps outlined below, which are facilitated by a makefile tool (Supplementary Material). In a nutshell, a makefile automates the process of sequentially executing various programmes and commands. In our context, it automates the execution of different pipeline steps.

Step 1. Once the Phenoscript description is written as a Phenoscript file, it must be converted into OWL format using the Phenospy package, which provides the necessary command-line tools for this conversion. This creates the ABox component of the ontology for further processing.

Step 2. This stage involves validating the OWL file with SHACL (Shapes Constraint Language) to ensure that semantic data satisfy the requirements of the data models employed by the user. SHACL is a conventional tool for validating RDF graph patterns against a set of predefined criteria. As an example, in our context, these criteria require that all phenotypes are linked to species names and include the necessary metadata. We used the SHACL command-line interface provided by the Apache Jena framework. Proceed to the next step if validation succeeds. If it does not, return to the Phenoscript description and correct it.

Step 3. Make a TBox file by downloading and merging all the source ontologies used to create semantic descriptions. This step is automated using Phenospy and ROBOT (Jackson et al. 2019), a command-line tool for manipulating and working with biomedical ontologies.

Step 4. Perform ontology reasoning using the ABox (step 1) and TBox (step 3) files. This step is mediated by the materializer tool, which uses the whelk reasoner. Ontology reasoning refers to the process of deriving logical conclusions from a set of asserted facts or axioms within an ontology or knowledge graph. Reasoning is used to logically validate the ontology and infer the class membership of the individuals in the ABox.

Logical validation ensures that the ontology contains no contradictions in its structure, definitions or relationships between its entities. If this is the case, the ontology is referred to as "consistent". If the ontology is found to be inconsistent at this stage, it is most likely because there are logical errors within the semantic descriptions that need to be corrected. Additionally, class inference generates new data from the initial assertions, which can be used for downstream semantic queries. If the validations at steps 2 and 4 are successful, the user can proceed to the next stages.

Step 5. Using Phenospy, automatically generate the annotated NL description from the OWL file. See the section below for more information.

Step 6. Perform semantic queries to extract trait data from the descriptions. See the section below for more information.
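The control flow of the six steps, including the two validation gates at steps 2 and 4, can be sketched in Python as follows. This is only a schematic stand-in for the makefile: `run_pipeline` and the step list are hypothetical, and each lambda is a placeholder for a call to an external tool (Phenospy, the SHACL command-line interface, ROBOT, materializer), whose actual commands are not reproduced here.

```python
from typing import Callable, List, Tuple

Step = Tuple[str, Callable[[], bool]]


def run_pipeline(steps: List[Step]) -> List[str]:
    """Run steps in order and stop at the first step that reports failure."""
    completed = []
    for name, action in steps:
        if not action():
            # On a failed step, the user returns to the Phenoscript description.
            break
        completed.append(name)
    return completed


# Hypothetical placeholders: each lambda stands in for an external tool call.
steps = [
    ("1 convert Phenoscript to OWL", lambda: True),
    ("2 validate with SHACL", lambda: True),
    ("3 build TBox", lambda: True),
    ("4 reason and check consistency", lambda: False),  # simulated inconsistency
    ("5 generate NL description", lambda: True),
    ("6 run semantic queries", lambda: True),
]

completed = run_pipeline(steps)
```

With the simulated inconsistency at step 4, only the first three steps are completed and neither the NL description nor the queries are produced, which mirrors the requirement that both validations succeed before proceeding.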

Generating NL Species Descriptions

NL descriptions were created using Phenospy's algorithm which traverses the knowledge graph encoded in an OWL file and translates the graph patterns into NL. The algorithm searches for character patterns, such as, for example, the presence or absence of anatomical entities, their measurements and then translates them into human-readable NL text.

Generated NL descriptions consist of hierarchical trait statements that usually resemble entity-quality syntax (Washington et al. 2009, Balhoff et al. 2010). Typically, each statement begins with a sequence of locator terms that specify the trait's position (= entity) on the organismal body. Qualities or other phenotypic properties are specified following the locator terms. They can be listed on the same line if only one property is associated with the given locator or on several subsequent lines for multiple properties. In addition, multiple statements associated with the same body part may be nested within one another.
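The hierarchical layout described above can be illustrated with a small Python sketch that walks a toy knowledge graph and prints one indented line per locator, with its qualities on the same line. This is not Phenospy's algorithm; the function `render`, the dictionary-based graph and the labels are assumptions made purely for illustration.

```python
def render(node, graph, labels, depth=0):
    """Render a node and its sub-parts as indented 'locator: qualities' lines."""
    edges = graph.get(node, [])
    qualities = [labels[o] for p, o in edges if p == "has_characteristic"]
    line = "  " * depth + labels[node]
    if qualities:
        line += ": " + ", ".join(qualities)
    lines = [line]
    # Nested statements: parts of this node are rendered one level deeper.
    for p, o in edges:
        if p == "has_part":
            lines.extend(render(o, graph, labels, depth + 1))
    return lines


# Toy graph: node -> list of (edge, target); identifiers are illustrative.
graph = {
    "organism": [("has_part", "pronotum"), ("has_characteristic", "ovate")],
    "pronotum": [("has_part", "carina"), ("has_characteristic", "convex")],
}
labels = {
    "organism": "body",
    "pronotum": "pronotum",
    "carina": "lateral pronotal carina",
    "ovate": "ovate",
    "convex": "convex",
}

description = "\n".join(render("organism", graph, labels))
```

Here `description` contains three lines: "body: ovate", then "pronotum: convex" indented once, then "lateral pronotal carina" indented twice, showing how statements about the same body part are nested within one another.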

Querying Semantic Phenotypes

In order to demonstrate the ease with which phenotypic information can be retrieved from our descriptions, we employed two sets of semantic queries. The first set aimed to determine the number of individuals per species associated with an ontology class representing one of the following phenotypic characteristics: colour, shape, size and texture (Table 2, rows 1–4). To achieve this, we executed a SPARQL query using the ontology generated during step 4 of our pipeline. For instance, applying this query to the statement "head is red in G. armiger" would yield a single individual for G. armiger classified under the "colour" class. We utilised the SPARQL Notebook extension for VS Code to execute SPARQL queries, enabling the organisation of multiple queries within a single file and the annotation of queries using Markdown syntax.
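The logic of the first query set can be mimicked in plain Python, as a rough analogue of what the SPARQL query computes over the reasoned ontology. The sketch below uses toy data only: the species names "species A" and "species B", the individuals and the small class hierarchy are invented for illustration and do not correspond to the counts in Table 2.

```python
from collections import Counter

# Toy class hierarchy (child -> parent); labels stand in for PATO identifiers.
SUBCLASS_OF = {
    "red": "colour",
    "brown": "colour",
    "ovate": "shape",
    "convex": "shape",
}

# Toy ABox rows: (species, individual, asserted class). Invented data.
INDIVIDUALS = [
    ("species A", "ind:1", "red"),
    ("species A", "ind:2", "ovate"),
    ("species B", "ind:3", "brown"),
    ("species B", "ind:4", "convex"),
    ("species B", "ind:5", "ovate"),
]


def is_a(cls, target):
    """True if cls equals target or is a descendant of it in the hierarchy."""
    while cls is not None:
        if cls == target:
            return True
        cls = SUBCLASS_OF.get(cls)
    return False


def count_per_species(target):
    """Count individuals per species whose class falls under the target class."""
    return Counter(sp for sp, _, cls in INDIVIDUALS if is_a(cls, target))
```

Calling `count_per_species("colour")` on these toy data returns one individual for each species, just as the statement "head is red" contributes a single individual to the "colour" row.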

Table 2.

Results of the semantic queries.

Entities \ Species	G. armiger	G. basilewskyi	G. lupanganus	G. pafelo
1. colour (PATO:0000014)	3	5	3	3
2. shape (PATO:0000052)	32	25	28	34
3. size (PATO:0000117)	24	24	22	24
4. texture (PATO:0000150)	2	1	2	2
5. insect head (AISM:0000107) or its parts	23	23	20	20
6. insect thorax (AISM:0000108) or its parts	65	74	58	70
7. insect abdomen (AISM:0000109) or its parts	11	15	12	11
8. insect leg (AISM:0000031) or its parts	29	32	27	31

The second set of queries focused on determining the number of individuals associated with the major body parts: head, thorax, abdomen and leg (Table 2, rows 5–8). Within our context, this association signifies that individuals must belong to one of these four classes or be "part of" (BFO:0000050) one of them. For instance, a statement indicating "antennae are long in G. armiger" would result in one individual being associated with the class "head", as antennae are considered part of the "head". To facilitate these types of queries, we incorporated four custom classes into the merged ontology file generated during step 3 (i.e. the TBox file). These classes were logically defined using Manchester OWL syntax as follows: "X or (part_of some X)", where "X" represents each of the four body parts. By conducting ontology reasoning in step 4 of the pipeline, the individuals from the ABox were automatically classified into these custom classes. Subsequently, these custom classes were used in SPARQL in a manner analogous to the first set of queries.
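The effect of the class expression "X or (part_of some X)" can be approximated in a few lines of Python. The sketch assumes that "part of" is transitive, so that a part of a part of the head still counts, and it ignores subclass reasoning, which a real reasoner would also apply. The dictionaries, the `ind:` identifiers and the function name are hypothetical.

```python
# Toy typing and parthood assertions; identifiers are illustrative only.
TYPE_OF = {
    "ind:head": "insect head",
    "ind:antenna": "antenna",
    "ind:scape": "scape",
    "ind:elytron": "elytron",
}
PART_OF = {
    "ind:antenna": "ind:head",
    "ind:scape": "ind:antenna",
}


def in_custom_class(individual, x):
    """True if the individual is an X or is (transitively) part of some X."""
    current = individual
    while current is not None:
        if TYPE_OF.get(current) == x:
            return True
        # Follow the parthood chain upwards; stops when no parent is asserted.
        current = PART_OF.get(current)
    return False
```

In this toy graph the antenna and the scape are both classified under the custom head class, whereas the elytron, which has no parthood path to a head, is not.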

Adopting Insect (AISM) and Coleoptera (COLAO) Ontologies for Dung Beetle Phenotypes

We expanded the Insect Anatomy Ontology (AISM; Girón et al. (2024), Girón et al. (2023b)) and the Coleoptera Anatomy Ontology (COLAO; Girón et al. (2023a)) to include morphological terms necessary for the descriptions of Grebennikovius species. We created a total of 152 new terms in AISM and COLAO following the editing instructions available on the AISM GitHub repository.

New AISM terms (110 terms, v.2023-04-14 to v.2024-05-11) describe the more general insect body plan and most of them are applicable to a range of insect orders. These terms cover appendage subdivisions (specific tarsomeres, flagellomeres, palpomeres etc.), specific cuticular protrusions (protibial and clypeal teeth, carinae, etc.) and different types of cuticular punctures.

Punctures are particularly important characters in dung beetle taxonomy, especially at the interspecific level (d'Orbigny 1913, Davis et al. 2008). We recognised two main types of punctures: setigerous and non-setigerous. Each type is further subdivided into simple, ocellate and granulate subtypes. However, we currently avoid using the term “raspish” punctures, which are described as having a small, sharp granule-like protrusion (see “râpeuse” (e.g. d'Orbigny (1913)), “raspish” (e.g. Ziani et al. (2019)), “raspose” (e.g. Barbero et al. (2003)), “raspelartig” (e.g. Balthasar (1963))). This decision is based on the difficulty in distinctly separating them from granulate punctures.

Conversely, the new terms introduced in COLAO (42 terms, v.2023-03-30 to v.2024-02-14) are more specific to beetles, particularly to Scarabaeoidea and Scarabaeinae. These terms include descriptions of elytral striation, pronotal protrusions and depressions and genital structures, such as parameres and endophallites (Génier 2019).

Notably, we use the term “mesometaventral sulcus” instead of the more traditional “meso-metasternal suture”, since the use of meso- and metaventrite should be preferred over meso- and metasternum (Lawrence and Ślipiński 2013). Additionally, “pygidium” is substituted by the more general term “abdominal tergite VIII” (see also Cristóvão and Vaz-de-Mello (2021)).

Anatomically consistent positional conventions

Traditional positional terminology used in Scarabaeinae taxonomy sometimes does not reflect the true positional relationships between the insect body and its parts. For example, in most scarab beetles (Scarabaeoidea), legs are flattened anteroposteriorly and rotated forwards (fore legs) or backwards (middle and hind legs), making it more intuitive (but incorrect) to use dorsal and ventral instead of anterior and posterior when referring to their broad, flattened sides and the reverse for the narrow dorsal and ventral sides (see Fig. 3a). One of the most striking consequences of this reversed terminology is that the protibial margin that bears teeth in scarabs is often called “outer”, “lateral” or even “external”, but, in fact, corresponds to the dorsal side of the insect leg.

Another example can be found in the parameres of the male genitalia (Fig. 3b). Traditionally, left and right parameres are defined on the basis of an axis in which the articulation between parameres and phallobase is positioned "basally" and the distalmost part of the paramere "apically". Left and right sides are then inferred by interpreting this "basoapical" axis as the equivalent of an anteroposterior one. However, if we consider the position of the aedeagus relative to the entire insect body, such axes – and, consequently, left and right sides – are reversed.

The revised interpretation of position has been implemented in the natural language (NL) descriptions by Montanaro et al. (2024) and we have also adopted it here, both for developing new ontology terms and for crafting semantic descriptions. While initially this perspective may seem counterintuitive to traditional taxonomists, it represents the most anatomically consistent way for describing these structures.

Phenoscript: main patterns of phenotype statements

In the examples below, we provide traditional NL statements followed by their equivalent Phenoscript statements (in italics), for the main types of semantic traits used in our descriptions. The "note" section briefly explains the rationale for the use of certain semantic constructs.

In a nutshell, a Phenoscript statement consists of a sequence of nodes and edges (i.e. relationships) in a knowledge graph. Edges are defined by a preceding dot "." symbol. Each statement should begin and end with a node. The nodes are followed by edges, which specify the relationships between the nodes. Node–edge sequences may be as long as necessary. The semantic statement is closed with a semicolon ";". Every ontology term is prefixed with the abbreviation of its originating ontology, functioning as a namespace. For instance, the term "aism-cuticular_carina" signifies that "cuticular carina" comes from the AISM ontology. This prefixing helps to clearly identify the source ontology of each term.

Most edges refer to object properties, such as "has_part", "has_characteristic" and their inverses. To improve readability, Phenoscript uses aliases instead of long labels. The following aliases are used:

">" has_part;
"<" part_of;
">>" has_characteristic;
"<<" characteristic_of;
"|>|" increased_in_magnitude_relative_to;
"|<|" decreased_in_magnitude_relative_to.

We encourage the reader to consult the Phenoscript language guide for syntax details. For more information about semantic characters, we suggest the Phenoscape Guide to Character Annotation and other relevant materials (Balhoff et al. 2010, Dahdul et al. 2010, Dahdul et al. 2018); note that they were designed for a class-based approach, but can be adapted to an instance-based approach with minor modifications. The Phenoscript version of the statements below is available in Supplementary Material.
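The syntax rules and aliases listed in this section can be illustrated with a minimal Python sketch that turns a simple chain statement into node–edge–node triples. It is not the Phenospy parser: it assumes whitespace-separated tokens, handles neither node lists nor negation, and treats every node as a plain string rather than creating ontology individuals. The function names `parse_statement` and `split_prefix` are hypothetical.

```python
ALIASES = {
    ">": "has_part",
    "<": "part_of",
    ">>": "has_characteristic",
    "<<": "characteristic_of",
    "|>|": "increased_in_magnitude_relative_to",
    "|<|": "decreased_in_magnitude_relative_to",
}


def parse_statement(statement):
    """Split a simple chain statement into (node, edge, node) triples."""
    tokens = statement.strip().rstrip(";").split()
    if len(tokens) % 2 == 0:
        raise ValueError("a statement must begin and end with a node")
    triples = []
    for i in range(1, len(tokens), 2):
        edge = tokens[i]
        # Aliases are expanded; dot-prefixed edges are kept by their name.
        name = ALIASES.get(edge, edge.lstrip("."))
        triples.append((tokens[i - 1], name, tokens[i + 1]))
    return triples


def split_prefix(term):
    """Separate the ontology prefix (namespace) from the term label."""
    prefix, _, label = term.partition("-")
    return prefix, label


triples = parse_statement(
    "uberon-male_organism > aism-metafemur > bspo-dorsal_margin > aism-cuticular_carina;"
)
```

For the statement above, the sketch yields three "has_part" triples linking the organism, the metafemur, its dorsal margin and the cuticular carina, and `split_prefix("aism-cuticular_carina")` separates the AISM namespace from the term label.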

Presence Phenotypes

Dorsal margin of metafemur with carina (Fig. 5b:16).
- uberon-male_organism > aism-metafemur > bspo-dorsal_margin > aism-cuticular_carina;
- Note. Simple character statement indicating presence of a structure, i.e. a cuticular carina on the dorsal margin of the metafemur of our organism.
Interstria 5 tuberculated distally (Fig. 4a:7 and Fig. 5c:7).
- uberon-male_organism > colao-elytral_interstria_5 > bspo-distal_region > aism-cuticular_tubercle;
- Note. Simple character statement analogous to the previous one.
Axial and subaxial endophallites fused.
- uberon-male_organism > colao-fused_axial_and_subaxial_endophallites;
- Note. The ad hoc COLAO term fused_axial_and_subaxial_endophallites allows a complex phenotypic character to be written as a simple presence statement, i.e. the endophallite originated by the fusion of the axial_endophallite with the subaxial_endophallite.

Figure 5.

Grebennikovius basilewskyi in different views. Numbers next to arrows indicate patterns of phenotype statements explained in the section "Phenoscript: main patterns of phenotype statements". Arrows T6–T12 illustrate individual body parts.

a: anterodorsal view. Arrows T6–T9 show lateral clypeal tooth 1 (T6), dorsal protibial cuticular tooth 1 (T7), dorsal protibial cuticular tooth 2 (T8) and dorsal protibial cuticular tooth 3 (T9);
b: lateral view. Arrows T10–T12 show pronotum (T10), lateral pronotal carina (T11), and hypomeron (T12);
c: posterior view.

Absence Phenotypes

Posterior longitudinal hypomeral carina absent (Fig. 4b:9).
- uberon-male_organism !> colao-posterior_longitudinal_hypomeral_carina;
- Note. Simple absence statement in which the negation (!) expresses the lack of the entity colao-posterior_longitudinal_hypomeral_carina in the organism's body.
Fourth protibial cuticular tooth absent (Fig. 5a:14).
- uberon-male_organism > aism-protibia !> aism-dorsal_protibial_cuticular_tooth_4;
- Note. This character goes along with the character "Protibia with 3 teeth". In fact, the statement that the protibia has three teeth does not logically imply that the fourth one is absent. To assert it unequivocally, the presence of the tooth must be negated.
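The reason why absence must be asserted explicitly can be shown with a short Python sketch of the underlying open-world logic: a structure that is never mentioned is not absent, merely unknown. The function `presence_status` is a hypothetical helper that reads only simple whitespace-separated chain statements with the ">" and "!>" edges; it is an illustration and not part of Phenospy.

```python
def presence_status(statements, term):
    """Classify a term as 'present', 'absent' or 'unknown' from simple chains."""
    present, absent = set(), set()
    for statement in statements:
        tokens = statement.strip().rstrip(";").split()
        # Tokens alternate node, edge, node, ...; inspect each edge and its target.
        for i in range(1, len(tokens) - 1, 2):
            if tokens[i] == "!>":
                absent.add(tokens[i + 1])
            elif tokens[i] == ">":
                present.add(tokens[i + 1])
    if term in absent:
        return "absent"
    if term in present:
        return "present"
    return "unknown"  # open world: no assertion either way


teeth = [
    "uberon-male_organism > aism-protibia > aism-dorsal_protibial_cuticular_tooth_1;",
    "uberon-male_organism > aism-protibia > aism-dorsal_protibial_cuticular_tooth_2;",
    "uberon-male_organism > aism-protibia > aism-dorsal_protibial_cuticular_tooth_3;",
]
negation = "uberon-male_organism > aism-protibia !> aism-dorsal_protibial_cuticular_tooth_4;"
tooth_4 = "aism-dorsal_protibial_cuticular_tooth_4"

before = presence_status(teeth, tooth_4)
after = presence_status(teeth + [negation], tooth_4)
```

With only the three presence statements, the status of the fourth tooth is "unknown"; it becomes "absent" only once the negated statement is added.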

Count Phenotypes

Maxillary palpus with 4 palpomeres (Fig. 4b:8).
- uberon-male_organism > aism-maxillary_palpus_with_4_palpomeres;
- Note. Simple character statement indicating the presence of a structure. Interestingly, this apparently simple line automatically encapsulates, through the AISM term maxillary_palpus_with_4_palpomeres, the presence of each individual palpomere (i.e. AISM maxillary_palpomere_I to maxillary_palpomere_IV).
Legs with 5 tarsomeres (Fig. 4:6).
- uberon-male_organism > (aism-protarsus_with_5_protarsomeres, aism-mesotarsus_with_5_mesotarsomeres, aism-metatarsus_with_5_metatarsomeres);
- Note. Expressing the presence of five tarsomeres in the pro-, meso- and metalegs would intuitively require writing three separate statements, one for each leg type. However, Phenoscript supports node lists, i.e. multiple characters that are simultaneously ascribed to the organism can be written on one line as a list between parentheses.
Protibia with 3 teeth (Fig. 5a:14).
- uberon-male_organism > aism-protibia > (aism-dorsal_protibial_cuticular_tooth_1, aism-dorsal_protibial_cuticular_tooth_2, aism-dorsal_protibial_cuticular_tooth_3);
- Note. Similarly to the previous one, this statement expresses that the protibia bears protibial teeth 1 (distalmost) to 3 (proximalmost). The absence of a possible protibial_tooth_4 is expressed by the pattern "Fourth protibial cuticular tooth absent" (see "Absence Phenotypes").
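How a node list abbreviates several statements can be sketched in Python by expanding a trailing parenthesised list into one statement per item. The helper `expand_node_list` is hypothetical and handles only a single list at the end of a statement. It works at the level of text and does not address whether the shared nodes refer to the same individual, which is handled by the actual Phenoscript tooling.

```python
import re


def expand_node_list(statement):
    """Expand one trailing '(a, b, c)' node list into separate statements."""
    body = statement.strip().rstrip(";").strip()
    match = re.fullmatch(r"(.*?)\((.*)\)", body)
    if match is None:
        return [body + ";"]  # no node list: return the statement unchanged
    head, items = match.group(1).strip(), match.group(2)
    return [f"{head} {item.strip()};" for item in items.split(",")]


expanded = expand_node_list(
    "uberon-male_organism > (aism-protarsus_with_5_protarsomeres, "
    "aism-mesotarsus_with_5_mesotarsomeres, "
    "aism-metatarsus_with_5_metatarsomeres);"
)
```

The single "Legs with 5 tarsomeres" line expands into three statements, one for each of the pro-, meso- and metatarsus.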

Qualitative Phenotypes

Organism oval-shaped and flattened (Fig. 4 and Fig. 5).
- uberon-male_organism >> (pato-ovate, pato-flattened);
- Note. Simple quality statement in the form of a node list, expressing on the same line that the organism is both ovate and flattened.
Clypeus with two sharp, upturned triangular teeth (Fig. 5a:13).
- uberon-male_organism > aism-lateral_clypeal_tooth_1 >> (pato-upturned, pato-sharp);
- Note. Composed statement which first expresses the presence of lateral_clypeal_tooth_1 (which is bilaterally paired by definition), then it qualifies it as both upturned and sharp.
Posteromedial margin of head with small smooth area (Fig. 5a:11).
- uberon-male_organism > aism-vertex > bspo-posterior_region > bspo-medial_region >> pato-smooth;
- Note. Composed statement expressing the presence of a posteromedial region of vertex, defined by postcomposition as the posterior_region of medial_region of vertex. This region is then qualified as smooth.
Posterior margin of pronotum with row of large, oval, ocellate punctures (Fig. 4a:2).
- uberon-male_organism > aism-pronotum > bspo-posterior_region > aism-row_of_punctures > aism-ocellate_cuticular_puncture >> (pato-increased_size, pato-ovate);
- Note. Complex statement which first describes the presence of a row of punctures on the posterior margin of the pronotum, composed of (an undefined number of) ocellate_cuticular_puncture, and then qualifies such punctures as increased_size (= large) and ovate. This feature is particularly evident in G. armiger and G. pafelo, but a row of large punctures can also be observed in G. basilewskyi.
Scutellar shield absent (Fig. 4a:3).
- uberon-male_organism > colao-scutellar_shield >> pato-concealed;
- Note. We have chosen the term concealed instead of simply expressing the absence of scutellum because this structure is not visible from above, but present beneath the elytra.
Mesometaventral sulcus rounded medially (Fig. 4b:10).
- uberon-male_organism > colao-mesometaventral_sulcus > bspo-medial_region >> pato-curved;
- Note. We adopt the term "mesometaventral sulcus" to refer to the transverse sulcus between the mesoventrite and the metaventrite. The statement defines its medial region and then qualifies it as curved, a PATO term with a meaning equivalent to the NL "rounded".
Metatibia expanded distally (Fig. 4a:5).
- uberon-male_organism > aism-metatibia > bspo-distal_region >> pato-dilated;
- Note. Composed statement expressing the presence of the metatibia, then defining its distal_region and lastly qualifying it as expanded through the semantically equivalent PATO term dilated.
Protibial apex bearing a row of short, thick setae (Fig. 5a:15).
- uberon-male_organism > aism-protibia > aism-antero-distal_margin > aism-cuticular_seta >> (pato-multiple, pato-increased_thickness);
- Note. Composed statement in which the presence of many cuticular_seta is described by qualifying them as multiple. The relatively high thickness of setae is then given by increased_thickness.
Parameres symmetrical, elongated.
- uberon-male_organism > aism-parameres >> (pato-symmetrical, pato-elongated);
- Note. Statement qualifying parameres both as symmetrical and elongated. Here we used aism-parameres instead of aism-paramere to mean that the region of cuticle formed by the two parameres is symmetrical and, therefore, the left and right parameres have shapes that mirror each other. Using aism-paramere would have been incorrect, since an individual paramere is not symmetrical per se.

Absolute and Relative Measurement Phenotypes

Body length: 4.5 mm.
- uberon-male_organism >> pato-length .iao-is_quality_measured_as iao-measurement_datum:md-c4c164 .aism-has_unit unit-millimeter; iao-measurement_datum:md-c4c164 .iao-has_measurement_value 4.5;
- Note. Absolute measurements require a measurement_datum to be defined, to which the measure of the focal quality (in this case, length) is assigned. Two things are then ascribed to the measurement_datum: 1) an absolute measurement unit (in this case, millimetre) through the property .aism-has_unit; and 2) a measurement value (in this case, 4.5) through the property .iao-has_measurement_value. Because of its syntax, the statement has to be split into two lines, each terminated by a semicolon. For this reason, an alphanumeric tag must be added to iao-measurement_datum so that it is recognised as the same entity on both lines.
Pronotal surface covered with ocellate setigerous punctures separated by 1–2 diameters (Fig. 4a:1).
- uberon-male_organism > aism-pronotum:id-1549f8 >> aism-interpunctural_distance .iao-is_quality_measured_as iao-measurement_datum:md-1b36aa .aism-has_unit pato-diameter << aism-ocellate_setigerous_cuticular_puncture < aism-pronotum:id-1549f8; iao-measurement_datum:md-1b36aa .iao-has_measurement_value 1.5;
- Note. Relative measurements are substantially similar to absolute ones, but they use qualities of other entities as measurement units. In this example, the interpunctural_distance on the pronotum is measured using the quality diameter inhering in ocellate_setigerous_cuticular_puncture. For simplicity, we prefer to give the average measurement value (1.5) instead of specifying the interval (1–2). As explained in the previous pattern, alphanumeric tags identify aism-pronotum and iao-measurement_datum occurring on separate lines as the same entities.
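
The role of the tagged measurement_datum in the two measurement patterns of this section can be sketched in Python as a small list of (subject, property, object) triples. This is only an illustration and not the OWL that Phenospy produces: the triple layout and the helper measurement_of are assumptions, while the property names, tags, units and values are copied from the two statements above.

```python
# Tagged names identify one and the same measurement datum wherever they occur.
BODY_DATUM = "iao-measurement_datum:md-c4c164"
PUNCTURE_DATUM = "iao-measurement_datum:md-1b36aa"

# (subject, property, object) triples; a simplified, assumed layout.
triples = [
    # Absolute measurement: body length = 4.5 mm.
    ("pato-length", "iao-is_quality_measured_as", BODY_DATUM),
    (BODY_DATUM, "aism-has_unit", "unit-millimeter"),
    (BODY_DATUM, "iao-has_measurement_value", 4.5),
    # Relative measurement: interpunctural distance = 1.5 puncture diameters.
    ("aism-interpunctural_distance", "iao-is_quality_measured_as", PUNCTURE_DATUM),
    (PUNCTURE_DATUM, "aism-has_unit", "pato-diameter"),
    (PUNCTURE_DATUM, "iao-has_measurement_value", 1.5),
]


def measurement_of(quality, graph):
    """Follow quality -> measurement datum -> (value, unit) in a triple list."""
    datum = next(o for s, p, o in graph
                 if s == quality and p == "iao-is_quality_measured_as")
    value = next(o for s, p, o in graph
                 if s == datum and p == "iao-has_measurement_value")
    unit = next(o for s, p, o in graph
                if s == datum and p == "aism-has_unit")
    return value, unit


print(measurement_of("pato-length", triples))
print(measurement_of("aism-interpunctural_distance", triples))
```

Because each datum carries its own tag, the value and the unit attach to the same node even though they are written on different lines, and the two measurements do not get mixed up.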

Relative Comparison Phenotypes (within the same species)

Head punctures becoming smaller anteriorly (Fig. 5a:12).
- uberon-male_organism > aism-clypeus > bspo-anterior_region > aism-setigerous_cuticular_puncture >> pato-diameter |<| pato-diameter << aism-setigerous_cuticular_puncture < aism-frons;
- Note. This pattern compares the quality of an entity (the diameter of some setigerous_cuticular_puncture located on the anterior region of clypeus) with an equivalent quality of another entity (the diameter of some setigerous_cuticular_puncture located on the frons). The property decreased_in_magnitude_relative_to (alias: |<|) between the two qualities states that the former has a smaller value than the latter.
Ventral membranes of parameres equally sclerotised.
- uberon-male_organism > aism-left_ventral_conjunctiva_of_paramere >> pato-thickness .ro-similar_in_magnitude_relative_to pato-thickness << aism-right_ventral_conjunctiva_of_paramere < uberon-male_organism;
- Note. This statement compares the degree of sclerotisation of some left_ventral_conjunctiva_of_paramere (preferred quality term: pato-thickness) with that of the right_ventral_conjunctiva_of_paramere. In AISM and COLAO, the term "conjunctiva" is preferred over "membrane" as conjunctiva is explicitly defined for cuticular elements in insects. Since the two quality values are similar, the property .ro-similar_in_magnitude_relative_to was chosen.
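
As a rough illustration of how the two comparison statements above relate to the wording seen in the generated NL descriptions ("smaller than", "similar in magnitude relative to"), the following Python sketch splits a statement at its comparison property. The function describe_comparison and the phrase table are hypothetical and cover only the two properties used in this section; they do not reproduce the Phenospy conversion.

```python
# Comparison properties used in this section and the NL wording that
# appears for them in the generated descriptions.
COMPARISON_PHRASES = {
    "|<|": "smaller than",
    ".ro-similar_in_magnitude_relative_to": "similar in magnitude relative to",
}


def describe_comparison(statement):
    """Return (left side, NL phrase, right side), or None if no comparison."""
    body = statement.strip().rstrip(";")
    for operator, phrase in COMPARISON_PHRASES.items():
        left, found, right = body.partition(f" {operator} ")
        if found:  # the operator occurs in this statement
            return left.strip(), phrase, right.strip()
    return None


punctures = (
    "uberon-male_organism > aism-clypeus > bspo-anterior_region > "
    "aism-setigerous_cuticular_puncture >> pato-diameter |<| pato-diameter "
    "<< aism-setigerous_cuticular_puncture < aism-frons;"
)
left, phrase, right = describe_comparison(punctures)
print(phrase)
print(right)
```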

Relative Comparison Phenotypes (between species)

Hind wing of Grebennikovius armiger shorter than hind wing of Grebennikovius basilewskyi.
- uberon-male_organism::grebennikovius_armiger > aism-hind_wing >> pato-length |<| pato-length:id-d38816[exclude = True] << aism-hind_wing:id-6b490e[exclude = True] < uberon-male_organism:grebennikovius_basilewskyi[exclude = True];
- Note. This statement compares two qualities ascribed to two different OTUs, Grebennikovius armiger and Grebennikovius basilewskyi. To allow the use of entities or qualities belonging to one OTU (G. basilewskyi) within another OTU's (G. armiger) description, the function “exclude” must be used. This relative comparison is composed of two parts: 1) the length of the hind_wing of the organism uberon-male_organism::grebennikovius_armiger and 2) the length of the hind_wing of uberon-male_organism:grebennikovius_basilewskyi. The comparison is expressed through the property decreased_in_magnitude_relative_to (alias: |<|) written between the two lengths.
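
To make the role of the “exclude” flag more tangible, the Python sketch below lists the nodes of a statement that carry it, i.e. the nodes that, according to the note above, belong to the other OTU. The helper excluded_nodes is a hypothetical name; the sketch only inspects the text of the statement and makes no claim about how Phenospy handles such nodes internally.

```python
from __future__ import annotations

import re

# A node name followed directly by the literal flag "[exclude = True]".
EXCLUDE_FLAG = re.compile(r"(\S+)\[exclude = True\]")


def excluded_nodes(statement: str) -> list[str]:
    """Return the node names that carry the '[exclude = True]' flag."""
    return EXCLUDE_FLAG.findall(statement)


hind_wing = (
    "uberon-male_organism::grebennikovius_armiger > aism-hind_wing >> "
    "pato-length |<| pato-length:id-d38816[exclude = True] << "
    "aism-hind_wing:id-6b490e[exclude = True] < "
    "uberon-male_organism:grebennikovius_basilewskyi[exclude = True];"
)
for node in excluded_nodes(hind_wing):
    print(node)
```

For the hind wing statement, the three flagged nodes are the length, the hind wing and the organism of G. basilewskyi; the nodes to the left of |<| remain part of the description of G. armiger.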

Other Phenotypes

Elytral interstria 8 carinated, the carina located medially to the lateral region of elytron (Fig. 4a:4 and Fig. 5b:4).
- uberon-male_organism > colao-elytron_with_9_striae:id-5770f5 > colao-elytral_interstria_8 > aism-cuticular_carina .aism-medial_to bspo-lateral_region < colao-elytron_with_9_striae:id-5770f5;
- Note. This statement describes the position of an anatomical entity (a cuticular_carina located on the elytral_interstria_8) with respect to another entity (the lateral_region of the elytron itself). The positional property aism-medial_to is used to state that the first entity is medial to the second.
Margin of abdominal tergite VIII entirely grooved (Fig. 5c:17).
- uberon-male_organism > aism-abdominal_tergite_VIII:id-791f84 > bspo-anatomical_margin .ro-coincident_with aism-cuticular_groove < aism-abdominal_tergite_VIII:id-791f84;
- Note. In dung beetles, the abdominal tergite VIII (a.k.a. pygidium) often has a more or less strongly impressed groove running parallel to the tergite's margins. This structure is, therefore, defined here as a cuticular_groove coincident_with the anatomical_margin of the tergite. The RO term coincident_with is, in this case, the most appropriate one to describe such positional relationship since it refers to linear structures that are both parallel and adjacent/coincident.

Taxon treatments

Grebennikovius armiger Montanaro, Grebennikov, Rossini, Grapputo, Ruzzier & Tarasov, 2024

Material

Holotype:

scientificName: Grebennikovius armiger; taxonID: http://zoobank.org/883D3610-BE9D-4818-AE90-7D729A205190; family: Scarabaeidae; genus: Grebennikovius; specificEpithet: armiger; scientificNameAuthorship: Montanaro, Grebennikov, Rossini, Grapputo, Ruzzier & Tarasov, 2024; country: Tanzania; locality: Uluguru Mountains, at Tchenzema village; verbatimElevation: 2408 m; decimalLatitude: -7.115; decimalLongitude: 37.609444; samplingProtocol: litter sifting; eventDate: 11-08-10; habitat: forest; fieldNumber: sifting10; individualCount: 1; sex: male; lifeStage: adult; preparations: dry specimen; catalogNumber: http://id.luomus.fi/GAC.37252; recordedBy: V. Grebennikov; identifiedBy: G. Montanaro; dateIdentified: 2023; institutionID: MZH; basisOfRecord: PreservedSpecimen; occurrenceID: 294CCFE0-1DAE-581A-9D3A-CC1F672D5814
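
Material records of this kind are lists of "term: value" pairs separated by semicolons, so they can be read back into a key-value structure with a few lines of Python. The sketch below is a minimal illustration: the function name parse_material is hypothetical, and the sample string is an abridged copy of the holotype record of G. armiger given above.

```python
from __future__ import annotations


def parse_material(record: str) -> dict[str, str]:
    """Split a 'term: value; term: value' record into a dictionary.

    Only the first colon of each pair separates term from value, so URLs
    such as 'http://...' survive intact. Line breaks and extra spaces
    around terms and values are ignored.
    """
    fields = {}
    for chunk in record.split(";"):
        term, separator, value = chunk.partition(":")
        if separator and term.strip():
            fields[term.strip()] = value.strip()
    return fields


holotype = (
    "scientificName: Grebennikovius armiger; "
    "taxonID: http://zoobank.org/883D3610-BE9D-4818-AE90-7D729A205190; "
    "locality: Uluguru Mountains, at Tchenzema village; "
    "decimalLatitude: -7.115; decimalLongitude: 37.609444"
)
fields = parse_material(holotype)
print(fields["scientificName"])
print(float(fields["decimalLatitude"]), float(fields["decimalLongitude"]))
```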

Description

male organism, chitin-based cuticle: red brown;
male organism, ventral side: dark brown;
male organism, antenna: yellow brown;
male organism, gena: sharp;
male organism, lateral clypeal tooth 1
- lateral clypeal tooth 1: upturned;
- lateral clypeal tooth 1: sharp;
male organism, head margin at genoclypeal sulcus: notched;
male organism, head capsule, frons, interpunctural distance = 1.0, unit: diameter of setigerous cuticular puncture;
male organism, clypeus, anterior region, setigerous cuticular puncture: diameter smaller than diameter of setigerous cuticular puncture of frons
male organism, vertex, posterior region, medial region: smooth;
male organism, antenna with 9 antennomeres, antennal club
- antennal club, flagellomere 5: present;
- antennal club, flagellomere 6: present;
- antennal club, flagellomere 7: present;
male organism, glossa: present;
male organism, epipharynx: present;
male organism, insect maxilla: present;
male organism, maxillary palpus with 4 palpomeres: present;
male organism, labial palpus with 3 palpomeres: present;
male organism, pronotum
- pronotum, antero-lateral region: flattened;
- pronotum, posterior region
  - posterior region, row of punctures, ocellate cuticular puncture
    - ocellate cuticular puncture: increased size;
    - ocellate cuticular puncture: ovate;
  - posterior region: sloped;
- pronotum, antero-lateral margin
  - antero-lateral margin: curved;
  - antero-lateral margin: obtuse;
- pronotum, posterolateral pronotal angle: curved;
- pronotum, posterior margin: curved;
- pronotum, postero-lateral margin: oblique orientation;
- pronotum, longitudinal pronotal groove: smooth;
- pronotum: width
  - width larger than width of elytron with 9 striae
  - width of pronotum
- pronotum, interpunctural distance = 1.5, unit: diameter of ocellate setigerous cuticular puncture;
male organism, pronotal disc: convex;
male organism, elytron with 9 striae
male organism, elytral interstria 5
- elytral interstria 5, proximal region: concave;
- elytral interstria 5, cuticular tubercle: absent;
male organism, elytral interstria 6, anterior-most region: concave;
male organism, scutellar shield: concealed;
male organism, hind wing
- hind wing: atrophied;
- hind wing: length;*
male organism, anterior hypomeral depression: present;
male organism, procoxal cavity, width = 0.375, unit: width of pronotum;
male organism, mesometaventral sulcus
- mesometaventral sulcus, medial region: curved;
- mesometaventral sulcus, lateral region: straight;
male organism, mesoventrite, cuticular puncture: present;
male organism, metaventrite
- metaventrite, punctate cuticle
  - punctate cuticle, posterior region, cuticular puncture: diameter smaller than diameter of cuticular puncture of anterior region of punctate cuticle
  - punctate cuticle, interpunctural distance = 1.5, unit: diameter of cuticular puncture;
- metaventrite: convex;
male organism, abdomen with 7 sternites: present;
male organism, abdominal tergite VIII
- abdominal tergite VIII, anatomical margin coincident with cuticular groove of abdominal tergite VIII
- abdominal tergite VIII, cuticle with setigerous punctures, ocellate setigerous cuticular puncture: present;
- abdominal tergite VIII, cuticular tubercle: bilaterally paired;
male organism, anterior groove of tergite VIII: present;
male organism, protarsus with 5 protarsomeres: present;
male organism, mesotarsus with 5 mesotarsomeres: present;
male organism, metatarsus with 5 metatarsomeres: present;
male organism, protibia
- protibia, dorsal protibial cuticular tooth 1: present;
- protibia, dorsal protibial cuticular tooth 2: present;
- protibia, dorsal protibial cuticular tooth 3: present;
- protibia, antero-distal margin, cuticular seta
  - cuticular seta: multiple;
  - cuticular seta: increased thickness;
- protibia, postero-distal margin
  - postero-distal margin: dilated;
  - postero-distal margin: curved;
- protibia: curved;
- protibia, dorsal protibial cuticular tooth 4: absent;
male organism, mesotibia, distal region: dilated;
male organism, metatibia
- metatibia, distal region: dilated;
- metatibia, medial region: curved;
- metatibia, distal region: dilated;
- metatibia, dorsal margin, distal region: curved;
male organism, profemur, ventral side
- ventral side: dilated;
- ventral side: tapered;
male organism, metafemur
- metafemur, dorsal margin, cuticular carina: present;
- metafemur, ventral margin, cuticular protrusion
  - cuticular protrusion: triangular;
  - cuticular protrusion: flattened;
male organism, procoxa, ventral region, cuticular carina, cuticular tubercle: present;
male organism, parameres
- parameres, ventral side, proximal region: notched;
- parameres, lateral side, distal region: tapered;
- parameres: symmetrical;
- parameres: elongated;
male organism, left ventral conjunctiva of paramere: thickness similar in magnitude relative to thickness of right ventral conjunctiva of paramere of male organism
male organism, lamella copulatrix
- lamella copulatrix: elongated;
- lamella copulatrix: straight;
male organism, fused axial and subaxial endophallites: present;
male organism: ovate;
male organism: flattened;
male organism, posterior longitudinal hypomeral carina: absent;
male organism, frontolateral peripheral endophallite: absent;
male organism, length = 4.5, unit: millimeter;

Notes

The asterisk (*) next to "hind wing: length;" denotes an incomplete conversion from OWL to generated NL, suggesting that the Phenospy algorithm requires refinement. The correct statement should read "the length of the hind wing is smaller than that of the male of G. basilewskyi".

Grebennikovius basilewskyi (Balthasar, 1960)

Material

scientificName: Grebennikovius basilewskyi; taxonID: https://www.gbif.org/species/10023107; family: Scarabaeidae; genus: Grebennikovius; specificEpithet: basilewskyi; scientificNameAuthorship: (Balthasar, 1960); country: Tanzania; locality: Uluguru Mountains, at Bunduki village; verbatimElevation: 1592 m; decimalLatitude: -7.021389; decimalLongitude: 37.652778; samplingProtocol: litter sifting; eventDate: 11-22-10; habitat: forest; fieldNumber: sifting21; individualCount: 1; sex: male; lifeStage: adult; preparations: dry specimen; catalogNumber: http://id.luomus.fi/GAC.37261; recordedBy: V. Grebennikov; identifiedBy: G. Montanaro; dateIdentified: 2023; institutionID: MZH; basisOfRecord: PreservedSpecimen; occurrenceID: D5993BB7-AB31-5808-BEF9-3D55E77CCD84

Description

male organism, chitin-based cuticle: red brown;
male organism, ventral side: dark brown;
male organism, antenna: yellow brown;
male organism, gena: obtuse;
male organism, lateral clypeal tooth 1
- lateral clypeal tooth 1: upturned;
- lateral clypeal tooth 1: sharp;
male organism, head margin at genoclypeal sulcus: notched;
male organism, head capsule, frons, interpunctural distance = 1.0, unit: diameter of setigerous cuticular puncture;
male organism, clypeus, anterior region, cuticular puncture: diameter smaller than diameter of setigerous cuticular puncture of frons
male organism, vertex, posterior region, medial region: smooth;
male organism, antenna with 9 antennomeres, antennal club
- antennal club, flagellomere 5: present;
- antennal club, flagellomere 6: present;
- antennal club, flagellomere 7: present;
male organism, glossa: present;
male organism, epipharynx: present;
male organism, insect maxilla: present;
male organism, maxillary palpus with 4 palpomeres: present;
male organism, labial palpus with 3 palpomeres: present;
male organism, pronotum
- pronotum, antero-lateral margin
  - antero-lateral margin: curved;
  - antero-lateral margin: obtuse;
- pronotum, posterolateral pronotal angle: curved;
- pronotum, posterior margin: curved;
- pronotum, postero-lateral margin: oblique orientation;
- pronotum, posterior region
  - posterior region, longitudinal pronotal groove: present;
  - posterior region, ocellate cuticular puncture: diameter larger than diameter of ocellate cuticular puncture of anterior region of pronotum
- pronotum, pronotal disc, interpunctural distance = 1.5, unit: diameter of ocellate setigerous cuticular puncture;
- pronotum, lateral region: interpunctural distance;
- pronotum: width
  - width smaller than width of elytron with 9 striae
  - width of pronotum
male organism, pronotal disc: convex;
male organism, elytron with 9 striae
male organism, elytral interstria: convex;
male organism, elytral interstria 1, distal region: protruding;
male organism, elytral interstria 7, proximal region
- proximal region: protruding;
- proximal region: yellow brown;
male organism, elytral interstria 6, proximal region: yellow brown;
male organism, elytral interstria 5, distal region, cuticular tubercle: present;
male organism, scutellar shield: concealed;
male organism, hind wing
- hind wing of male organism*
- hind wing: atrophied;
- hind wing, length = 0.5, unit: length of elytron with 9 striae;
male organism, anterior hypomeral depression: present;
male organism, procoxal cavity, width = 0.375, unit: width of pronotum;
male organism, mesometaventral sulcus
- mesometaventral sulcus, medial region: curved;
- mesometaventral sulcus, lateral region: straight;
male organism, mesoventrite, cuticular puncture: present;
male organism, metaventrite
- metaventrite, punctate cuticle
  - punctate cuticle, medial region, cuticular puncture: decreased magnitude;
  - punctate cuticle, interpunctural distance = 1.5, unit: diameter of cuticular puncture;
- metaventrite: convex;
male organism, abdomen with 7 sternites: present;
male organism, abdominal tergite VIII
- abdominal tergite VIII, anatomical margin coincident with cuticular groove of abdominal tergite VIII
- abdominal tergite VIII, cuticle with setigerous punctures, ocellate setigerous cuticular puncture: present;
- abdominal tergite VIII, medial region: protruding;
male organism, anterior groove of tergite VIII: present;
male organism, protarsus with 5 protarsomeres: present;
male organism, mesotarsus with 5 mesotarsomeres: present;
male organism, metatarsus with 5 metatarsomeres: present;
male organism, protibia
- protibia, dorsal protibial cuticular tooth 1: present;
- protibia, dorsal protibial cuticular tooth 2: present;
- protibia, dorsal protibial cuticular tooth 3: present;
- protibia, antero-distal margin, cuticular seta
  - cuticular seta: multiple;
  - cuticular seta: increased thickness;
- protibia, postero-distal margin: obtuse;
- protibia: curved;
- protibia, dorsal protibial cuticular tooth 4: absent;
male organism, mesotibia, distal region: dilated;
male organism, metatibia
- metatibia, distal region: dilated;
- metatibia, distal region: dilated;
- metatibia, dorsal margin, distal region: straight;
male organism, profemur, ventral margin, medial region, cuticular tooth: sharp;
male organism, metafemur
- metafemur, dorsal margin, cuticular carina: present;
- metafemur, ventral margin, proximal region, cuticular tubercle: present;
- metafemur, anatomical region
  - anatomical region distal to cuticular tubercle
  - anatomical region: dilated;
male organism, procoxa, ventral region, cuticular carina, cuticular tubercle: present;
male organism, phallobase, proximal region: curved;
male organism, parameres
- parameres, proximal region, ventral region
  - ventral region: notched;
  - ventral region: dilated;
- parameres, distal region: tapered;
- parameres: symmetrical;
male organism, left ventral conjunctiva of paramere: thickness similar in magnitude relative to thickness of right ventral conjunctiva of paramere of male organism
male organism, lamella copulatrix
- lamella copulatrix, distal margin
  - distal margin, cuticular spine: present;
  - distal margin: curved;
- lamella copulatrix: elongated;
male organism, fused axial and subaxial endophallites: present;
male organism: ovate;
male organism: flattened;
male organism, posterior longitudinal hypomeral carina: absent;
male organism, frontolateral peripheral endophallite: absent;
male organism, length = 3.8, unit: millimeter;

Notes

The asterisk (*) next to "hind wing of male organism" denotes an improper conversion to NL. The statement should not appear in the generated NL description, but does so due to similar errors elsewhere.

Grebennikovius lupanganus Montanaro, Grebennikov, Rossini, Grapputo, Ruzzier & Tarasov, 2024

Material

Holotype:

scientificName: Grebennikovius lupanganus; taxonID: http://zoobank.org/1D32A7F2-0376-436B-976F-54C2EEEC430C; family: Scarabaeidae; genus: Grebennikovius; specificEpithet: lupanganus; scientificNameAuthorship: Montanaro, Grebennikov, Rossini, Grapputo, Ruzzier & Tarasov, 2024; country: Tanzania; locality: Uluguru Mountains, Lupanga Peak; verbatimElevation: 1921 m; decimalLatitude: -6.865; decimalLongitude: 37.707778; samplingProtocol: litter sifting; eventDate: 12-01-12; habitat: forest; fieldNumber: sifting27; individualCount: 1; sex: male; lifeStage: adult; preparations: dry specimen; catalogNumber: http://id.luomus.fi/GAC.37250; recordedBy: V. Grebennikov; identifiedBy: G. Montanaro; dateIdentified: 2023; institutionID: MZH; basisOfRecord: PreservedSpecimen; occurrenceID: 79A3C8F9-BE33-5ED6-80C9-087F08795D29

Description

male organism, chitin-based cuticle: red brown;
male organism, ventral side: dark brown;
male organism, antenna: yellow brown;
male organism, gena: obtuse;
male organism, lateral clypeal tooth 1
- lateral clypeal tooth 1: upturned;
- lateral clypeal tooth 1: sharp;
male organism, head margin at genoclypeal sulcus: notched;
male organism, head capsule, frons, interpunctural distance = 1.0, unit: diameter of cuticular puncture;
male organism, vertex, posterior region, medial region: smooth;
male organism, antenna with 9 antennomeres, antennal club
- antennal club, flagellomere 5: present;
- antennal club, flagellomere 6: present;
- antennal club, flagellomere 7: present;
male organism, glossa: present;
male organism, epipharynx: present;
male organism, insect maxilla: present;
male organism, maxillary palpus with 4 palpomeres: present;
male organism, labial palpus with 3 palpomeres: present;
male organism, pronotum
- pronotum, pronotal disc: convex;
- pronotum, antero-lateral region: flattened;
- pronotum, antero-lateral margin
  - antero-lateral margin: curved;
  - antero-lateral margin: obtuse;
- pronotum, posterolateral pronotal angle: curved;
- pronotum, posterior margin: curved;
- pronotum, postero-lateral region: oblique orientation;
- pronotum, longitudinal pronotal groove: smooth;
- pronotum: width
  - width smaller than width of elytron with 9 striae
  - width of pronotum
- pronotum, interpunctural distance = 1.5, unit: diameter of ocellate setigerous cuticular puncture;
male organism, elytron with 9 striae
male organism, scutellar shield: concealed;
male organism, hind wing
- hind wing: atrophied;
- hind wing: length;*
male organism, anterior hypomeral depression: present;
male organism, procoxal cavity, width = 0.375, unit: width of pronotum;
male organism, mesometaventral sulcus
- mesometaventral sulcus, medial region: curved;
- mesometaventral sulcus, lateral region: straight;
male organism, mesoventrite, cuticular puncture: present;
male organism, metaventrite
- metaventrite, punctate cuticle
  - punctate cuticle, posterior region, cuticular puncture: diameter smaller than diameter of cuticular puncture of anterior region of punctate cuticle
  - punctate cuticle, interpunctural distance = 1, unit: diameter of cuticular puncture;
- metaventrite: convex;
male organism, abdomen with 7 sternites: present;
male organism, abdominal tergite VIII
- abdominal tergite VIII, anatomical margin coincident with cuticular groove of abdominal tergite VIII
- abdominal tergite VIII, cuticle with setigerous punctures, ocellate setigerous cuticular puncture: present;
- abdominal tergite VIII, posterior region: flattened;
- abdominal tergite VIII: convex;
male organism, protarsus with 5 protarsomeres: present;
male organism, mesotarsus with 5 mesotarsomeres: present;
male organism, metatarsus with 5 metatarsomeres: present;
male organism, protibia
- protibia, dorsal protibial cuticular tooth 1: present;
- protibia, dorsal protibial cuticular tooth 2: present;
- protibia, dorsal protibial cuticular tooth 3: present;
- protibia, antero-distal margin, cuticular seta
  - cuticular seta: multiple;
  - cuticular seta: increased thickness;
- protibia, postero-distal margin
  - postero-distal margin: dilated;
  - postero-distal margin: notched;
- protibia: curved;
- protibia, dorsal protibial cuticular tooth 4: absent;
male organism, mesotibia, distal region: dilated;
male organism, metatibia
- metatibia, distal region: dilated;
- metatibia, distal region: dilated;
male organism, profemur, ventral side: dilated;
male organism, metafemur
- metafemur, dorsal margin, cuticular carina: present;
- metafemur, ventral margin, proximal region, cuticular tubercle: present;
male organism, procoxa, ventral region, cuticular carina, cuticular tubercle: present;
male organism, parameres
- parameres, lateral side, distal region
  - distal region: obtuse;
  - distal region: tapered;
- parameres, proximal region, ventral region: notched;
- parameres: symmetrical;
- parameres: elongated;
male organism, left ventral conjunctiva of paramere: thickness similar in magnitude relative to thickness of right ventral conjunctiva of paramere of male organism
male organism, lamella copulatrix
- lamella copulatrix, distal region
  - distal region: flattened;
  - distal region: curved;
- lamella copulatrix: elongated;
male organism, fused axial and subaxial endophallites: present;
male organism: ovate;
male organism: flattened;
male organism, posterior longitudinal hypomeral carina: absent;
male organism, frontolateral peripheral endophallite: absent;
male organism, length = 4.0, unit: millimeter;

Notes

The asterisk (*) next to "hind wing: length;" denotes an incomplete conversion to NL; see the description of G. armiger for details.

Grebennikovius pafelo Montanaro, Grebennikov, Rossini, Grapputo, Ruzzier & Tarasov, 2024

Material

Holotype:

scientificName: Grebennikovius pafelo; taxonID: http://zoobank.org/6AA504F4-91BB-4EF9-AF8F-31EDA06AD5F9; family: Scarabaeidae; genus: Grebennikovius; specificEpithet: pafelo; scientificNameAuthorship: Montanaro, Grebennikov, Rossini, Grapputo, Ruzzier & Tarasov, 2024; country: Tanzania; locality: Uluguru Mountains, at Tchenzema village; verbatimElevation: 2429 m; verbatimLatitude: -7.121944; verbatimLongitude: 37.621944; samplingProtocol: litter sifting; eventDate: 11-07-10; habitat: forest; fieldNumber: sifting09; individualCount: 1; sex: male; lifeStage: adult; preparations: dry specimen; catalogNumber: http://id.luomus.fi/GAC.37245; identifiedBy: G. Montanaro; dateIdentified: 2023; institutionCode: MZH; basisOfRecord: PreservedSpecimen; occurrenceID: B5BA29D3-3E5A-5D76-95BB-A24310DBB962

Description

male organism, chitin-based cuticle: red brown;
male organism, ventral side: dark brown;
male organism, antenna: yellow brown;
male organism, gena: obtuse;
male organism, lateral clypeal tooth 1
- lateral clypeal tooth 1: upturned;
- lateral clypeal tooth 1: sharp;
male organism, head margin at genoclypeal sulcus: notched;
male organism, head capsule, frons, interpunctural distance = 1.0, unit: diameter of cuticular puncture;
male organism, vertex, posterior region, medial region: smooth;
male organism, antenna with 9 antennomeres, antennal club
- antennal club, flagellomere 5: present;
- antennal club, flagellomere 6: present;
- antennal club, flagellomere 7: present;
male organism, glossa: present;
male organism, epipharynx: present;
male organism, insect maxilla: present;
male organism, maxillary palpus with 4 palpomeres: present;
male organism, labial palpus with 3 palpomeres: present;
male organism, pronotum
- pronotum, antero-lateral region: flattened;
- pronotum, posterior region
  - posterior region, row of punctures, ocellate cuticular puncture
    - ocellate cuticular puncture: increased size;
    - ocellate cuticular puncture: ovate;
  - posterior region: sloped;
- pronotum, antero-lateral margin
  - antero-lateral margin: curved;
  - antero-lateral margin: obtuse;
- pronotum, posterolateral pronotal angle: curved;
- pronotum, posterior margin: curved;
- pronotum, postero-lateral region: parallel-sided;
- pronotum, longitudinal pronotal groove: smooth;
- pronotum: width
  - width smaller than width of elytron with 9 striae
  - width of pronotum
- pronotum, interpunctural distance = 1.5, unit: diameter of ocellate setigerous cuticular puncture;
male organism, elytron with 9 striae
male organism, elytral interstria 4, proximal region: concave;
male organism, elytral interstria 5
- elytral interstria 5, proximal region: concave;
- elytral interstria 5, cuticular tubercle: absent;
male organism, elytral interstria 6, anterior-most region: concave;
male organism, elytral interstria 8, anatomical side
- anatomical side lateral_to cuticular carina
- anatomical side: convex;
male organism, scutellar shield: concealed;
male organism, hind wing
- hind wing: atrophied;
- hind wing: length;*
male organism, anterior hypomeral depression: present;
male organism, procoxal cavity, width = 0.375, unit: width of pronotum;
male organism, mesometaventral sulcus
- mesometaventral sulcus, medial region: curved;
- mesometaventral sulcus, lateral region: straight;
male organism, mesoventrite, cuticular puncture: present;
male organism, metaventrite
- metaventrite, punctate cuticle
  - punctate cuticle, posterior region, cuticular puncture: diameter smaller than diameter of cuticular puncture of anterior region of punctate cuticle
  - punctate cuticle, interpunctural distance = 1, unit: diameter of cuticular puncture;
- metaventrite: convex;
male organism, abdomen with 7 sternites: present;
male organism, abdominal tergite VIII
- abdominal tergite VIII, anatomical margin coincident with cuticular groove of abdominal tergite VIII
- abdominal tergite VIII, cuticle with setigerous punctures, ocellate setigerous cuticular puncture: present;
- abdominal tergite VIII: convex;
male organism, protarsus with 5 protarsomeres: present;
male organism, mesotarsus with 5 mesotarsomeres: present;
male organism, metatarsus with 5 metatarsomeres: present;
male organism, protibia
- protibia, dorsal protibial cuticular tooth 1: present;
- protibia, dorsal protibial cuticular tooth 2: present;
- protibia, dorsal protibial cuticular tooth 3: present;
- protibia, antero-distal margin, cuticular seta
  - cuticular seta: multiple;
  - cuticular seta: increased thickness;
- protibia, postero-distal margin
  - postero-distal margin: dilated;
  - postero-distal margin: curved;
- protibia: curved;
- protibia, dorsal protibial cuticular tooth 4: absent;
male organism, mesotibia, distal region: dilated;
male organism, metatibia
- metatibia, distal region: dilated;
- metatibia, medial region: curved;
- metatibia, distal region: dilated;
- metatibia, dorsal margin, distal region: curved;
male organism, profemur, ventral side: dilated;
male organism, metafemur
- metafemur, dorsal margin, cuticular carina: present;
- metafemur, ventral margin, proximal region, cuticular tubercle: present;
- metafemur, anatomical region
  - anatomical region distal to cuticular tubercle
  - anatomical region: dilated;
male organism, procoxa, ventral region, cuticular carina, cuticular tubercle: present;
male organism, parameres
- parameres, lateral side
  - lateral side, distal region: tapered;
  - lateral side: curved;
- parameres, proximal region, ventral region: notched;
- parameres: symmetrical;
- parameres: elongated;
male organism, left ventral conjunctiva of paramere: thickness similar in magnitude relative to thickness of right ventral conjunctiva of paramere of male organism
male organism, lamella copulatrix
- lamella copulatrix, distal region
  - distal region: flattened;
  - distal region: angular;
- lamella copulatrix: elongated;
male organism, fused axial and subaxial endophallites: present;
male organism: ovate;
male organism: flattened;
male organism, posterior longitudinal hypomeral carina: absent;
male organism, frontolateral peripheral endophallite: absent;
male organism, length = 4.9, unit: millimeter;

Notes

The asterisk (*) next to "hind wing: length;" denotes an incomplete conversion to NL; see the description of G. armiger for details.

Analysis

In this work, we assessed the utility of the Phenoscript language for creating taxonomic descriptions of four Grebennikovius species using an individual-based approach. We first wrote the descriptions in Phenoscript code and then converted them to an ontology format (OWL) and to annotated NL text.

The ontology format represents a semantic description as a knowledge graph (i.e. the ABox composed of ontology individuals), in which nodes represent anatomical structures and their metadata (or characteristics) and edges represent the relationships between them. The nodes and edges come from pre-selected ontologies. To create the semantic descriptions for this study, we used twelve different ontologies (Table 1). To semantically describe the morphological diversity of Grebennikovius, we had to expand the AISM and COLAO ontologies by adding 152 entities for insect and beetle anatomy. The four species descriptions contained 756 ontology individuals in total, each representing an elementary unit of a phenotypic statement.
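To make the graph structure concrete, the following minimal Python sketch models a single statement of the kind found in the descriptions above ("male organism, protibia: curved") as a handful of triples. The individual names (`org_1`, `protibia_1`, `curved_1`) and the relation names `has_part` and `has_characteristic` are illustrative assumptions, not the identifiers used in the published OWL files.

```python
from collections import namedtuple

Triple = namedtuple("Triple", ["subject", "predicate", "obj"])

# ABox sketch: every individual is typed by an ontology class (a node),
# and relations link individuals to one another (the edges).
# All identifiers below are hypothetical placeholders.
abox = [
    Triple("org_1", "rdf:type", "male organism"),
    Triple("protibia_1", "rdf:type", "protibia"),
    Triple("curved_1", "rdf:type", "curved"),
    Triple("org_1", "has_part", "protibia_1"),
    Triple("protibia_1", "has_characteristic", "curved_1"),
]


def individuals(triples):
    """Return the set of individuals, i.e. subjects that carry an rdf:type."""
    return {t.subject for t in triples if t.predicate == "rdf:type"}


def edges(triples):
    """Return the relations between individuals (everything except typing)."""
    return [t for t in triples if t.predicate != "rdf:type"]


print(len(individuals(abox)), "individuals,", len(edges(abox)), "edges")
```

In this toy graph, one short phenotypic statement already requires three individuals and two edges, which gives a sense of how four full descriptions can add up to several hundred individuals.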

While the ontology format is not easily understandable for humans, it is essential for making the descriptions semantically queryable. Using simple queries, we demonstrated the practical application of the semantic approach. With these queries, we obtained the number of individuals for the selected ontology classes (Table 2). The results reveal that most of the phenotypic terms used in our descriptions are related to the shapes of anatomical structures and are mainly located in the insect thorax.
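The kind of counting query described here can be sketched in plain Python as follows. This is only an analogue of a semantic query, not the queries used in the study: the subclass hierarchy, the individual names and their types are invented for illustration, and a real query would run against the OWL files with a reasoner or query engine.

```python
# Hypothetical subclass hierarchy (child -> parent) and typed individuals.
SUBCLASS_OF = {
    "curved": "shape",
    "convex": "shape",
    "dilated": "shape",
    "shape": "quality",
    "length": "quality",
}

INDIVIDUAL_TYPES = {
    "q1": "curved",
    "q2": "convex",
    "q3": "dilated",
    "q4": "length",
}


def ancestors(cls):
    """Yield cls and every superclass reachable through SUBCLASS_OF."""
    while cls is not None:
        yield cls
        cls = SUBCLASS_OF.get(cls)


def count_individuals(target_class):
    """Count individuals whose asserted type is target_class or a subclass."""
    return sum(
        1 for cls in INDIVIDUAL_TYPES.values() if target_class in ancestors(cls)
    )


print(count_individuals("shape"))    # 3
print(count_individuals("quality"))  # 4
```

Counting through the class hierarchy, rather than by exact label, is what allows a single query for a general class such as "shape" to pick up every more specific term used in the descriptions.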

In contrast to the ontology format, the generated description in NL format was included in this publication to facilitate human-friendly reading. The NL format annotates all phenotypic terms with hyperlinks, allowing the reader to access the term's definition, properties and relationships directly through a web browser.

Generally, the algorithm for converting OWL to NL in Phenospy worked well, since most of the statements are easily readable. However, the algorithm could not properly convert four semantic statements (one for each species) dealing with relative comparisons between species into NL. In the taxon treatments above, these statements are indicated by an asterisk "*" and discussed in the "Notes" section therein.
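As a rough illustration of what such a conversion involves, the sketch below renders a path of entity labels plus a characteristic or measurement in the comma-and-colon style of the NL descriptions above. It is not the Phenospy algorithm; the function name `render_statement` and its parameters are assumptions made for this example only.

```python
def render_statement(entity_path, quality=None, measurement=None):
    """Render one phenotypic statement in the comma/colon style used above.

    entity_path: labels from the organism down to the described structure.
    quality:     label of a qualitative characteristic, e.g. "curved".
    measurement: optional (property, value, unit) tuple.
    """
    text = ", ".join(entity_path)
    if measurement is not None:
        prop, value, unit = measurement
        return f"{text}, {prop} = {value}, unit: {unit};"
    if quality is not None:
        return f"{text}: {quality};"
    return text


print(render_statement(["male organism", "protibia"], quality="curved"))
print(render_statement(["male organism"], measurement=("length", 4.9, "millimeter")))
```

Simple entity-quality and measurement statements fit this template directly. A relative comparison involves a second entity and possibly a second organism, which a flat template of this kind does not cover.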

Processing semantic descriptions involves several steps and requires the use of a variety of software programmes. To streamline this process, we created an openly available computational pipeline using the makefile tool (Fig. 1) (Supplementary Material).

We also created four nanopublications (see the section "Nanopublications") using the nanodash tool, accessible via the Biodiversity Data Journal (BDJ) portal. With this service, nanopublications can easily be created and integrated with BDJ. Our nanopublications specify that each Grebennikovius species inhabits a forest environment.

Discussion

The exploration of ontology-based technologies in our study highlights their significant potential for modelling computable phenotypes and species descriptions, effectively integrating taxonomy into the domain of phenomics (Deans et al. 2012, Deans et al. 2015, Edmunds et al. 2015, Mikó et al. 2021). This integration not only advances the field of taxonomy, but also enhances the interoperability of phenotypic data across various biological disciplines. Recent years have seen considerable progress in applying ontologies to phenotypes, including the crafting of biological ontologies (Yoder et al. 2010, Mungall et al. 2012, Girón et al. 2023b), the development of methods for annotating character matrices, the extraction of presence/absence data (Dececchi et al. 2015) and phenotype-to-genotype prediction (Edmunds et al. 2015, Shefchek et al. 2019), marking a pivotal change in the field.

Phenoscript and computable descriptions

Our paper assessed the utility of Phenoscript, an emerging language for semantic descriptions, together with its associated tools for producing semantic data using an individual-based approach. Our results demonstrated the effectiveness of Phenoscript in creating semantic descriptions, thanks to a syntax that is similar to NL expressions. The syntax was also improved over the previous version of the language (Mikó et al. 2021). The general concept of Phenoscript makes it a versatile tool, extendable beyond applications in biosystematics to ecological and other domains. Phenoscript allows taxonomists to bypass traditional NL descriptions and, instead, create semantic ones. These descriptions can then be converted into NL text for publication and into an ontology format for further analysis and dissemination.

The proposed computational pipeline automates the production of semantic descriptions and can be applied to any other taxon using a desktop computer. Due to our focus on dung beetles, the developed approach can be specifically applied to them with ease, enabling further semantic-based research in this group. Thus, the proposed semantic approach opens up possibilities for new types of publications, where taxonomists can semantically re-describe known species in order to unlock their traits for other cross-disciplinary research within biology.

Challenges and Future Prospects

Semantic descriptions, however, are currently slower to write than traditional NL ones for a number of reasons. A shortage of comprehensive educational resources and a relatively small community create a high initial barrier to learning semantic methods in evolutionary biology (Deans et al. 2015).

At present, composing semantic descriptions involves the addition of many new terms to ontologies, due to ontology incompleteness. In this study, we had to add 152 terms to AISM and COLAO to cover all the necessary morphological terminology for complete species descriptions. We expect this burden to diminish as usage of ontologies increases and they become saturated with terms.

Significant time is spent thinking about how to code particular traits semantically. Despite the establishment of the necessary protocols (Balhoff et al. 2010, Dahdul et al. 2010, Dahdul et al. 2018), we still require new logical models for certain types of traits. For example, the species descriptions in this study were based on single specimens, which worked well since the species have limited variation. This approach will not be optimal for many other species with at least moderate morphological variation. Thus, new data models and solutions are needed to model intraspecific variation efficiently.

To demonstrate the queryability of the descriptions, we conducted semantic queries as a proof of concept. Currently, such queries can be used to retrieve phenotypic information semi-manually for analysis and comparison across species. The process involves creating a query targeting specific traits and applying it to a set of semantic phenotypes. However, automatic comparison, such as identifying common or different traits between species, is not feasible with current methods and remains a topic for future research.

The conversion of semantic descriptions to natural language is not trivial. Our study encountered difficulties in accurately translating certain character patterns, underscoring the need for improved methods in this area. It is also essential to develop new methods for post-processing and analysing semantic descriptions. Particularly in taxonomy, innovative approaches for species diagnosis and comparison would be highly beneficial.

Nanopublications and Semantic Phenotypes

A nanopublication is a concept in scientific data management that is particularly relevant in the context of big data and FAIR principles (Kuhn et al. 2013, Kuhn et al. 2018). It represents a minimal unit of publishable information that can be used to describe anything, such as "species X feeds on species Y". Technologically, a nanopublication is a small knowledge (RDF) graph (Kuhn et al. 2021) that is similar to the semantic phenotypes produced in the present study.
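The sketch below illustrates the general layout of a nanopublication as a set of small named graphs: a head that links the parts, an assertion, its provenance and the publication information. The `np:`, `prov:` and `dcterms:` terms follow the commonly used nanopublication, PROV and Dublin Core vocabularies; every `ex:` identifier, the author and the date are hypothetical placeholders, and the structure shown is a simplified Python stand-in, not the RDF produced by nanodash.

```python
# A nanopublication sketched as four named graphs of (s, p, o) triples.
# All ex: identifiers are hypothetical placeholders for illustration.
nanopub = {
    "head": [
        ("np:this", "np:hasAssertion", "np:assertion"),
        ("np:this", "np:hasProvenance", "np:provenance"),
        ("np:this", "np:hasPublicationInfo", "np:pubinfo"),
    ],
    # The minimal unit of publishable information.
    "assertion": [
        ("ex:taxon_X", "ex:has_habitat", "ex:forest_ecosystem"),
    ],
    # Where the assertion comes from.
    "provenance": [
        ("np:assertion", "prov:wasAttributedTo", "ex:author_1"),
    ],
    # Metadata about the nanopublication itself.
    "pubinfo": [
        ("np:this", "dcterms:created", "2023-12-19"),
    ],
}


def assertion_statements(pub):
    """Return the assertion triples as space-separated strings."""
    return [f"{s} {p} {o}" for s, p, o in pub["assertion"]]


print(assertion_statements(nanopub))
```

Because the assertion is itself a set of triples, it has the same basic shape as the semantic phenotypes, which is the technological similarity noted above.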

Although the concept of nanopublications is still emerging, it promises to revolutionise information sharing and analysis (Kuhn and Dumontier 2017). Creating a nanopublication involves generating an RDF graph through a specialised service, such as nanodash, as used in this study. Once created, the nanopublication is immediately accessible to the public, facilitating its use in big data analysis.

Currently, nanopublications created using the nanodash service are not subject to peer review. Thus, authors are encouraged to take full responsibility for the content. However, the nanodash service does distinguish peer-reviewed from non-peer-reviewed nanopublications. Moreover, its integration with BDJ facilitates the direct integration of nanopublications into conventional academic publications, where they do undergo the peer-review process.

As semantic phenotypes and nanopublications have technological similarities and both aim to be computable and FAIR, integrating them into one framework would be beneficial. In our research, we generated both semantic phenotypes and nanopublications. At the moment, these datasets are not integrated: they are stored separately rather than in a unified semantic graph or triple store. Because of this separation, we are unable to simultaneously query species traits and the data from nanopublications. Therefore, additional methodological advancements are needed to achieve effective integration. Data from this study can be used to explore and develop new methods for such an integration.
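What a unified store would make possible can be sketched with toy data. In the example below, phenotype triples and nanopublication triples are merged into one set so that a single query can combine a trait with a habitat. The taxa, the `ex:` predicates and the flattening of a semantic phenotype into one `ex:has_trait` triple are all simplifying assumptions for illustration; they do not reflect the data model or the results of this study.

```python
# Toy data only: hypothetical taxa and predicates.
phenotype_triples = {
    ("ex:taxon_X", "ex:has_trait", "hind wing: atrophied"),
    ("ex:taxon_Y", "ex:has_trait", "hind wing: atrophied"),
}
nanopub_triples = {
    ("ex:taxon_X", "ex:has_habitat", "ex:forest_ecosystem"),
}


def subjects(store, predicate, obj):
    """Return all subjects that have the given predicate and object."""
    return {s for s, p, o in store if p == predicate and o == obj}


# Merging both sources into one store enables a joint query.
unified = phenotype_triples | nanopub_triples

forest_taxa_with_atrophied_wings = (
    subjects(unified, "ex:has_trait", "hind wing: atrophied")
    & subjects(unified, "ex:has_habitat", "ex:forest_ecosystem")
)
print(sorted(forest_taxa_with_atrophied_wings))  # ['ex:taxon_X']
```

With the two sources kept apart, the same question requires two separate lookups and a manual join on the taxon identifier, which in turn presupposes that both sources identify taxa in the same way.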

Conclusion

Semantic phenotypes offer a significant improvement in the generation, analysis and sharing of taxonomic data, marking a substantial move towards FAIR phenotypes and computable information. To fully integrate semantic phenotypes and descriptions into standard practices in taxonomy and biology, further advancements in computational methods are needed, along with the development of platforms for managing semantic phenotypes and active engagement from the scientific community.

Acknowledgements

We express our gratitude to Tobias Kuhn (Vrije Universiteit Amsterdam, Amsterdam, The Netherlands) and Lyubomir Penev (Pensoft Publishers, Sofia, Bulgaria) for their support and help with the creation of nanopublications, and to István Mikó (University of New Hampshire, New Hampshire) for valuable comments on the definition of terms from ontologies. GM thanks all the people at the Finnish Museum of Natural History (Helsinki, Finland) and at the University of Padova (Padova, Italy) who helped him with his MSc thesis, of which this article is partially a continuation. We also thank the Editor and the Reviewer for their constructive comments on the draft.

References

Balhoff J, Dahdul W, Kothari C, Lapp H, Lundberg J, Mabee P, Midford P, Westerfield M, Vision T (2010) Phenex: Ontological annotation of phenotypic diversity. Nature Precedings. https://doi.org/10.1038/npre.2010.4636.1

Balhoff J, Mikó I, Yoder M, Mullins P, Deans A (2013) A semantic model for species description applied to the ensign wasps (Hymenoptera: Evaniidae) of New Caledonia. Systematic Biology: 639-659. https://doi.org/10.1093/sysbio/syt028

Balthasar V (1963) Monographie der Scarabaeidae und Aphodiidae der palaearktischen und orientalischen Region. Coleoptera: Lamellicornia. Band 2. Coprinae (Onitini, Oniticellini, Onthophagini). Tschechoslowakischen Akademie der Wissenschaften, Prague, 627 pp.

Barbero E, Palestrini C, Roggero A (2003) Revision of the genus Phalops Erichson, 1848 (Coleoptera: Scarabaeidae: Onthophagini). Monografie XXXVIII. Museo Regionale di Scienze Naturali, 378 pp.

Cristóvão JP, Vaz-de-Mello FZ (2021) The terminalia of the superfamily Scarabaeoidea (Coleoptera): specific glossary, dissecting methodology, techniques and previously unrecorded sexual dimorphism in some difficult groups. Zoological Journal of the Linnean Society 191: 1001-1043. https://doi.org/10.1093/zoolinnean/zlaa079

Dahdul W, Manda P, Cui H, Balhoff JP, Dececchi TA, Ibrahim N, Lapp H, Vision T, Mabee PM (2018) Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems. Database 2018. https://doi.org/10.1093/database/bay110

Dahdul WM, Balhoff JP, Engeman J, Grande T, Hilton EJ, Kothari C, Lapp H, Lundberg JG, Midford PE, Vision TJ, Westerfield M, Mabee PM (2010) Evolutionary characters, phenotypes and ontologies: curating data from the systematic biology literature. PLOS One: e10708. https://doi.org/10.1371/journal.pone.0010708

Davis AL, Frolov AV, Scholtz CH (2008) The African dung beetle genera. Protea Book House. https://doi.org/10.1649/0010-065X-64.4.394

Deans A, Yoder M, Balhoff J (2012) Time to change how we describe biodiversity. Trends in Ecology & Evolution. https://doi.org/10.1016/j.tree.2011.11.007

Deans A, Lewis S, Huala E, Anzaldo S, Ashburner M, Balhoff J, Blackburn D, Blake J, Burleigh JG, Chanet B, Cooper L, Courtot M, Csösz S, Cui H, Dahdul W, Das S, Dececchi TA, Dettai A, Diogo R, Druzinsky R, Dumontier M, Franz N, Friedrich F, Gkoutos G, Haendel M, Harmon L, Hayamizu T, He Y, Hines H, Ibrahim N, Jackson L, Jaiswal P, James-Zorn C, Köhler S, Lecointre G, Lapp H, Lawrence C, Le Novère N, Lundberg J, Macklin J, Mast A, Midford P, Mikó I, Mungall C, Oellrich A, Osumi-Sutherland D, Parkinson H, Ramírez M, Richter S, Robinson P, Ruttenberg A, Schulz K, Segerdell E, Seltmann K, Sharkey M, Smith A, Smith B, Specht C, Squires RB, Thacker R, Thessen A, Fernandez-Triana J, Vihinen M, Vize P, Vogt L, Wall C, Walls R, Westerfeld M, Wharton R, Wirkner C, Woolley J, Yoder M, Zorn A, Mabee P (2015) Finding our way through phenotypes. PLOS Biology. https://doi.org/10.1371/journal.pbio.1002033

Dececchi TA, Balhoff J, Lapp H, Mabee P (2015) Toward synthesizing our knowledge of morphology: Using ontologies and machine reasoning to extract presence/absence evolutionary phenotypes across studies. Systematic Biology: 936-952. https://doi.org/10.1093/sysbio/syv031

d'Orbigny H (1913) Synopsis des Onthophagides d’Afrique. Annales de la Société Entomologique de France: -608.

Edmunds R, Su B, Balhoff J, Eames BF, Dahdul W, Lapp H, Lundberg J, Vision T, Dunham R, Mabee P, Westerfield M (2015) Phenoscape: Identifying candidate genes for evolutionary phenotypes. Molecular Biology and Evolution. https://doi.org/10.1093/molbev/msv223

Génier F (2019) Endophallites: a proposed neologism for naming the sclerotized elements of the insect endophallus (Arthropoda: Insecta). Annales de la Société entomologique de France (NS): 482-484. https://doi.org/10.1080/00379271.2019.1685907

Girón JC, Lumen R, Montanaro G, Tarasov S (2023a) The Coleoptera Anatomy Ontology (COLAO). Zenodo. Release date: 2024-2-14. URL: https://zenodo.org/records/10659136

Girón JC, Tarasov S, González Montaña LA, Matentzoglu N, Smith AD, Koch M, Boudinot BE, Bouchard P, Burks R, Vogt L, Yoder M, Osumi-Sutherland D, Friedrich F, Beutel RG, Mikó I (2023b) Formalizing invertebrate morphological data: A descriptive model for cuticle-based skeleto-muscular systems, an ontology for insect anatomy, and their potential applications in biodiversity research and informatics. Systematic Biology: 1084-1100. https://doi.org/10.1093/sysbio/syad025

Girón JC, Mikó I, Gonzalez-Montaña LA, Montanaro G, Tarasov S, Matentzoglu N (2024) Ontology for the Anatomy of the Insect SkeletoMuscular system (AISM). Github. Release date: 2024-1-31. URL: https://github.com/insect-morphology/aism

Groth P, Gibson A, Velterop J (2010) The anatomy of a nanopublication. Information Services & Use. https://doi.org/10.3233/isu-2010-0613

Jackson R, Balhoff J, Douglass E, Harris N, Mungall C, Overton J (2019) ROBOT: A tool for automating ontology workflows. BMC Bioinformatics. https://doi.org/10.1186/s12859-019-3002-3

Kuhn T, Barbano PE, Nagy ML, Krauthammer M (2013) Broadening the scope of nanopublications. The Semantic Web: Semantics and Big Data: 487-501. https://doi.org/10.1007/978-3-642-38288-8_33

Kuhn T, Dumontier M (2017) Genuine semantic publishing. Data Science: 139-154. https://doi.org/10.3233/ds-170010

Kuhn T, Banda J, Willighagen E, Ehrhart F, Evelo C, Malas T, Dumontier M, Merono-Penuela A, Malic A, Poelen J, Hurlbert A, Centeno Ortiz E, Furlong L, Queralt-Rosinach N, Chichester C (2018) Nanopublications: A growing resource of provenance-centric scientific linked data. 2018 IEEE 14th International Conference on e-Science (e-Science). https://doi.org/10.1109/escience.2018.00024

Kuhn T, Taelman R, Emonet V, Antonatos H, Soiland-Reyes S, Dumontier M (2021) Semantic micro-contributions with decentralized nanopublication services. PeerJ Computer Science: e387. https://doi.org/10.7717/peerj-cs.387

Lawrence J, Ślipiński A (2013) Australian beetles. Volume 1: Morphology, classification and keys. CSIRO Publishing. https://doi.org/10.1071/9780643097292

Mikó I, Masner L, Ulmer J, Raymond M, Hobbie J, Tarasov S, Margaría CB, Seltmann K, Talamas E (2021) A semantically enriched taxonomic revision of Gryonoides Dodd, 1920 (Hymenoptera, Scelionidae), with a review of the hosts of Teleasinae. Journal of Hymenoptera Research: 523-573. https://doi.org/10.3897/jhr.87.72931

Montanaro G, Grebennikov VV, Rossini M, Grapputo A, Ruzzier E, Tarasov S (2024) Microallopatric speciation in the relict dung beetle genus Grebennikovius (Coleoptera: Scarabaeidae) in the Eastern Arc Mountains. Insect Systematics and Diversity. https://doi.org/10.1093/isd/ixae004

Mullins P, Kawada R, Balhoff J, Deans A (2012) A revision of Evaniscus (Hymenoptera, Evaniidae) using ontology-based semantic phenotype annotation. ZooKeys 223. https://doi.org/10.3897/zookeys.223.3572

Mungall CJ, Torniai C, Gkoutos GV, Lewis SE, Haendel MA (2012) Uberon, an integrative multi-species anatomy ontology. Genome Biology. https://doi.org/10.1186/gb-2012-13-1-r5

Shefchek KA, Harris NL, Gargano M, Matentzoglu N, Unni D, Brush M, Keith D, Conlin T, Vasilevsky N, Zhang XA, Balhoff JP, Babb L, Bello SM, Blau H, Bradford Y, Carbon S, Carmody L, Chan LE, Cipriani V, Cuzick A, Della Rocca M, Dunn N, Essaid S, Fey P, Grove C, Gourdine J, Hamosh A, Harris M, Helbig I, Hoatlin M, Joachimiak M, Jupp S, Lett KB, Lewis SE, McNamara C, Pendlington ZM, Pilgrim C, Putman T, Ravanmehr V, Reese J, Riggs E, Robb S, Roncaglia P, Seager J, Segerdell E, Similuk M, Storm AL, Thaxon C, Thessen A, Jacobsen JOB, McMurry JA, Groza T, Köhler S, Smedley D, Robinson PN, Mungall CJ, Haendel MA, Munoz-Torres MC, Osumi-Sutherland D (2019) The Monarch Initiative in 2019: an integrative data and analytic platform connecting phenotypes to genotypes across species. Nucleic Acids Research. https://doi.org/10.1093/nar/gkz997

Vogt L (2021) FAIR data representation in times of eScience: a comparison of instance-based and class-based semantic representations of empirical data using phenotype descriptions as example. Journal of Biomedical Semantics. https://doi.org/10.1186/s13326-021-00254-0

Washington N, Haendel M, Mungall C, Ashburner M, Westerfield M, Lewis S (2009) Linking human diseases to animal models using ontology-based phenotype annotation. PLOS Biology. https://doi.org/10.1371/journal.pbio.1000247

Wilkinson M, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, Blomberg N, Boiten J, da Silva Santos LB, Bourne P, Bouwman J, Brookes A, Clark T, Crosas M, Dillo I, Dumon O, Edmunds S, Evelo C, Finkers R, Gonzalez-Beltran A, Gray AG, Groth P, Goble C, Grethe J, Heringa J, ’t Hoen PC, Hooft R, Kuhn T, Kok R, Kok J, Lusher S, Martone M, Mons A, Packer A, Persson B, Rocca-Serra P, Roos M, van Schaik R, Sansone S, Schultes E, Sengstag T, Slater T, Strawn G, Swertz M, Thompson M, van der Lei J, van Mulligen E, Velterop J, Waagmeester A, Wittenburg P, Wolstencroft K, Zhao J, Mons B (2016) The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data. https://doi.org/10.1038/sdata.2016.18

Yoder M, Mikó I, Seltmann K, Bertone M, Deans A (2010) A gross anatomy ontology for Hymenoptera. PLOS One. https://doi.org/10.1371/journal.pone.0015991

Ziani S, Abdel-Dayem MS, Aldhafer HM, Barbero E (2019) An overview of the Onthophagini from the Arabian Peninsula (Coleoptera: Scarabaeoidea: Scarabaeidae). Zootaxa 4658. https://doi.org/10.11646/zootaxa.4658.1.1

Supplementary material

Nanopublications

Nanopublication	Creator	Date
883D3610-BE9D-4818-AE90-7D729A205190 - has habitat - forest ecosystem. association is an association of an organism taxon to an environment. association refers to taxon. association links the taxon to the environment forest ecosystem. association refers to the relation (between taxon and environment) has habitat. taxon has the name 883D3610-BE9D-4818-AE90-7D729A205190.	0000-0003-0836-1364	21-12-2023 16:41:03
Grebennikovius basilewskyi (Balthasar, 1960) (species) - has habitat - forest ecosystem. association is an association of an organism taxon to an environment. association refers to taxon. association links the taxon to the environment forest ecosystem. association refers to the relation (between taxon and environment) has habitat. taxon has the name Grebennikovius basilewskyi (Balthasar, 1960) (species).	Sergei Tarasov	19-12-2023 17:38:43
1d32a7f2-0376-436b-976f-54c2eeec430c - has habitat - forest ecosystem. association is an association of an organism taxon to an environment. association refers to taxon. association links the taxon to the environment forest ecosystem. association refers to the relation (between taxon and environment) has habitat. taxon has the name 1d32a7f2-0376-436b-976f-54c2eeec430c.	0000-0003-0836-1364	22-12-2023 14:22:05
6aa504f4-91bb-4ef9-af8f-31eda06ad5f9 - has habitat - forest ecosystem. association is an association of an organism taxon to an environment. association refers to taxon. association links the taxon to the environment forest ecosystem. association refers to the relation (between taxon and environment) has habitat. taxon has the name 6aa504f4-91bb-4ef9-af8f-31eda06ad5f9.	0000-0003-0836-1364	22-12-2023 14:24:23

Endnotes