Biodiversity Data Journal :
General research article
|
Corresponding author:
Academic editor: Ross Mounce
Received: 06 Aug 2015 | Accepted: 23 Sep 2015 | Published: 28 Sep 2015
© 2015 Yde de Jong, Juliana Kouwenberg, Louis Boumans, Charles Hussey, Roger Hyam, Nicola Nicolson, Paul Kirk, Alan Paton, Ellinor Michel, Michael Guiry, Phillip Boegh, Henrik Pedersen, Henrik Enghoff, Eckhard von Raab-Straube, Anton Güntsch, Marc Geoffroy, Andreas Müller, Andreas Kohlbecker, Walter Berendsohn, Ward Appeltans, Christos Arvanitidis, Bart Vanhoorne, Joram Declerck, Leen Vandepitte, Francisco Hernandez, Róisín Nash, Mark Costello, David Ouvrard, Pascale Bezard-Falgas, Thierry Bourgoin, Florian Wetzel, Falko Glöckler, Günther Korb, Caroline Ring, Gregor Hagedorn, Christoph Häuser, Nihat Aktaç, Ahmet Asan, Adorian Ardelean, Paulo Borges, Dhimiter Dhora, Hasmik Khachatryan, Michael Malicky, Shaig Ibrahimov, Alexander Tuzikov, Aaike De Wever, Snejana Moncheva, Nikolai Spassov, Karel Chobot, Alexi Popov, Igor Boršić, Spyros Sfenthourakis, Urmas Kõljalg, Pertti Uotila, Gargominy Olivier, Jean-Claude Dauvin, David Tarkhnishvili, Giorgi Chaladze, Michael Tuerkay, Anastasios Legakis, László Peregovits, Gudmundur Gudmundsson, Erling Ólafsson, Liam Lysaght, Bella Galil, Francesco Raimondo, Gianniantonio Domina, Fabio Stoch, Alessandro Minelli, Voldermars Spungis, Eduardas Budrys, Sergej Olenin, Armand Turpel, Tania Walisch, Vladimir Krpach, Marie Gambin, Laurentia Ungureanu, Gordan Karaman, Roy Kleukers, Elisabeth Stur, Kaare Aagaard, Nils Valland, Toril Moen, Wieslaw Bogdanowicz, Piotr Tykarski, Jan Węsławski, Monika Kędra, Antonio M. de Frias Martins, António Abreu, Ricardo Silva, Sergei Medvedev, Alexander Ryss, Smiljka Šimić, Karol Marhold, Eduard Stloukal, Davorin Tome, Marian Ramos, Benito Valdés, Francisco Pina, Sven Kullander, Anders Telenius, Yves Gonseth, Pascal Tschudin, Oleksandra Sergeyeva, Volodymyr Vladymyrov, Volodymyr Rizun, Chris Raper, Dan Lear, Pavel Stoev, Lyubomir Penev, Ana Rubio, Thierry Backeljau, Hannu Saarenmaa, Sandrine Ulenberg
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
de Jong Y, Kouwenberg J, Boumans L, Hussey C, Hyam R, Nicolson N, Kirk P, Paton A, Michel E, Guiry MD, Boegh PS, Pedersen HÆ, Enghoff H, von Raab-Straube E, Güntsch A, Geoffroy M, Müller A, Kohlbecker A, Berendsohn W, Appeltans W, Arvanitidis C, Vanhoorne B, Declerck J, Vandepitte L, Hernandez F, Nash R, Costello MJ, Ouvrard D, Bezard-Falgas P, Bourgoin T, Wetzel FT, Glöckler F, Korb G, Ring C, Hagedorn G, Häuser C, Aktaç N, Asan A, Ardelean A, Borges PAV, Dhora D, Khachatryan H, Malicky M, Ibrahimov S, Tuzikov A, De Wever A, Moncheva S, Spassov N, Chobot K, Popov A, Boršić I, Sfenthourakis S, Kõljalg U, Uotila P, Gargominy O, Dauvin J-C, Tarkhnishvili D, Chaladze G, Tuerkay M, Legakis A, Peregovits L, Gudmundsson G, Ólafsson E, Lysaght L, Galil BS, Raimondo FM, Domina G, Stoch F, Minelli A, Spungis V, Budrys E, Olenin S, Turpel A, Walisch T, Krpach V, Gambin MT, Ungureanu L, Karaman G, Kleukers RM.J.C, Stur E, Aagaard K, Valland N, Moen TL, Bogdanowicz W, Tykarski P, Węsławski JM, Kędra M, M. de Frias Martins A, Abreu AD, Silva R, Medvedev S, Ryss A, Šimić S, Marhold K, Stloukal E, Tome D, Ramos MA, Valdés B, Pina F, Kullander S, Telenius A, Gonseth Y, Tschudin P, Sergeyeva O, Vladymyrov V, Rizun VB, Raper C, Lear D, Stoev P, Penev L, Rubio AC, Backeljau T, Saarenmaa H, Ulenberg S (2015) PESI - a taxonomic backbone for Europe. Biodiversity Data Journal 3: e5848. https://doi.org/10.3897/BDJ.3.e5848
|
Reliable taxonomy underpins communication in all of biology, not least nature conservation and sustainable use of ecosystem resources. The flexibility of taxonomic interpretations, however, presents a serious challenge for end-users of taxonomic concepts. Users need standardised and continuously harmonised taxonomic reference systems, as well as high-quality and complete taxonomic data sets, but these are generally lacking for non-specialists. The solution is in dynamic, expertly curated web-based taxonomic tools.
The Pan-European Species-directories Infrastructure (PESI) worked to solve this key issue by providing a taxonomic e-infrastructure for Europe. It strengthened the relevant social (expertise) and information (standards, data and technical) capacities of five major community networks on taxonomic indexing in Europe, which is essential for proper biodiversity assessment and monitoring activities. The key objectives of PESI were: 1) standardisation in taxonomic reference systems, 2) enhancement of the quality and completeness of taxonomic data sets and 3) creation of integrated access to taxonomic information.
This paper describes the results of PESI and its future prospects, including the involvement in major European biodiversity informatics initiatives and programs.
PESI, Euro+Med PlantBase, Fauna Europaea, European Register of Marine Species, Index Fungorum, International Plant Names Index (IPNI), AlgaeBase, ZooBank, European taxonomic backbone, EU-nomen, Taxonomic indexing, Authority File, Taxonomy, Nomenclature, Global Names Architecture, INSPIRE, LifeWatch, EUBON
The Pan-European Species-directories Infrastructure (PESI) provides a mechanism to deliver an integrated, annotated checklist of the species occurring in 'geographic Europe', aiming to cover the Western Paleartic biogeographic region. The PESI checklist (also called EU-nomen) serves as a taxonomic standard and backbone for Europe. At the core of EU-nomen are five community networks, with common nomenclatures or systems designations: Zoology, Botany, Marine Biota, Mycology and Phycology. These five community networks are integrated in five infrastructural components (Fig.
Five community networks (horizontal) are integrated in five categories of coordination effort (vertical) in PESI. Community networks represent the FP4 and FP5 key programs on European taxonomic indexing: Fauna Europaea, ERMS, Euro+Med PlantBase, supplemented by Index Fungorum and AlgaeBase.
PESI merges data from multiple sources and publishes it online. This requires a mapping between the different schemas used by the different data sources and/or an implementation of standards within those data sources. A pragmatic approach was taken for the main databases. A bespoke procedure was developed at the Botanic Garden and Botanical Museum Berlin-Dahlem (BGBM) using the EDIT Common Data Model-based Platform for Cybertaxonomy to merge Euro+Med PlantBase and Fauna Europaea. These data are merged with ERMS and additional data provided by these checklists (such as distribution details) in the PESI Data Warehouse. This procedure is repeated periodically to keep the data warehouse synchronised with the source databases. The Flanders Marine Institute (VLIZ), that manages the portal, adds further information (such as images, vernacular names and priority status) and gathers information from other portals, such as the World Register of Marine Species.
In order for the data sets to be merged in this way they need to share common vocabularies for some fields – these include: taxon status, nomenclatural status and occurrence status. PESI provides species lists based on geographic regions and European legislation (e.g. the Habitat Directive, Birds Directive, CITES and IUCN (conservation status) directives). The resulting dataset includes a total of nearly 450,000 scientific names (which include 240,000 valid species and infraspecific names) and 190,800 vernacular names in 117 languages. In the PESI portal, each species has its own result page including a map of its geographical distribution, and many have deep links to additional data (photos, videos, literature, global prioritised species databases). The portal further includes: a Taxon Search Tool, a Taxon Match Tool (for users to match their species names against the names in PESI to check spelling, synonymy and classificaiton), a metadata database on taxonomic resources and expertise (provided by focal point partners all over Europe and adjacent countries), Web Services and links to existing nomenclators. The Project website provides information about the PESI project and the available communication tools.
PESI builds upon previous European taxonomic projects (like EDIT, ENBI and EuroCAT) and strengthens the taxonomic networks to ensure that data are updated and that relevant social networks and research communities are maintained and developed. PESI contributes to building the capacity necessary to support a growing number of international initiatives such as the Global Biodiversity Information Facility (GBIF), LifeWatch (LW), the Catalogue of Life (CoL), the Encyclopedia of Life (EoL), the Global Names Architecture (GNA), GÉANT, and Research Data Alliance (RDA). The project also provides key contributions to relevant ensuing EC-FP7 projects like ViBRANT (on cybertaxonomy), OpenUp! (opening up the natural history heritage for Europeana), BioVeL (on developing virtual labs for biodiversity research), iMarine (on developing an e-Infrastructure for fisheries management and marine conservation), EU BON (on building the European Biodiversity Observation Network, as a European contribution to GEO BON) and BiodiversityKnowledge (on developing a recommended design for a scientific biodiversity Network of Knowledge). At the same time, it provides a robust infrastructure to support the nomenclatural needs of European users and stakeholders on biodiversity management and research. PESI has specifically addressed the issues of pooling resources, expertise networks, standardisation, sustainability, accessibility and international cooperation (
PESI was initiated during the EC-FP6 European Distributed Institute of Taxonomy Network (EDIT) and Marine Biodiversity and Ecosystem Functioning (MarBEF) EU Networks of Excellence and funded as a project by the European Commission, contributing to the EC-FP7-Infrastructures 'Capacities' subprogram under the Coordination and Support Actions (CSA) funding scheme (RI-223806), starting in May 2008 for a period of three years. PESI is formally accepted as an INSPIRE standard for Europe and adopted in the workplans of various major EU initiatives on biodiversity research, including LifeWatch and EU BON, as the taxonomic backbone for Europe.
The strengthening and integration of European taxonomic communities has been progressing since the start of the taxonomic indexing EU framework programmes Fauna Europaea (
PESI makes a significant addition to the species registers through the expansion of the network of expertise towards Eastern Europe. In addition to providing more comprehensive data for the pan-European species registers, this expansion has overcome the separation of knowledge and taxonomic practice over decades. Another achievement is the closer collaboration of the taxonomic societies, especially with respect to improving taxonomic coverage and addressing long-term maintenance and upgrading of the taxonomy.
In addition to creating a network of taxonomic experts, PESI has developed a network of regional (often national) focal points. These focal points (either individuals or organisations) complement the taxonomic network through: (1) liaising with national governmental bodies on the implementation of European standards relevant to, for instance, national and European regulations and environmental monitoring, (2) collecting and transferring local expertise and applied tools, (3) lobbying and public policy assistance at national and European level, and (4) supporting closer collaboration of scientific contributor and user communities across Europe. Focal points contribute country-specific information about species, relevant databases, local literature, experts, professional societies and major users such as government organisations.
In biology, taxon names provide anchors that allow information about organisms to be linked. A taxonomic name, typically a species name, is attached to every primary data object (field observation, specimen, genetic data, etc.). Therefore names, together with their organisation into taxonomic classifications, are understood as core (meta-)data for biological information systems. There are many challenges in integrating data sources that contain taxonomic names and classifications, particularly where the sources extend over different biological kingdoms or national boundaries. Names may be erroneously assigned or incomplete and so searches based on exact character matching against names in current use may fail. Names that are synonyms or old combinations no longer in current use may occur in museum and herbarium specimen catalogues or in legislative lists. Names with orthographical errors occur in legislative lists on national or international level and some of them are often in use by certain taxonomists. There may also be disagreement amongst experts on the identity of specimens and on the taxonomic constituents of genera and the arrangement of classifications. The partners involved in PESI have extensive experience with handling such problems and PESI has produced practical solutions for many of these issues. The availability of authoritative taxonomic metadata standards is of particular relevance where species are directly linked to societal issues such as conservation and environmental control. PESI promotes harmonisation and certification of taxonomic metadata standards of prioritised taxa that are listed in various EU regulations and legislative lists. To address these issues, PESI had the following objectives:
PESI has technically integrated the pan-European species registers into an e-infrastructure by creating a joint access (middle) layer, the PESI data warehouse. This includes an index of species names associated with a number of attributes, such as synonyms, their place in the management classification and their geographical distribution. This data content results from the integration of the earlier pan-European checklists into a unified directory, following advanced routines on data verification and harmonisation. These routines are laid down in the PESI Common Data Model store (PESI CDM-store), an instance of the EDIT Platform for Cybertaxonomy. The PESI data warehouse not only hosts the integrated Taxonomic Backbone for Europe, but also allows unambiguous cross-links to, and compatibility with, other biodiversity information services, such as persistent identifiers, standardised vocabularies and exchange formats. Therefore this component is crucial to secure the stability of the European Taxonomic Backbone and to link PESI with global e-gateways.
PESI has built an interactive, multilingual web portal to carry out the dissemination of the developed species names service and to support the use of the pan-European species data in the e-science domain. This includes relevant supplementary data, such as (region-based) occurrence details, literature and DNA sequence information, and applies dynamic links to other pertinent data services. Additionally, web services allow users to link PESI-functions into desktop applications as well as service-oriented information infrastructures and hereby establish enhanced access to species names, GUIDs, occurrence details and the hierarchical classification. The PESI web portal provides the interface to the European Taxonomic Backbone.
PESI resulted in an integrated overview regarding the taxonomy and occurrence of European species, including their current legislation status and other important metadata annotations (like vernacular or common names). The PESI Taxonomic Backbone serves as a taxonomic data standard resource, facilitating and optimising the integration and sharing of European biodiversity data, supporting a wide range of European services, major biodiversity programs and stakeholders on nature conservation and biodiversity management.
A pan-European Taxonomic Workforce (ETW), consisting of all people who carry out taxonomic work in Europe, was identified as pivotal for the integration and coordination of the current and future expert networks within Europe (Fig.
Quality control and standards
PESI has set out a working Intellectual Property Rights (IPR) model, based on the SMEBD model, under which PESI can proceed with its aim to integrate taxonomic databases in Europe, ensuring the free and wide dissemination of data, and promoting the further virtualisation efforts (e-Taxonomy; e-Science; e-Publishing) in this field. The SMEBD model encourages scientists to input their data without being burdened with its upkeep. This allows scientists to retain full ownership of their work, knowing that data stewards are managing their data accordingly. This IPR model proposes a Creative Commons Attribution (cc-BY) license that allows for the wide dissemination of data to other scientists and interested parties, giving recognition to the creator of the data (but see
Gap analyses and Continuity
To secure the continuity of electronic biodiversity data, PESI has reviewed gaps in taxonomic expertise, species registers and informatics resources throughout Europe and surveyed potential ways to complete these gaps (Suppl. material
The functional and successful operation of ETW involves a series of informatics resources, which will be addressed within the gap analysis, particularly in relation to successful developments in this area, like Scratchpads, the Platform for Cybertaxonomy and tools for e-Publishing.
Recently the hosting of the EDIT Expert Database (including the ETW) at the Zoological Museum Copenhagen has ended in favour of a less centralised model, hosting the relevant (zoological, botanical, marine, terrestrial, etc.) expert data in the respective pan-European checklists databases. A collaboration with CETAF on the governance of taxonomic knowledge networks is under consideration (see section 'Progressions').
The continuity of relevant electronic taxonomic biodiversity data resources and expertise networks were examined (Suppl. material
The terrestrial and freshwater Focal Point Networks in PESI originate from focal point partners within Fauna Europaea (
PESI Focal Points Network, showing the respective Fauna Europaea, Euro+PlantBase and ERMS focal point partners. Northern African 'proto focal points' are indicated with open circles. Pan-Caucasian Plant Biodiversity Initiative partners are indicated with green asterisks.
Focal Points network details are described in Suppl. material
For the botanical community, the PESI Focal Point network built on the existing infrastructure of a regional advisory network for Euro+Med Plantbase. The former Euro+Med PlantBase editorial centres, each of them responsible for the coordination of a share of vascular plant families, became the PESI Botanical Focal Points: Berlin (Euro+Med PlantBase secretariat), Bratislava, Palermo and Sevilla, plus Helsinki as a non-contracting partner. The Berlin Botanical Focal Point established and maintained the connection with the regional advisory network, consisting of 85 floristic and taxonomic botanical experts, mostly based at botanical institutions all over Europe, Northern Africa, the Levant and the Caucasian countries. The regional advisers are responsible for 64 major areas (countries) and 48 minor subdivisions (mostly individual islands). Euro+Med PlantBase now has 190 vascular plant families online, corresponding to 95% of the European Flora of Vascular Plants. The Helsinki FP took care of linking grid-map data from the Atlas Flora Europaeae with the map services of the PESI web Portal.
A selection of the Fauna Europaea Focal Points agreed to become PESI Focal Point partners, others became associate, non-contracted Focal Points. The network of Marine National Focal Points aims at the continuous communication, information exchange and harmonisation of taxonomic data and practices followed by the systematists involved. Only through this process it was possible for the European taxonomic experts on marine taxa to join forces and avoid further fragmentation after the end of the Networks of Excellence, within the 6th European Framework Programme.
Over three years the entire process resulted in two major achievements: a) it created the largest network of taxonomists in Europe through the National Focal Points, and b) produced a massive amount of information on all kinds of existing nomenclators, museum catalogues, existing taxonomic expertise, societies and systematists assemblages, taxonomic information publishing gates, etc., which are now available through the single multilingual portal of PESI, which can, in turn, be used by any member of the National Focal Point network. However, the overall achievement is that the defragmentation of the European taxonomic community was encountered and that the process paved the way for this community towards a cohesive and challenging framework in the future. The next successful steps are already visible through a number of projects and initiatives (e.g. LifeWatch, EU BON, and COST action applications).
At this moment the public PESI Portal includes a large number of taxonomic-related data provided by the Focal Points. These data comprise contact and expertise details of nearly 2,000 experts and more than 500 organizations (from professional institutions to amateur societies) in Europe, in addition to the European Taxonomic Workforce (ETW) and EDIT Expert Database (see above, Fig.
PESI has a significant outreach to eastern European countries and beyond, connecting local knowledge networks to pan-European expertise and supporting the implementation of relevant e-Infrastructures for information management. For example, the Ukraine PESI Focal Point was selected as incrEAST "Project of the Month - March 2012" as well as contributing to the GÉANT Eastern Partnership Event Program in Moldova.
PESI is about integration and presentation of data. To do this effectively a common framework is needed to support the establishment and promotion of controlled vocabularies and metadata standards that enable effective integration of data and best practices. PESI sets out to provide a taxonomic backbone for Europe, at the heart of which is an annotated synonymised checklist of species. The prime focus has been to identify appropriate standards to support this task. In addition to ensuring that data held within PESI are successfully integrated, it is important that our data are accessible and interoperable with other initiatives. We have therefore looked at protocols, data models, controlled vocabularies and ontologiesthat will facilitate this interoperability, working closely with external organisations involved with biodiversity informatics to assist in the process of developing standards and resources for data exchange and data validation.
More specifically PESI focused on:
PESI researched and proposed standards for working with data associated with biological organisms, including:
The latter has been developed by the Biodiversity Information Standards organisation (TDWG) in association with GBIF and provides a simple flat-file structure, such as may be used in a spreadsheet or delimited text file, together with two XML metadata files that describe the resource and the data structure. It is important to use a simple solution to make it easy for data providers to submit their data to PESI. It also provides the mechanism for PESI data to be passed to third parties, such as GBIF or other European taxonomy and biodiversity projects.
Controlled vocabularies developed and recommended for occurrence status, taxon status, nomenclatural status and geographical regions in use in the PESI data warehouse and PESI Portal are listed as an appendix in the PESI Focal Point Handbook (Suppl. material
PESI partners organised sessions and presented papers at the annual conferences of the Biodiversity Standards Organisation (TDWG) and, most importantly, made use of these occasions to further the development of PESI objectives through networking with the leading participants in the development of biodiversity informatics standards. A significant outcome from these activities is the ‘Montpellier Declaration’. This is an agreement, proposed by PESI, between major biodiversity informatics projects to use a standard approach to sharing data (discussed in Suppl. material
International infrastructures for the production, maintenance, and publication of taxonomic checklists are highly heterogeneous with regard to scope, information models, workflows, and implementation. The PESI project recognised from the beginning that an effort for integrating European checklist information into a single unified system would need an “information broker” responsible for merging disparate and potentially conflicting information, performing data quality measures, and streamlining the process of data publication from the individual checklist to the common European information portal and service layer.
The EDIT Platform for Cybertaxonomy (
The model can be deployed using almost any Database Management System (DBMS). Application programmers can develop all kinds of systems (e.g. portals, editor software, import and export functions) using well defined APIs (Application Programming Interfaces) exposed as a Java Library (wp5.e-taxonomy.eu/cdmlib) and as generic web-services (wp5.e-taxonomy.eu/cdmlib/rest-api.html).
Data merging and publication
For its deployment as the central merging facility in PESI, the BGBM-team extended the platform and its library (Suppl. material
With these methods, a data publication cycle in PESI is performed in three basic phases (Fig.
The PESI GUID strategy
The publication of taxonomic data in web portals (for human consumption) is an important aspect of the PESI infrastructure. However, providing information in machine-readable form is becoming increasingly important, because this makes the information reusable in a wide range of potential applications (e.g. as part of the taxonomic backbone in species information systems). PESI addresses this aspect by including a SOAP-compliant web-service interface into its portal implementation. In addition, REST-full services have been implemented, providing light-weight interfaces to the PESI backbone, like EU BON taxonomic backbone UTIS. As prerequisite, the technical partners had to agree on a common approach for handling identifiers (Suppl. material
The agreement on a common identifier system had to include clear rules as to when changes to an object imply issuing a new identifier. To ensure consistent application, these rules have to work at a machine level - the operations that turn an existing taxonomic object into a new one had to be defined. The PESI approach has been recognised by the “Beginner’s Guide to Persistent Identifiers” recently published by GBIF and is fully compliant with the persistent identifier strategies developed by the organisation for Biodiversity Information Standards (TDWG).
Sustainability of the platform development
The EDIT Platform for Cybertaxonomy is continuously improved by an international team of developers, coordinated by the Biodiversity Informatics Research Group of the BGBM. It is the basis for an increasing number of checklists on different scales as well as geographic and taxonomic scopes. In addition, several new EU e-infrastructure projects have the EDIT-Platform integrated as an important technological component:
With new projects related to EDIT platform developments in progress (like EU BON) and the increasing number of users of the technology we expect to maintain the Platform as an integral part of the European and world-wide biodiversity informatics landscape.
Sustainability of European GSDs
PESI supported the development of governance plans for EU-based Global or Regional Species Databases (GSDs/RSDs) in the following ways: (a) as a European taxonomic infrastructural component, (b) as a contribution to global efforts like the Catalogue of Life (Fig.
PESI portal
The PESI web portal is the interface to the European Taxonomic Backbone (Fig.
Still operating separately, the register’s data are merged about every year in the PESI Data Warehouse and are made available through this single portal. In addition to taxonomic information, PESI harvests information on species (images, literature, conservation status) and provides links to other portals (e.g. national checklists, red species lists and other bioinformatics databases such as the Biodiversity Heritage Library for literature and the DNA databases of the Barcode of Life and GenBank). So far three editions of the European Taxonomic Backbone have been released (see section 'Dissemination').
The portal is an important instrument for standardisation of species names. The search interface is the main public access point to information on species living in Europe. However, the portal also provides services for those building their own species applications. The PESI-website can be consulted in 21 European languages.
Web statistics
The website attracts around 5,000 unique visitors per month (Fig.
Search parameters
The advanced search interface provides a number of fields to generate output based on selected parameters (Fig.
Taxonomic parameters
Additional parameters
The occurrence status “present” includes all other statuses except for “absent”. The list of areas is linked to the Marine Regions gazetteer. This hierarchical gazetteer of place names makes it possible to add relationships to areas. The users can, for example, generate a list of species from France and the system automatically includes the species that are recorded as present in Corsica.
Besides creating lists of species, the user can search for a particular taxon by entering (part of) the scientific name, name authority or common name. However, if there is no exact match, the search tool performs a number of ‘intelligent’ consecutive queries until matches are found:
E-services for taxonomic standardisation
The PESI web portal also provides a number of tools for quality control and to standardise a user's own species names. The ‘taxon match’ is an intelligent name validation service to cross-match external species lists against names in PESI. In addition, if an external species list is restricted to a particular area, you can also check if this corresponds to the occurrences in the PESI database. Once you have run the match, the tool exports the results (exact and suggested spellings, the higher classification, occurrence status and the checklist’s Globally Unique Identifiers) as a spreadsheet file.
The PESI taxon match tool has been promoted as an important tool within the PESI focal points network and at various other meetings (e.g. GBIF EU-nodes meetings). During the project phase, as part of the PESI Focal Points validation process, 357 files have been uploaded and matched.
In contrast to the taxon match, where users have to upload a species list, the portal also provides a platform-independent SOAP/WSDL web service. This web service allows users to dynamically link their own applications to the PESI database and will allow them to match a locally stored species list and add taxonomic and additional information derived from PESI.
A few examples of possible applications:
Additional (other than taxonomic) information on the web portal
The PESI portal shows species distribution maps, if occurrence details are available. There are over 6 million distribution records in the PESI database. The maps are built on OpenLayers. The backend of both occurrence types is GeoServer, an open source implementation of WMS that implements the Open Geospatial Consortium (OGC) standards.
There are two types of occurrence data on the portal:
Deep links to other biodiversity information systems
The PESI portal provides links to other portals (see above). On name matching, more specifically:
The Biodiversity Heritage Library (BHL) API is queried for scientific names (redirecting to the Global Names recognition and discovery tools and services). If matches are found, the information is cached in our database for one month. The next time someone loads a particular taxon page within the same month, the cached information is displayed. If the request time compared to the last update in the database is above one month, re-querying the BHL web service refreshes the information.
GenBank provides a standard method for linking to their pages. We retrieve the information from their Taxon Browser tool. If the link exists, it is displayed on the species page.
The Barcode of Life database (BOLD) results are fetched from the BOLD Taxonomy Browser. Only when the 'specimens with barcodes' parameter is set, a link is visible on the species page.
The PESI project recognises the need to link the conventional scientific publications to the species databases to provide users with a more comprehensive resource. To that end the project engaged with major publishers of scientific journals to understand how their online information systems were developing, and to explain how the species databases were evolving. A first workshop was held on 16th July 2009 in Amsterdam inviting key publishers, including PLoS, InterResearch, Allen Press, CRC Press (Taylor & Francis), Oxford University Press, Scopus, Science Direct, Elsevier, ISI Web of Science of Thomson Reuters, OvidSP (Biological Abstracts), Wiley, ProQuest (part of Cambridge), and JSTOR (Suppl. material
The process of contacting publishers revealed that most publishers were very limited in the functionality they had on their websites, and constrained by the limitations of the commercially provided software they used to provide added functionality. A first step in that regard has been the initiation of a special PLoS ONE Collection of scientific papers arising from the World Register of Marine Species (WoRMS). WoRMS is a superset of the European Register of Marine Species, which is one of the three primary European species databases that form the core of the PESI project. At present, more than 19 papers have been published (
A similar approach was followed later on by Fauna Europaea in collaboration with Pensoft, supported by the FP-7 ViBRANT project, starting the publishing of so-called 'Contributions on Fauna Europaea' as a special series of the Biological Data Journal (BDJ), with 10 data papers published so far. Euro+Med from the beginning took a distinctive approach towards updating the database. For instance, all occurrence records for a species in Euro+Med must have been published in a peer-reviewed paper. To accommodate miscellaneous new data, a series of "Euro+Med Notulae" was initiated in 2005 (
Another conclusion of this publishers workshop was that an important, practical, option to link to a wider range of journals would be the use of RSS feeds, because this would provide no or little action on the side of the journal. Such a system would need the feeds to be aware of what species (or higher taxonomic) names to search and match to the published papers. The Global Biodiversity Information Facility (GBIF) communicated their advances on developing a relevant tool. The usefulness of such a tool should be tested. For example, if species or taxonomic names are not apparent in the titles, abstracts and keywords of journal articles, they may be overlooked. Alternatively, papers of very peripheral relevance may be fed to the species database and overwhelm the editors and readers. In the latter case additional filters could be placed to constrain the RSS feed. The PESI partners intend to explore these options further on when the GBIF tool becomes available, a collaboration highlighted at the EDIT second publishers workshop in Bratislava (Suppl. material
The highest priority journals for species databases to be linked are those describing new species, and rationalising species nomenclatures (e.g. identifying synonyms or reclassifying species). One of the leading taxonomic journals in this field is Zootaxa, which publishes almost five times more new species than the next ranked journals (http://www.organismnames.com/metrics.htm?page=tsj). The PESI project started a regular dialogue with this journal, which led to the invitation of the Zootaxa Chief Editor (Dr Zhi-Qiang Zhang), to a follow-up EDIT third publishers workshop in Copenhagen on 7-8 October 2010 (Suppl. material
In relation to this subject, the 'Environmental and Natural Science Publishing in Europe' (ENSI) proposal was drafted to the European Commission FP7-SCIENCE-IN-SOCIETY-2011-1 call for funding entitled ‘Improved dissemination and preservation of natural history publications’ (FP7 289063).
The project results have been dissemimated in various ways. Some main public communication tools are mentioned below.
The PESI project homepage can be found here: http://www.eu-nomen.eu/pesi
An interface to the European Taxonomic Backbone is provided by the PESI webportal: www.eu-nomen.eu
PESI statistics for version 3 are summarised in Fig.
PESI Portal and PESI Datawarehouse (version 3) statistics. Source: http://www.eu-nomen.eu/portal/stats.php.
An introduction to PESI is available as a videoclip: http://www.eu-nomen.eu/portal/introvid (Suppl. material
A PESI brochure is available here: Suppl. material
A PESI Flyer is available here: Suppl. material
PESI is well situated within the EC infrastructural and policy development. This is partly due to the fact that many end-users, stakeholders and EC directives (like INSPIRE) adopted PESI as a European standard. Therefore major European biodiversity programs, like LifeWatch and EU BON, incorporate PESI components in their respective roadmaps and work plans to further the developments of a Taxonomic Backbone for Europe.
For establishing standards and sharing resources, PESI makes use of a huge network of taxonomic specialists in all European countries. Together with principal partners (like CETAF), PESI contributes to the objective on establishing an integrated taxonomic (working) community for Europe, operating as one virtual workforce representing a shared knowledge network.
In addition, PESI is selected by diverse EC bodies (like GÉANT) to outreach European biodiversity e-Infrastructures for the Eastern Partnership countries, thus extending the current geographic scope of the pan-European checklists, finally covering the whole Palearctica ("Flora/Fauna Palearctica").
PESI contributes to ongoing e-infrastructural developments, like VIBRANT and BioVeL, supporting the development of a shared and open virtual infrastructure to provide a more efficient interface between the existing biodiversity information infrastructures and stakeholders (policymakers, researchers, biodiversity managers).
Some instances of the above synergies and developments are highlighted below.
The INSPIRE directive establishes an infrastructure for spatial information in Europe to support community environmental policies and activities, which may have an impact on the environment, operated by the Member States of the European Union. As part of the INSPIRE Data Specification for the spatial data theme Species Distribution (Commission Regulation (EU) No 1253/2013), PESI is selected as a formal taxonomic standard for Europe, meaning that it is the first prioritised taxonomic reference classification to be used for data connected to species names.
Part of the PESI future progress lies in its potential to further engage and organise the taxonomic community participation as a vital, virtual workforce including (i) the instalment of proper expert network governance, (ii) the application of proper ownership licences, (iii) the use of appropriate mechanisms for acknowledging expert contributions, (iv) the ability to function as an efficient 'knowledge hub', supporting biodiversity research and decision making more generally, (v) the successful involvement of more open and dynamically organised social networks and communities, like non-professional taxonomists and citizen scientists.
Network governance
Considering the overall decline of taxonomy as a scientific discipline, the maintenance of a basic (taxonomic) expertise capacity will be essential to satisfy the knowledge needs for a wide range of biodiversity-related information services in the near future, including the pan-European checklists. In PESI we moved forward from the efforts of EDIT on establishing a collaborative workforce, adding the integrated PESI expert networks into a single EDIT Expert Database and by defining more sophisticated social networking standards. To support the process of expert network integration, the SMEBD licensing model was applied. Further, the role of taxonomic institutes on the sustainable hosting of the taxonomic databases was studied (
A network of European leading taxonomic institutions forms the Consortium of European Taxonomic Facilities (CETAF), holding the majority of the worlds' biodiversity collections and their data. CETAF's mission contains the enhancement of Europe's knowledgebase capacity on taxonomy for a wide range of scientific and popular users (
Accreditation and data papers
An important vehicle for authoring metadata provenance, providing a clear recognition of all contributors and enabling the receipt of credits as a formal scientific publication by means of citation, are 'data papers'. Data papers also play an important role in the publication of small bits and pieces of information, like new distributions, which would otherwise be difficult to publish (termed 'micro-publication'). Data papers allow a more flexible/dynamic way of expert involvement within the process of data collation and reviewing, because contributions are more easily acknowledgeable than is currently the case. Finally, by using novel e-publishing tools, the process of manuscript drafting is highly automated, being more convenient for the editors, but also enabling a direct cross-indexing with other relevant metadata resources, supporting feedback mechanisms in foreseen name annotation workflows (see also 'next generation name infrastructure' subsection, below).
PESI already supports the publishing of data papers accompanying the WoRMS, Fauna Europaea and Euro+Med updating process (see 'Results'). More emphasis will be put on this work within the EU BON project, to trigger expert participation, to automate the dissemination and back-linking (tagging) of taxon names (within data papers) to external resources (
Networks of Knowledge
As part of the Intergovernmental Platform on Biodiversity and Ecosystem Services (IPBES) developments and the Mapping and Assessing Ecosystems and their Services (MAES) process, the European Commission is exploring mechanisms to build so-called Networks of Knowledge (NoKs), to interact more broadly with the whole community of knowledge holders on biodiversity and ecosystem services, to inform decision making and strengthen the knowledge-policy interface. BiodiversityKnowledge (KNEU) is proposed as a governance structure for this NoK. PESI is involved in the development and testing of the BiodiversityKnowledge prototype, improving the knowledge flow between biodiversity knowledge holders and users in Europe, a process recently concluded (Suppl. material
Non-professional taxonomists and citizen scientists
Currently around 50% of the experts contributing to the updating of the pan-European checklists includes professional taxonomists. However, the contributions of non-professional taxonomists will significantly increase in the near future. PESI should effectively anticipate this development by implementing relevant social networking mechanisms, including mentoring, educational and accreditation systems, in close collaboration with taxonomic institutes and societies.
An associated exercise is the adequate integration of efforts of voluntary biodiversity recorders, who provide a major contribution to the continuing monitoring of Europe's biodiversity. During PESI an ESF networking proposal was submitted entitled “Citizens Monitoring Biodiversity (CMD)” to optimise the involvement of the volunteer biodiversity observation community into the European biodiversity programs (
Indexing biodiversity is a global challenge. PESI contributes to worldwide efforts on preparing global catalogues (like CoL) and supports relevant name services (like GBIF-ECAT) on increasing the resolving power for integrating biodiversity data.
More particularly, PESI has extended the pan-European checklists geographic scope by involving Focal Points from outer European Union territories, with the ultimate intention of covering the whole Palearctic (see 'Outlooks' section). The Palearctic is the largest biogeographic area of the world, containing a very rich and unique flora and fauna within a vastly diverse environment of unique habitats, especially in biogeographic transition and refuge zones regions, like the Caucasus. Integrating the available taxonomic expertise, data and resources into shared research infrastructures is crucial for globalising biodiversity assessments.
PESI partners from Eastern European countries, up to Russia and the Caucasus, are actively involved in carrying out aspects of the PESI work plan. To progress the participation of Caucasus partners a proposal was drafted to the European Commission FP7-INCO-2010-6 call for funding entitled "Network for Biodiversity Research in the Caucasus (NBRC): Developing the Biodiversity Research Centre in Tbilisi in a regional and international context" further building local capacities and integrating the scientific excellence and facilities for exploring the Caucasus Biodiversity Hotspot into the European Research Area. Unfortunately this proposal was not funded in this round.
Similarly, for the Mediterranean, a connection was made with national partners (Morocco, Algeria, Tunisia, Egypt) and biodiversity networks (BioNET-NAFRINET / ATUTAX) in Northern Africa (see: BioNET-NAFRINET 2010 LCC Meeting). Northern Africa is an area of great ecological importance as a conversion zone in between several biogeographical regions and as a passageway for migrating species. A proposal written to the JRS Biodiversity Foundation entitled "Networks for Biodiversity Indexing in Northern Africa contributing to EU-nomen" to extend the PESI work program toward Northern Africa was rejected for full funding, although travel grants have been applied for North African scientists to attend TDWG meetings, to become familiar with biodiversity informatics best practises and to share their experiences in doing this kind of work at home.
Vernacular names are the most important search terms for non-professional users to retrieve biodiversity information. By means of the network of Focal Points (see 'Results' section), PESI collected substantial additional information on European species, including non-scientific names.
As part of the OpenUp! project, Opening Up the Natural History Heritage for Europeana (
PESI is participating in innovative biodiversity-informatics projects, developing and staging the ongoing virtualisation of the biodiversity research domain, building virtual tools and workbenches, including the collective use of data from multiple sources, and the automation of workflows of various tasks and processes.
The pan-European checklists have been applying automation for around 15 years since the initial versions of their data management systems. This has included implementing advanced virtual workbenches, including largely automated data-entry and data cleaning routines, which has eliminated a lot of manual processing. PESI continued this practice by installing automated routines for, among others, assembling the PESI Data Warehouse and for supporting the validation of checklists by means of the automated mapping tools (see Results).
More efficiency in workflow automation is obtained when restrictions of distributed architectures are further reduced, enhancing cross-platform operationability. This requires a broad set of infrastructural adaptations, including the harmonisation and standardisation of APIs, data exchange formats and ontologies. As part of the ViBRANT project, PESI contributed to the development of a common publishing platform for taxonomists, driving the (virtual) integration of some major biodiversity information delivery platforms (see: Fig.
In its original stage, the PESI infrastructure implemented two gateways to European biodiversity based on the taxon-level information provided by the participating checklists:
Both webportal and webservices are optimised for a usage scenario with requests on individual objects (e.g. a particular name or taxon, information related to a particular identifier, etc.). However, we believe that PESI services will play an increasingly important role in workflow-driven systems. In this context, PESI will, for example, be used to expand a taxon name query to include its synonyms when combining independent scientific services that use different taxonomies. For this purpose several measures should be taken, including: (i) the extension of existing service layers supporting the retrieval of massive amounts of data, (ii) the optimisation of services for performance and reliability to ensure their usefulness in a workflow environment, and (iii) the optimisation of the PESI data warehouse structure for efficient output-oriented queries.
With these extensions, new workflow-oriented scientific applications can be realised and add further value to the PESI infrastructure. Both the EC-FP7 BioVeL and iMarine projects, in which PESI was represented, followed this approach towards a virtual biodiversity e-Library enabling pipelining of data and analysis into efficient integrated workflows (see subsection 'VREs', bellow).
Further European Taxonomic Backbone advancements are scheduled as part of the EU BON and LifeWatch projects:
In the EU BON project, Building the European Biodiversity Observation Network, the PESI Backbone will be advanced to satisfy the needs of the GEO BON / GEOSS system, also serving as a taxonomic backbone for the projected EU BON Biodiversity Portal. The European Biodiversity Portal will be developed to serve as a main information hub by providing integrated biodiversity data from different fields. The linked data will come both from in-situ as well as from earth observation data and the taxonomic backbone will be essential to link the data from different disciplines and locations. This process (so far) includes: (i) the completion of the Fauna Europaea and Euro+Med PlantBase migration and integration to the EDIT Platform for Cybertaxonomy (a process kicked-off in PESI), (ii) the registration of the third PESI Backbone version, as a EU BON taxonomic backbone prototype, to the respective GEOSS CRS and BiodiversityCatalogue services, (iii) the enhancement of relevant associated EDIT Platform functions, like the validation tools, and (iv) the harmonisation with other checklists, like the Catalogue of Life (CoL), FADA and WoRMS, to allow for federated (multiple checklist) searches and a unified data-model mapping (Suppl. materials
A complementary effort is foreseen in LifeWatch where, as part of the Flemish contribution to LifeWatch, the LifeWatch Taxonomic Backbone will be established, especially focussed on the marine environment. This LifeWatch Taxonomic Backbone will integrate various taxonomic checklists, including EU-nomen, and a range of associated biogeographical, ecological, genomic and literature data resources, supporting the LifeWatch infrastructure developments (
Special cases of automated workflows are Virtual Research Environments (VREs), which will provide the next generation of shared research environments. Virtual labs provide an interactive environment in which researchers can access, collect, integrate and explore large amounts of data from multiple resources for analysis, data mining, and visualisation, supported by automated workflows and standardised web services.
PESI is involved in e-Science projects developing virtual labs for scientists (as part of BioVeL) and fishery agencies (as part of iMarine), guiding the accurate application of taxonomy and connecting the related web services, to accelerate the access, integration and application of taxonomic names as critical meta-data (
An example of how virtual labs could increase the analytical value of biodiversity portals is given in Fig.
Other instances of interactive environments can be found in advanced biodiversity data portals. As part of the Focal Points workplans, PESI stimulates the sharing of best practices on portal development, especially regarding taxonomic checklist governance, data exchange and sophisticated web tools. As an example, the Romanian PESI Focal Point receives dedicated support from the myBiOSis portal. The myBiOSis system allows an integrated view of a flexible selection of frequently-used web applications within a single user interface (template), thus easily accommodating the requirements of various projects and usertypes. Although the original myBiOSis modules are developed for recording species occurrences and phenotypic features (Fig.
Similarly the Azores Bioportal (ABP) provides a regional e-infrastructure for the Azores Islands (
A marine virtual research environment (VRE) has been created as the first operating component of the LifeWatch Research Infrastructure (ESFRI). It has been organised through a bottom-up approach from the participating states. The main services designed are: (1) VRE entry page & VRE components (virtual laboratories); (2) VRE calculation tools and biotic indices; (3) VRE derived products as base layers; (4) Biosensor data collaborative platform; (5) Taxonomic Backbone & species traits; 6) VRE training event & marine LW follow up meetings. The components that are strictly related to PESI are (2) and (5). The later is based on the taxonomic backbone developed in the course of WoRMS (World Register of Marine Species) and PESI, while the former on PESI and ViBRANT projects. Another innovative virtual laboratory under development is the one on the 3D representations of micro-CT scanning's of macro-organisms. This laboratory will offer a suit of galleries of interactive 3D representations allowing the user to have access to both morphological and anatomical representations. In its current version the Marine LW VRE is offered through a single portal: http://marine.lifewatch.eu. It offers three modules for its exploration and use: (1) Access; (2) Analyze; (3) Develop (see: Fig.
LifeWatch marine virtual research environment (VRE) (source: http://marine.lifewatch.eu).
Environments are changing rapidly all over the planet, making it imperative that we have our systems for communicating biological information working efficiently and reliably. Data based on proper identification of taxa are essential to monitor changes in nature; information needs to be integrated on the dynamics on species existence (migration, extinction, intrusion) and on instability of the associated ecosystems. Ongoing, critical environmental assessments are important to document and control critical disorders, like the decline of (native) pollinator species, the impact of algal blooms, the effect of overexploitation and the invasion of pest species. Taxonomy is a foundational science, but its reliable application is hindered by the limited knowledge of many aspects of biodiversity and the relative disorganisation and inaccessibility of taxonomic information. PESI contributes to the synthesis and access to existing taxonomic knowledge by maintaining a network of outstanding experts and by taking care about the delivery of persistent standards and data integration routines, securing a high-level access to biodiversity data.
The biodiversity community, in anticipation of the European Commission H2020 call for larger, more integrated networks, is: (i) pushing biodiversity research as an innovative information science (
Annotation workflows
PESI contributes to the development of a next generation linked open-data names architecture, expanding the inter-platform operability and making the workflow orchestration and task automation more efficient between associated name services (Fig.
In the longer term a next generation names architecture is developing feedback mechanisms to automate the cross-annotation of different workflows using taxonomic information. This further virtualisation provides a number of advantages. Firstly, because most biological information (observations and knowledge) is linked to names, this will significantly increase the shared use and discovery of biological information. Secondly, because every virtual interaction provides a virtual documentation, this will enable the generation and accumulation of new 'information facts', finally resulting in a novel data ecosystem, also called 'big data' science.
However, virtualisation and automation aren't self-evident processes. They need a careful monitoring of the existing information environment and an exact knowledge on the amount of virtualisation and automation required, together with a defined strategy on the relevant infrastructural changes to be made. PESI is considering these steps, provisionally called Global Names Architecture (GNA), in close collaboration with important international contributors in this domain, like GBIF, CoL and Global Names.
Global Names Achitecture & Global Names Usage Bank
Taxonomic decisions are based on consideration of data in an interpretive framework. The selection of what data are used, how it is weighted, and the philosophical framework for interpretation, all are individual choices made by the taxonomist. The choice is based on a taxonomist's skills, past experience, data availability, educational background, and the structure of diversity in the organisms under study. Thus, many taxonomists have a natural scepticism towards 'big data' attempts, as they see this a limiting their ability to make taxonomic judgements. However, in the case of a Global Names Architecture (GNA), taxonomy would benefit greatly from an infrastructure handling nomenclature (name description), science (postulating species concepts) and taxonomic practice (all names in use) as discrete systems (Fig.
As an example, ZooBank is the official online nomenclatural registry for zoology, under the auspices of the International Commission for Zoological Nomenclature (ICZN). It is a registration system for new and legacy scientific names for animals, which comprise by far the greatest number of described taxa of any organismal group. PESI and ZooBank have had a closely paralleled, collaborative development (see also 'Results') and it is envisioned that they will become tightly linked in future perspectives.
After beginning in 2008 as a stand-alone system, like many other current nomenclatural and taxonomic sources, it was seen that ZooBank’s effectiveness would be exponentially increased if it were developed as a service within the Global Names Architecture (GNA), operating on top of the Global Names Usage Bank (GNUB,
ZooBank is the forerunner model for GNUB-based registration systems that can be developed in other nomenclatural domains. In addition there are many other services that GNUB can facilitate, leveraging the power of a robust code-based nomenclatural ‘skeleton’ and a usage-based ‘body’. Some of the GNUB services are indicated in the ZooBank interface shown in Fig.
An example ZooBank page (Gasterosteus Saltatrix Linnaeus, 1766), illustrating several GNUB services: (1) user authentication; (2) “fuzzy” searching of GNUB content; (3) APIs and services; (4) ZooBank registration; (5) External Identifier cross-linking; (6) BHL page linking; (7) similar/related name discovery (via GNI’s name searching service); and (8) multi-lingual support. Not shown are services to manage user accounts, de-duplicate records, prototype reconciliation tools, services for journal publishers, and visualization tools for author publication history and other statistics.
GNUB/GNA provides a single shared platform for all cross-links, such that anytime a record is indexed in GNA, it is automatically cross-linked to all other data systems that are indexed in GNA. Unlike most existing biodiversity data initiatives, the components of GNA (particularly GNUB) are not intended to provide novel information; rather, GNUB is an index of core facts that are shared across all of biology. Nothing in GNUB is original or novel content; it merely represents a structured way of organizing information to facilitate broader data integration among other databases that do contain original information. Thus, the GNUB index does not compete with other data resources; but rather serves as a core infrastructure for cross-linking (and thereby empowering) other biological data sources.
Despite the traditional role of scholarly publications to serve as the primary vehicle for disseminating and reusing peer-reviewed scientific findings, it was recognised in PESI that scholarly publishing cannot persist anymore as just a method of communicating final results, because of the obstacle it is for efficient data sharing, reproducibility and reuse (see section e-Publications). Rather, scholarly publishing should become a part of the scientific process itself (
EU-nomen will contribute to the further integration of scholarly publishing services as an integral and interlinked part of the whole data gathering, data mobilisation and research process, synchronised with data sources, data aggregators, attribution and annotation services, and ontology frameworks. Technically this integration of the publishing and research processes will partly be achieved through a commonly developed and shared API library, based on community-agreed data exchange formats.
On expert engagement, a special role in the process will be played by the "data paper" concept as an important instrument of data mobilisation, publication and community involvement (
Because taxonomic information provides the primary identification of an object and is a prerequisite to make other biodiversity data discoverable and available, the evaluation of the state and trends of biodiversity is only possible with help of an elaborated taxonomy. Accordingly the EU BON project, by developing a webportal for enabling an integrated access to European biodiversity data (
EU BON supports advances on the PESI work program, focussing on improving the pan-European terrestrial (Fauna Europaea and Euro+Med PlantBase) checklists quality and completeness. For this purpose the gaps in taxonomic data and knowledge will be further analysed with help of the connected expert networks, for instance as part of the Fauna Europaea data papers preparations. More fundamentally, technical solutions will be needed to advance the experts' data management tools (or 'virtual workbenches') with improved annotation functions, allowing easy review and uptake of discrepancies that might be found to occur between the pan-European checklists and other resources (like regional checklists, observation/monitor data or newly published names). This feature will increase the quality of the pan-European checklists data, speedingup the harvesting of all 'names in use' and close the 'delay gap' between the publication of new names and the actual integration into the pan-European checklists. Presumably, implementing such advanced taxonomic workbenches and publishing tools will also motivate experts to become more engaged in working on taxonomic databases and on editing, updating and annotating existing datasets.
Already from the earliest stage, because of the extended geographic scope of the pan-European checklists, experts and institutes of European neighbourhood countries (including Russia) have been involved in the checklist work programs. Consequently, enhancing the range and collaboration towards a full Palearctic coverage follows the footprints of earlier projects, like Fauna Europaea, Euro+Med PlantBase, WoRMS, EDIT, PESI and other (international or local) initiatives. Such an extension seems inevitable, considering the practical necessity of assessing biodiversity as a borderless science and taking into account the European Union economic and political arguments, aiming a further exploitation of the potentials on cooperation in science, technology and innovation with outer-EU partners (
For two centuries, the Natural History Institutes in Eastern European and Central Asian countries have accumulated an extensive knowledge and huge collections on the flora and fauna of this wide region. The integration and implementation of this information into existing global or European biodiversity databasing initiatives is, however, inadequate because of a suboptimal use of shared virtual and social infrastructures. The proposed "Flora/Fauna/Mycota Palearctica" project will solve this impediment by intensifying the existing collaboration with EU-nomen partners, by extending the partnership towards Northern Africa and Central Asian countries, and by effecting a common work program, integrating available taxonomic resources into a uniform system and the associated experts into a corresponding network. The accumulated information, freely accessible for the world community, will significantly advance the global and regional capacity on (future) biodiversity assessments.
PESI was funded by the European Commission under the Seventh Framework Program (project reference: RI-223806), contributing to the FP7-INFRASTRUCTURES 'Capacities' sub-program under the Coordination and Support Actions (CSA) funding scheme (call: FP7-INFRASTRUCTURES-2007-2). Continuing support for the coordination and management of PESI was given by the University of Amsterdam (via the former Zoological Museum Amsterdam), and key partner institutes, including Botanischer Garten und Botanisches Museum Berlin-Dahlem (BGBM), Royal Botanical Gardnes, Kew (RBG) and Vlaams Instituut voor de Zee (VLIZ).
The pan-European checklists initial set ups were established as part of the respective EC-FP4 ERMS (MAS3-CT97-0146), EC-FP5 Euro+Med (EVR1-CT-2000-40004) and EC-FP5 Fauna Europaea (EVR1-1999-20001) projects. A first pilot on integrating the pan-European checklist was carried out during the EC-FP5 EuroCAT (EVR1-CT-2002-20011) project.
Preparatory work on PESI was done during the EC-FP6 EDIT (GCE 018340) and MarBEF (SUSTDEV 505446) Networks of Excellence (NoEs). Support on e-Publishing and optimising platform interoperability was given by the EC-FP7 ViBRANT (RI-261532) project and support on preparing taxonomic workflows and on installing web services was provided by the EC-FP7 BioVel (INFRA 283359) and EC-FP7 iMarine (CONNECT 283644) projects.
The further development of PESI (as 'EU-Nomen') is partially covered by the EC-FP7 EU BON project (ENV 30845) and partially supported by national LifeWatch nodes, including LifeWatch Flanders and LifeWatch Greece, spin-offs of the EC-FP7 LifeWatch (INFRA 211372) project. Additional support is given by taxonomic museums or institutes, national biodiversity platforms, GBIF nodes or biodiversity agencies, hosting PESI focal points.
We thank the Linnean Society London for its lasting and resolute support during the years. We are grateful to prof. Alessandro Minelli, and his Italian colleagues, for instigating Fauna Italia in 1989-1991, which paved the path for ensuing taxonomic indexing projects in Europe.
Global Nomenclator Architecture (GNOMA) meeting on the contribution of nomenclators to the Global Names Architecture (GNA). ZooBank provisioning meeting on defining strategies for the uploading of ZooBank, especially on the contribution of taxonomic (zoological) key-resources.
Initial scoping meeting on GSDs and nomenclators involvement
Joint EDIT - PESI meeting on Scientific Publishing in NHIs
Also available at: http://vbrant.eu/content/d43-design-robust-services
Eight institutes commited themseleves to a coordinating role for Focal Point activities covering larger regions in the EU and adjacent countries if funding would become available. This figure gives a schematic presentation of this future effort to elaborate.