Biodiversity Data Journal :
Data Paper (Biosciences)
|
Corresponding author: Hui Kim (huikim@mokpo.ac.kr)
Academic editor: Jörg Holetschek
Received: 05 Dec 2024 | Accepted: 22 Jan 2025 | Published: 30 Jan 2025
© 2025 Chin Sung Chang, Kae Sun Chang, Hui Kim
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Chang CS, Chang KS, Kim H (2025) Data mobilisation for historical records of vascular plants in eastern Asia: V. L. Komarov’s expedition to Far-Eastern Russia, China and Korea from 1895 to 1897. Biodiversity Data Journal 13: e143631. https://doi.org/10.3897/BDJ.13.e143631
|
|
Historical collections of herbaria and literature play a crucial role in documenting biodiversity information. The botanical biodiversity of northern Asia is significantly understudied compared to other regions of China and Japan. In particular, the biodiversity patterns in China's three north-eastern provinces, North Korea and the Russian Far East remain poorly understood, with substantial gaps when compared to the records of species distributions in Japan, South Korea and inland China. The Komarov data, orginally written in Russian, required extensive efforts to georeference the 130-year-old Chinese and North Korean place names to their modern equivalents and translate historical names to their current forms. This study aims to restore the Komarov data, including both specimen records and occurrence data, to assist the broader scientific and environmental community in recovering key biodiversity data from the past of northeast Asia. The impetus for this work was the need to assign geographic coordinates to plant specimens collected in the region and to V.L. Komarov's observations as primary occurrence data from 1895 to 1897.
In this study, we present historical occurrence data obtained from the north-eastern Asian plant expedition carried out from 1895 to 1897 by V.L. Komarov in Far-Eastern Russia, Heilongjiang, Jilin, the eastern region of Liaoning in China and the northern region of Korea. The occurrences were georeferenced to more than 350 sites in Russia, China and Korea. All occurrences were georeferenced and species names were cross-checked and taxonomically updated using our own Asian plant checklist. The dataset consists of 21,114 primary occurrence records, comprising 6,956 specimens and 14,158 observation records. The outcome clearly shows that such initiatives can reveal an unexpected amount of highly valuable biodiversity information for “data-poor” regions.
georeferencing, botanical records, V.L. Komarov, primary occurrence data
The uneven distribution of information on the occurrence of species is a key issue in biogeographical analysis as well as in the planning of conservation measures (
V.L. Komarov botanical records from eastern Asia from 1895 to 1897
The datasets were digitised by Hui Kim (data manager), Chin S. Chang was the resource creator and Kae Sun Chang and Chin-Sung Chang were the content providers. With the help of Brahms (
The project aims to mobilise and publish the data about V.L. Komarov's collections deposited at LE and other herbaria and primary occurrence data cited in Flora Manshuriae (
This research was supported by Korea National Arboretum (Data-mining project).
Within the LE Herbarium, two groups have been defined for data mobilisation purposes. The first group consists of herbarium specimen records (Fig.
An example specimen label, Komarov1469, registered as Diervilla florida (Bunge) Siebold & Zucc as well as the syntype of Weigela praecox var. tomentosa Nakai. in the Herbarium of the University of Tokyo (TI), was collected from ‘Ostium Fluvii Tumin-gan’, a place name georeferenced to be located in Undok, Hamgyong-bukto, Democratic People's Republic of Korea. It was currently using the accepted name, Weigela florida (Bunge) A.DC.
Georeferencing: In the case of Komarov's geographic name information, the original Russian-language Flora Manshuriae (
1. The data mobilisation phase includes the extraction of data from the three volumes of Komarov's Flora Mandshuiae, the filling of the initial datasets and the georeferencing. Quality control is carried out at all stages and includes a final cross-check of the data. In particular, the master files and initial datasets will be archived in the BRAHMS database. We used the taxon module provided by BRAHMS to manage scientific names and we used the verbatim scientific names given by Komarov to reflect the most recent nomenclature (
2. The botanical records are taken from the three volumes of Flora Manshuriae, the lists of available material (gazetteers and data) being synchronised and updated according to the latest taxonomy. When only literature was available for species occurrence records, the scientific names recorded in Komarov's flora were transcribed into the current scientific names, with typographical errors removed. The rank of the original scientific name and the corresponding taxonomic status were retained. Where specimens were available, the earliest scientific name recorded on the label was checked against literature and the earliest scientific name recorded was chosen as the default scientific name. For types of new taxa by Komarov as well as other authors, the scientific name of the corresponding type is given in the "typeStatus", for example, "Isosyntype of Acer tschonoskii var. rubripes Kom.". Where the accepted name differed, we presented it in a column called 'acceptedNameUsage'.
3. A multilingual gazetteer has been produced to resolve the confusion of place names in northeast China, Far Eastern Russia and North Korea. The gazetteer thesaurus was created as a tool to correlate the often radically different names given to a single place or feature and to provide geographic coordinates for each.
4. The geographical information on the labels of this period included not only the name of the place where the specimen was collected, but also other information, such as a topographic description of the river or its location on a particular mountain. To know the distribution of a plant species, it is first necessary to georeference the collections so that they can be plotted on a map.
Komarov journeyed through Amur Oblast, Jewish Autonomous Oblast and Khabarovsk Krai regions of Russia in 1895. The following year, in 1896, he explored the Primorsky Krai, Heilongjiang and Jilin regions of China. Subsequently, in 1897, he revisited the Primorsky Krai in Russia. His travels continued to lead him to Rason, Hamgyongbuk-do, Ryangang and Chagang in North Korea, as well as Liaoning and Jilin in China.
40.88 and 49.46 Latitude; 122.87 and 133.43 Longitude.
The majority of specimen records and primary occurrence data belong to class Magnoliopsida (4,669 specimens + 12,228 occurrence data from literature) and Liliopsida (1,605 specimens +1,337), followed by Polypodiopsida (445 specimens + 442), Lycopodiopsida (54 specimens + 43) and Coniferophyta (183 specimens + 108). Our dataset represents 123 families (Fig.
April 1895 to November 1897.
This work is licensed under a Creative Commons Attribution (CC-BY) 4.0 License.
The total dataset contains some 6,956 plant specimen records and 14,158 occurrences recorded by V. L. Komarov in northeast Asia during the three-year period from 1895 to 1897. These data are expected to contribute to biodiversity management and conservation of ecosystems in these highly inaccessible regions.
Column label | Column description |
---|---|
institutionCode | The name (or acronym) in use by the institution having custody of the object(s) or information referred to in the record. |
type | The nature or genre of the resource. |
basisOfRecord | The specific nature of the data record. |
occurrenceID | An identifier for the Occurrence (as opposed to a particular digital record of the occurrence). In the absence of a persistent global unique identifier, construct one from a combination of identifiers in the record that will most closely make the occurrenceID globally unique. |
recordedBy | A list (concatenated and separated) of names of people, groups or organisations responsible for recording the original Occurrence. The primary collector or observer, especially the one who applies a personal identifier (recordNumber), should be listed first. |
recordNumber | An identifier given to the Occurrence at the time it was recorded. Often serves as a link between field notes and an Occurrence record, such as a specimen collector's number. |
typeStatus | A list (concatenated and separated) of nomenclatural types (type status, typified scientific name, publication) applied to the subject. |
eventDate | The date-time or interval during which an Event occurred. For occurrences, this is the date-time when the event was recorded. Not suitable for a time in a geological context. |
DAY | The integer day of the month on which the Event occurred. |
MONTH | The integer month in which the Event occurred. |
YEAR | The four-digit year in which the Event occurred, according to the Common Era Calendar. |
taxonomicStatus | The status of the use of the scientificName as a label for a taxon. Requires taxonomic opinion to define the scope of a taxon. Rules of priority then are used to define the taxonomic status of the nomenclature contained in that scope, combined with the expert's opinion. It must be linked to a specific taxonomic reference that defines the concept. |
country | The name of the country or major administrative unit in which the Location occurs. |
countryCode | The standard code for the country in which the Location occurs. |
stateProvince | The name of the next smaller administrative region than country (state, province, canton, department, region etc.) in which the Location occurs. |
county | The full, unabbreviated name of the next smaller administrative region than stateProvince (county, shire, department etc.) in which the Location occurs. |
locality | The specific description of the place. Less specific geographic information can be provided in other geographic terms (higherGeography, continent, country, stateProvince, county, municipality, waterBody, island, islandGroup). This term may contain information modified from the original to correct perceived errors or standardise the description. |
decimalLatitude | The geographic latitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic centre of a Location. Positive values are north of the Equator, negative values are south of it. Legal values lie between -90 and 90, inclusive. |
decimalLongitude | The geographic longitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic centre of a Location. Positive values are east of the Greenwich Meridian, negative values are west of it. Legal values lie between -180 and 180, inclusive. |
geodeticDatum | The ellipsoid, geodetic datum or spatial reference system (SRS) upon which the geographic coordinates given in decimalLatitude and decimalLongitude are based. |
minimumElevationInMetres | The lower limit of the range of elevation (altitude, usually above sea level), in metres. |
vernacularName | A common or vernacular name. |
GENUS | The full scientific name of the genus in which the taxon is classified. |
specificEpithet | The name of the first or species epithet of the scientificName. |
taxonRank | The taxonomic rank of the most specific name in the scientificName. |
infraspecificEpithet | The name of the lowest or terminal infraspecific epithet of the scientificName, excluding any rank designation. |
acceptedNameUsage | The full name, with authorship and date information, if known, of the currently accepted taxon. |
scientificName | The full scientific name, with authorship and date information, if known. When forming part of an Identification, this should be the name in lowest level taxonomic rank that can be determined. This term should not contain identification qualifications, which should instead be supplied in the IdentificationQualifier term. |
kingdom | The full scientific name of the kingdom in which the taxon is classified. |
phylum | The full scientific name of the phylum or division in which the taxon is classified. |
class | The full scientific name of the class in which the taxon is classified. |
order | The full scientific name of the order in which the taxon is classified. |
family | The full scientific name of the family in which the taxon is classified. |
Biographical notes
A Russian botanist, Vladimir Leontievitch Komarov (1869-1945) was born at St. Petersburg and graduated from St. Petersburg University in 1894 (
A rectangular box (N 41.0928 - N 45.3540, E 122.8110 - E 132.0800) encompassing the two-year expedition between 1896 and 1897 was used to determine the contribution of species occurrence data from all vascular plant datasets in the region (Fig.