Biodiversity Data Journal :
Data Paper (Biosciences)
|
Corresponding author: Michael S. Romanov (michael_romanov@inbox.ru)
Academic editor: Dmitry Schigel
Received: 06 Oct 2021 | Accepted: 22 Nov 2021 | Published: 25 Nov 2021
© 2021 Ilya Ukolov, Michael Romanov, Vladimir Arkhipov, Mikhail Kalyakin, Olga Voltzit
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Ukolov II, Romanov MS, Arkhipov VYu, Kalyakin MV, Voltzit OV (2021) Ru-Birds.RU, bird observations from Russia and neighbouring regions: an occurrence dataset. Biodiversity Data Journal 9: e76202. https://doi.org/10.3897/BDJ.9.e76202
|
The dataset covers bird observation occurrences in Russia and neighbouring regions (ex-USSR countries and some other countries of Eastern and Western Europe) from 2001–2021. It is based on the internet platform “Online bird observation diaries” (ru-birds.ru), which allows professional ornithologists and amateur bird lovers to exchange their results and to jointly build a common collection of data. The taxonomic backbone of the occurrence dataset follows the standardised GBIF checklist dataset to ensure correct cross-linking of the names.
Currently, the database contains 541,900 records of occurrences of 713 bird species, which makes it the largest dataset on birds of Russia and neighbouring regions published in GBIF.
The occurrence dataset contributes to filling gaps in the bird distribution in Russia and Eastern Europe. It can be used for a deeper look at their populations, phenology and migrations over this area. The availability of special tools for verification of the entered information makes the database a valuable tool for analysing occurrences of non-native species, studying vagrancy, immigration, invasions and range dynamics.
The dataset is regularly updated. Over the 11 months of 2021, it has increased by 98,165 occurrences.
dataset, birds, Aves, occurrences, birdwatchers, GIS, Russia
The accuracy of the analysis largely depends on the volume and accuracy of the original information (
In the 20th century and earlier, the collection of ornithological data was the privilege of a limited number of professional researchers. During recent decades, the birdwatching movement has been gaining popularity, covering more and more countries across the world. That is why, in many countries, data collection on bird distribution, phenology, migrations and other similar work is being performed mainly by amateurs (
At the same time, with the development of information technology, the possibilities for collecting, exchanging and analysing information have increased many times over. In particular, online databases are being created, which allows us not only to accumulate data collected by independent observers, but also to carry out well-founded scientific analyses, based on samples from these data. These resources provide a convenient way for birdwatchers to share their materials and to become acquainted with the results of their colleagues. Obviously, the larger the number of observers, the more complete the information accumulated in a particular database. Some of these databases have grown into major international online platforms, such as eBird (
The movement of birdwatchers is also developing in Russia, but until 2013, its activity was scattered since there was no single online platform where they could interchange their results. The situation began to change with the creation of the database “Online bird observation diaries” in 2013 (
From 2019, this online dataset was published on GBIF (
The dataset covers bird observations over a large area that is very poorly understood in terms of bird distribution compared to other parts of the world (
The main purpose of this study is the presentation of a dataset on bird observation occurrences in Russia and neighbouring regions, published in the GBIF as a Darwin Core Archive. In addition, the study aims to provide some practically useful information on the related online database and its interface.
The presented dataset is a mirror of the online database “Online bird observation diaries” (Fig.
Front page of the database website RU-BIRDS.RU.
The database allows registered users to keep records of bird observations, see results of their colleagues, generate various analytical reports and much more. The main goals of creating an online database are:
From such a database, scientists can obtain extremely useful data on bird species distribution over time, detect species disappearing from a certain area, species invasions, migrations, occurrences of rare/endangered/non-indigenous species etc. Then they can use this information in scientific research or reports on national ecosystem services (
The database was created on the Russian software “1C: Enterprise”, version 8.3 (
One of the important features of the database is an intuitive interface (different for PC and mobile devices), which allows users to easily upload their data and provides means for error control. The interface is provided with a map showing observation points. A user can set locations of occurrences directly on the map or manually, by entering coordinates in the text form, either by groups or one by one.
It is possible to build and execute queries and produce a variety of reports. The reporting system allows users to make a selection for any period, region, species and observer and sort the output by a number of fields. Moreover, there is a possibility to manually outline the research area and obtain a query for it (Fig.
Useful functionality for ornithologists, ecologists and modellers include:
There are means for providing compatibility of the database with other ornithological projects. The first of them is the automatic import of data into the web GIS "Faunistics" (
Geographic coverage of the dataset (from the dataset GBIF website).
Being created by an amateur ornithologist, the database is well adapted for birdwatchers and possesses a number of special means related to birding activity including:
As an example of database analysis, let us consider the possibility of generating checklists. It is possible to generate a report for any region (both administrative and arbitrary), which presents a list of species in a given region, detailed by month. Different seasons are highlighted in colour (Fig.
Russian Federation, 12 ex-USSR countries (Latvia, Kazakhstan, Belarus, Ukraine, Uzbekistan, Azerbaijan, Georgia, Armenia, Lithuania, Tajikistan, Estonia, Kyrgyzstan) and 12 more countries of Eastern and Western Europe (Great Britain, Belgium, Czech Republic, Luxembourg, Germany, France, Turkey, Finland, Poland, Hungary, Greece and The Netherlands).
There are two different ways of user interaction with the database, either through a browser (usually from PC; Fig.
There are two main sampling methods: observation card and route card. The difference between them is that an observation card is essentially a checklist of observed birds together with their quantities and coordinates, while a route card is always attached to a certain route, which should be created separately. Specifically, a route card has additional fields, such as distance to the birds, the number of males and females, length of the route and time spent.
When working with an observation card, the observer enters observed species in each location together with the number of individuals. It can be done either manually or by selecting the species from the reference species list. Manual entry activates the auto-complete function, which greatly simplifies the input, allowing them to select the desired species by its first letters.
The most convenient way to indicate the location is through using a mobile application in real-time mode. In this case, the coordinates are determined by the mobile device and automatically entered in the field. When working from a PC, the user can enter the location by clicking on the digital map. Finally, the coordinates field can be filled or edited manually.
The database is filled both by professional ornithologists and amateurs; therefore, it is not immune to mistakes in species identification. That is why it possesses a number of means for control of data quality. Some potential errors are prevented at the stage of data entry due to the thought-out interface.
One of them is the standardised list of species (
Another useful feature is the automatic determination of coordinates (by clicking on the map when working from a PC or fully automatically when using a smartphone on-the-go). Finally, some fields, such as date, can be also pre-filled automatically.
All these greatly reduce the likelihood of an error.
The dataset covers bird occurrences from 25 countries, including Russia, ex-USSR and some other European countries (Fig.
36.782 and 81.522 Latitude; 20.933 and -169.614 Longitude.
The taxonomic coverage of the dataset includes 25 orders of birds, following GBIF Backbone Taxonomy (
Order | No. of species | No. of occurrences |
Passeriformes | 332 | 337438 |
Charadriiformes | 127 | 48960 |
Anseriformes | 56 | 44142 |
Accipitriformes | 36 | 26775 |
Piciformes | 14 | 19699 |
Columbiformes | 10 | 16711 |
Pelecaniformes | 9 | 9466 |
Gruiformes | 20 | 7831 |
Falconiformes | 10 | 5975 |
Podicipediformes | 5 | 5010 |
Apodiformes | 4 | 4192 |
Galliformes | 19 | 3693 |
Cuculiformes | 5 | 2886 |
Strigiformes | 17 | 2328 |
Suliformes | 7 | 1837 |
Ciconiiformes | 3 | 1633 |
Coraciiformes | 6 | 1525 |
Bucerotiformes | 1 | 697 |
Gaviiformes | 5 | 553 |
Procellariiformes | 10 | 215 |
Caprimulgiformes | 2 | 173 |
Otidiformes | 3 | 93 |
Pteroclidiformes | 2 | 42 |
Phoenicopteriformes | 1 | 16 |
Psittaciformes | 1 | 9 |
Rank | Scientific Name | Common Name |
---|---|---|
class | Aves | Birds |
This work is licensed under a Creative Commons Attribution Non-Commercial (CC-BY-NC) 4.0 Licence.
Ru-Birds.RU, Birds observations from Russia and neighbouгing regions.
Column label | Column description |
---|---|
occurrenceID | An identifier for the Occurrence. |
licence | A legal document giving official permission to do something with the resource. |
references | A related resource that is referenced, cited or otherwise pointed to by the described resource. |
datasetName | The name identifying the dataset from which the record was derived. |
basisOfRecord | The specific nature of the data record. |
geodeticDatum | The ellipsoid, geodetic datum or spatial reference system (SRS) upon which the geographic coordinates given in decimalLatitude and decimalLongitude are based. |
recordedBy | A person, group or organisation responsible for recording the original Occurrence. |
individualCount | The number of individuals present at the time of the Occurrence. |
organismQuantity | A number or enumeration value for the quantity of organisms. |
organismQuantityType | The type of quantification system used for the quantity of organisms. |
organismName | A textual name or label assigned to an Organism instance. |
eventDate | The date-time or interval during which an Event occurred. For occurrences, this is the date-time when the event was recorded. Not suitable for a time in a geological context. |
year | The four-digit year in which the Event occurred, according to the Common Era Calendar. |
month | The integer month in which the Event occurred. |
day | The integer day of the month on which the Event occurred. |
countryCode | The standard code for the country in which the Location occurs. |
stateProvince | The name of the next smaller administrative region than country (state, province, canton, department, region etc.) in which the Location occurs. |
decimalLatitude | The geographic latitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic centre of a Location. Positive values are north of the Equator, negative values are south of it. Legal values lie between -90 and 90, inclusive. |
decimalLongitude | The geographic longitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic centre of a Location. Positive values are east of the Greenwich Meridian, negative values are west of it. Legal values lie between -180 and 180, inclusive. |
coordinateUncertaintyInMeters | The horizontal distance (in metres) from the given decimalLatitude and decimalLongitude describing the smallest circle containing the whole of the Location. Leave the value empty if the uncertainty is unknown, cannot be estimated or is not applicable (because there are no coordinates). Zero is not a valid value for this term. |
verbatimCoordinateSystem | The coordinate format for the verbatimLatitude and verbatimLongitude or the verbatimCoordinates of the Location. |
identifiedBy | A list (concatenated and separated) of names of people, groups or organisations who assigned the Taxon to the subject. |
scientificName | The full scientific name of the taxon (species or subspecies). |
kingdom | The full scientific name of the kingdom in which the taxon is classified. |
phylum | The full scientific name of the phylum or division in which the taxon is classified. |
class | The full scientific name of the class in which the taxon is classified. |
order | The full scientific name of the order in which the taxon is classified. |
family | The full scientific name of the family in which the taxon is classified. |
genus | The full scientific name of the genus in which the taxon is classified. |
specificEpithet | The name of the first or species epithet of the scientificName. |
infraspecificEpithet | The name of the lowest or terminal infraspecific epithet of the scientificName, excluding any rank designation. |
taxonRank | The taxonomic rank of the most specific name in the scientificName. |
scientificNameAuthorship | The authorship information for the scientificName, formatted according to the conventions of the applicable nomenclaturalCode. |
It is interesting to see which of the species in the dataset are most common and which are rarest. The ten most common and the 36 rarest species are presented in Table
Common name | Scientific name | No. of occurrences |
---|---|---|
Great Tit |
Parus major |
24919 |
Hooded crow |
Corvus cornix |
19076 |
Mallard |
Anas platyrhynchos |
15519 |
Fieldfare |
Turdus pilaris |
14215 |
Chaffinch |
Fringilla coelebs |
13462 |
Blue tit |
Parus caeruleus |
11833 |
Black-billed Magpie |
Pica pica |
11418 |
Rock Dove |
Columba livia |
10791 |
Pied Wagtail |
Motacilla alba |
10601 |
Common Raven | Corvus corax | 10026 |
Common name | Scientific name | No. of occurences |
---|---|---|
Tree Swallow |
Tachycineta bicolor |
1 |
Green-winged Teal |
Anas carolinensis |
1 |
Radde’s Accentor |
Prunella ocularis |
1 |
Black-headed Mountain-Finch |
Leucosticte brandti |
1 |
Asian Dowitcher |
Limnodromus semipalmatus |
1 |
Chinese Egret |
Egretta eulophotes |
1 |
Gray Bunting |
Ocyris variabilis |
1 |
Aquatic Warbler |
Acrocephalus paludicola |
1 |
Relict Gull |
Larus relictus |
1 |
Semipalmated Plover |
Charadrius semipalmatus |
1 |
Pallid Scops Owl |
Otus brucei |
1 |
Buller's Shearwater |
Puffinus bulleri |
1 |
Spotted Greenshank |
Tringa guttifer |
1 |
Black-faced Spoonbill |
Platalea minor |
1 |
Sociable Plover |
Chettusia gregaria |
1 |
Least Sandpiper |
Calidris minutilla |
1 |
Surf Scoter |
Melanitta perspicillata |
1 |
Japanese Accentor |
Prunella rubida |
1 |
Green-backed Heron |
Butorides striata |
1 |
Sooty Shearwater |
Puffinus griseus |
1 |
Rufous-bellied Woodpecker |
Hypopicus hyperythrus |
1 |
Chinese Pond-Heron |
Ardeola bacchus |
1 |
Wilson's Snipe |
Gallinago delicata |
1 |
Golden-crowned Sparrow |
Zonotrichia atricapilla |
1 |
Common Loon |
Gavia immer |
1 |
Great Knot |
Calidris tenuirostris |
1 |
Streaked Shearwater |
Calonectris leucomelas |
1 |
Plumbeous Water-Redstart |
Rhyacornis fuliginosa |
1 |
Black-headed Penduline-Tit |
Remiz macronyx |
1 |
Brown-headed Gull |
Larus brunnicephalus |
1 |
Sombre Tit |
Parus lugubris |
1 |
Ivory Gull |
Pagophila eburnea |
1 |
Marbled Murrelet |
Brachyramphus marmoratus |
1 |
Yellow-legged Buttonquail |
Turnix tanki |
1 |
Pallas' Sea Eagle |
Haliaeetus leucoryphus |
1 |
Pechora pipit |
Anthus gustavi |
1 |
The project is continuously developing. The database is currently being integrated with EuroBirdPortal (https://eurobirdportal.org). The dataset is actively used and cited in scientific publications (e.g.
The database fills the information vacuum concerning the number and distribution of birds that exist on the territory of Russia and neighbouring countries. Placing a dataset on GBIF and publishing its description makes the database accessible to an international circle of specialists. At the same time, this expands the potential circle of project participants.
We hope that, due to this, the volume of the database and its geographic coverage will continue to grow and, in the future, we will see new scientific papers using data from the “Online bird observation diaries” system.
Participation of M. Kalyakin and O. Voltzit was done as a part of the Scientific Project of the State Order of the Government of Russian Federation to Lomonosov Moscow State University No. 121032300105-0. Additionally, we are deeply grateful to all bird lovers and professional bird watchers who contributed their data to this database.
Ilya Ukolov created the online database and the GBIF occurrence dataset, edited and commented on the manuscript. Michael Romanov and Vladimir Arkhipov wrote the manuscript, discussed and edited it. Mikhail Kalyakin and Olga Volzit provided official support for the database and edited the manuscript.