Biodiversity Data Journal :
Data Paper (Biosciences)
|
Corresponding author: Daphne Z. Hoh (daphnehohzhiwei@gmail.com)
Academic editor: Etielle Andrade
Received: 11 Jul 2022 | Accepted: 29 Oct 2022 | Published: 03 Nov 2022
© 2022 Daphne Hoh, Chia-Ling Fong, Huai Su, Pengyu Chen, Chia-Chen Tsai, Kelly Tseng, Melissa Liu
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Hoh DZ, Fong C-L, Su H, Chen P, Tsai C-C, Tseng KWH, Liu MJY (2022) A dataset of sea turtle occurrences around the Taiwan coast. Biodiversity Data Journal 10: e90196. https://doi.org/10.3897/BDJ.10.e90196
|
|
We describe a dataset of sea turtle sightings around the coast of Taiwan and its islands (
This dataset contains 3,515 occurrence records of sea turtles (Cheloniidae) and is currently the largest public dataset of sea turtle sighting records in Taiwan. Post-publication of this dataset to the GBIF platform demonstrated that the number of Green sea turtle Chelonia mydas records in Taiwan is one of the largest in the world (last accessed date: 15-10-2022). The data served as the foundation for understanding biogeography and sea turtle ecology in Taiwan's coastal waters.
sighting data, citizen science, coastal waters, photo identification
People involved in citizen-science programmes have played a major role in contributing occurrence data of diverse organisms worldwide (
Sea turtles are migratory Chelonian species that travel between nesting and foraging sites during their life cycle. In Taiwan, much is known about the nesting ecology (
The purpose of preparing the current dataset was to publicly open the data for advancement, especially in the scientific and conservation communities. The data of TurtleSpot Taiwan have allowed a basic understanding in biogeography of foraging sea turtles in Taiwan and some ecological observations of sea turtles in the wild, such as witnessing the recovery of some injured turtles, types of behaviour, intra- and inter-species interactions and physical abnormalities.
1. Data collection: Citizens who encountered sea turtles reported their sightings to us via our Facebook Group. Reporters post a regular post to the Group following our reporting format (Fig.
Format of reporting sea turtle sighting to TurtleSpot Taiwan. Facebook icon made by Freepik via Flaticon.
2. Quality control of sighting report received: Each sighting reported to the Group was first checked by the group administration prior to approval. The group administration checked if the post followed the reporting format mentioned above and the sighting provider will be requested to provide any of the missing information unless unavailable. Once the submitted post passed the quality check, the post will be approved by the group administration to be visible in the Facebook Group.
3. Data transcription: Sighting information contained in the post/report was transcribed into Google Sheets as raw data.
4. Determine additional information from the sighting report: We recorded additional information about the occurrence through the sighting reporter’s notes of onsite observation and our identification through the provided photos and videos. Additional information included the biological characteristics of the sighted individual turtle (sex, life stage, behaviour, associated taxa) and physical abnormality of the turtle (e.g. fishing line entanglement, tumour and others).
5. Sea turtle individual identification: If clear photos of the left face of the sighted turtle were provided in the report, we use the Photo Identification (Photo ID) method to identify the turtle individual. Currently, we use two methods to perform Photo ID: (1) compare the facial scute pattern manually and (2) HotSpotter (
6. Open data preparation: The language used in most of the recorded data is Traditional Chinese. Nevertheless, valuable information including sighting location, method, common name and life stages which allowed future data use was translated into English. We converted the occurrence data into Darwin Core Archive standard in Google Sheets, an online spreadsheet tool, using the Darwin Core Archive Assitant Add-on (
Most of the sighting data were from Taiwan and its islands (Fig.
Four species of sea turtles were recorded in the dataset, including Green turtle (Chelonia mydas), Hawksbill (Eretmochelys imbricata), Olive Ridley (Lepidochelys olivacea) and Kemp's Ridley (Lepidochelys kempii). Most of the sea turtle sightings in the dataset were of Green and Hawksbill turtles (97.3% and 2.4%). Occurrences that failed to assign species (n = 11) were recorded as Cheloniidae.
Rank | Scientific Name | Common Name |
---|---|---|
kingdom | Animalia | Animal |
phylum | Chordata | |
subphylum | Vertebrata | |
superclass | Reptilia | |
order | Testudines | |
suborder | Cryptodira | |
superfamily | Chelonioidea | Sea turtle |
family | Cheloniidae | |
genus | Chelonia | |
genus | Eretmochelys | |
genus | Lepidochelys | |
species | mydas | Green |
species | imbricata | Hawksbill |
species | olivacea | Olive Ridley |
species | kempii | Kemp's Ridley |
TurtleSpot was officially founded in June 2017. Hence, most sighting records range from June 2017 to December 2021, comprising about 89% (n = 3,128) of the dataset (Fig.
The dataset in the current work is licensed under a Creative Commons Attribution (CC-BY) 4.0 Licence. Any image and video accessed through the URL from the dataset are licensed under the Creative Commons Attribution (CC-BY-NC) 4.0 Licence.
The dataset contains data of two major categories: data associated with the occurrence and data related to the biological characteristics of the sighted turtle individual. The former category consists of information during the sighting event such as date, time, location, geographical coordinates, observation method and species. The latter category characterised the observed turtle individual using our controlled vocabulary (see Suppl. material
Some additional remarks on the dataset:
1. On average, 57 sighting reports were received monthly;
2. More than half (n = 2,235; 63.6%) of the data were provided by citizen scientists. The remaining data (n = 1,280; 36.4%) was records contributed by two of the co-authors as part of the citizen-science programme;
3. So far, only turtle sightings in Taiwan were given a turtle ID.
Data fields were standardised into 46 Darwin Core terms as listed in the following table. The column label and some of the relevant descriptions are written as listed in the List of Darwin Core terms (accessed June 2022; created by the TDWG Darwin Core Maintenance Group). A more specific description of the column used in the current dataset was also added if applicable.
The dataset is publicly opened in GBIF (see Resource link) and users can download the occurrence dataset in CSV format through the ‘Download’ section of the dataset page. The dataset can also be downloaded using GBIF API-based tools such as ‘rgbif’ and ‘pygbif’ for further analyses.
Column label | Column description |
---|---|
occurrenceID | An identifier for the Occurrence (as opposed to a particular digital record of the occurrence). |
catalogNumber | An identifier unique for the record within the dataset. |
rightsHolder | A person or organisation owning or managing rights over the resource. |
recordedBy | Names of the sighting reporter/citizen scientist. |
year | The four-digit year in which the Event occurred, according to the Common Era Calendar. Year of sighting. |
month | The integer month in which the Event occurred. Month of sighting. |
day | The integer day of the month on which the Event occurred. Day of sighting. |
eventDate | Sighting date. |
eventTime | The time or interval during which an Event occurred. |
country | The name of the country in which the Location occurs. |
countryCode | The standard code for the country in which the Location occurs. |
higherGeography | A list of geographic names less specific than the information captured in the locality term. |
locality | Name of the sighting location or dive site. |
locationRemarks | More specific location compared to locality, usually the name of the dive site. |
decimalLatitude | The geographic latitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic centre of a Location. Positive values are north of the Equator, negative values are south of it. Legal values lie between -90 and 90, inclusive. |
decimalLongitude | The geographic longitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic centre of a Location. Positive values are east of the Greenwich Meridian, negative values are west of it. Legal values lie between -180 and 180, inclusive. |
coordinateUncertaintyInMetres | The horizontal distance (in metres) from the given decimalLatitude and decimalLongitude describing the smallest circle containing the whole of the Location. |
georeferenceRemarks | A note stating that our GPS coordinates were estimated from the dive site or sighting location. |
geodeticDatum | The ellipsoid, geodetic datum or spatial reference system (SRS) upon which the geographic coordinates given in decimalLatitude and decimalLongitude are based. |
verbatimDepth | The original description of the depth below the local surface. This is an estimation provided by the sighting reporter. |
samplingProtocol | Sighting method of the Occurrence. |
associatedReferences | An URL links to the Facebook post from the sighting reporter in our Facebook Group, which we define as a single Occurrence event. The link may be broken if the sighting reporter decided to delete the post. |
basisOfRecord | The specific nature of the data record. |
individualCount | Our purpose of preparing this occurrence dataset is to identify each turtle individual. Hence, if a sighting report contains more than one sea turtle, this occurrence record will be duplicated as a new row. Hence, the individual count of each data is only '1'. |
kingdom | The full scientific name of the kingdom in which the taxon is classified. |
taxonRank | The taxonomic rank of the most specific name in the scientificName. |
vernacularName | A common or vernacular name. |
scientificName | The full scientific name. |
taxonID | An identifier for the set of taxon information (data associated with the Taxon class). We use the URL of species in GBIF Backbone Taxonomy checklist. |
behaviour | The behaviour shown by the subject at the time the Occurrence was recorded. |
occurrenceRemarks | Condition of the turtle during the sighting (e.g. alive, dead, stranded). |
dynamicProperties | Any physical abnormality that was observed (e.g. injury, tumour, debris entanglement). |
associatedTaxa | A simple description of association and vernacular name of taxa in which this Occurrence is to each of them. |
lifeStage | The age class or life stage of the organism at the time the Occurrence was recorded. Estimated via physical appearence of the sighted turtle. |
sex | The sex of the biological individual represented in the Occurrence. Determination of sex is only applicable to adult sea turtles through the size of their tail. Sex determination is successful only when photos/videos of the tail are available. |
organismName | A textual name or label assigned to an Organism instance. Mostly named by the citizen scientists. |
licence | A legal document giving official permission to do something with the resource. The licence in this column is applied to the text data of this dataset only. |
identificationID | Turtle ID. Every identifiable turtle individual has a unique ID. |
associatedMedia | An URL links to the website of TurtleSpot Turtle Photo ID database, which show media and information about this particular individual. |
identifiedBy | Names of people who identified the turtle individual via Photo-ID method. |
informationWithheld | Additional information that exists, but that has not been shared in the given record. |
occurrenceStatus | A statement about the presence or absence of a Taxon at a Location. All value is 'present' in the current dataset. |
eventRemarks | Notes about the incomplete sighting eventDate. |
continent | The name of the continent in which the Location occurs. |
county | The full, unabbreviated name of the next smaller administrative region than stateProvince (county, shire, department etc.) in which the Location occurs. |
island | The name of the island on or near which the Location occurs. |
The authors thank Dr Yoko Nozawa and his laboratory members of Biodiversity Research Centre, Academia Sinica, Taiwan, and Taiwan Ocean Conservation Administration for supporting this community project. We thank Hao-Chia Huang, Marble Lo, Anita W, Liu Shih, Lyvia Chong, Aiden Lo, Peil-Shan Zhuang and Te Hsiang Wong for their assistance in data filling. We thank the Taiwan Keep Walking Fund for funding the website construction of Taiwan Turtle Photo ID database. We thank slowIsFast Ltd. for advising the data management process. Taiwan Biodiversity Information Facility (TaiBIF) is funded by the Ministry of Science and Technology, Taiwan (Grant ID: MOST 111-2621-B-001-003), which supported the publication of this data paper.
DZH, CLF, HS and PC conceived the study; DZH and CLF designed the scientific protocol; CLF, HS and CCT performed most of the individual turtle identification; HS and CCT contributed a substantial amount of data; DZH wrote the paper and prepared figures; DZH, CLF, HS, PC, CCT and KWHT performed data curation; MJYL assisted throughout the data management process; all authors approved the manuscript.