|
Biodiversity Data Journal :
Research Article
|
|
Corresponding author: Chi Man Leong (dannycmleong@uic.edu.cn)
Academic editor: Sheryl Yap
Received: 19 Mar 2025 | Accepted: 03 Jun 2025 | Published: 12 Jun 2025
© 2025 Kaiyun Zheng, Mark K. L. Wong, Toby Tsang, Chi Man Leong
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Zheng K, Wong MKL, Tsang TPN, Leong CM (2025) Bridging Citizen Science and Expert Surveys in urban biodiversity monitoring: Insights from insect diversity in Macao. Biodiversity Data Journal 13: e153402. https://doi.org/10.3897/BDJ.13.e153402
|
|
Urban ecosystems present unique challenges for biodiversity monitoring, demanding efficient methods to document species diversity in rapidly changing environments. This study quantifies insect diversity in Macao SAR — a hyper-urbanised region — by integrating data on 1,339 species documented in expert-led surveys and 1,012 species recorded in citizen-science observations between 2019 and 2023. Striking divergence emerged between the expert and citizen-science datasets: only 462 species (33.5% of total diversity) were detected by both groups, with experts documenting 877 unique taxa often requiring specialised collection or morphological analysis, while citizen scientists contributed 550 distinctive species through spatially explicit, image-based records. Together, these approaches achieved 96.59% estimated species coverage within five years, demonstrating that combining community-driven data with expert methods accelerates comprehensive biodiversity documentation. Citizen-science platforms played a pivotal role by providing high-resolution geotagged imagery which enabled experts to validate records and resolve taxonomic ambiguities. Meanwhile, expert surveys detected cryptic taxa overlooked by citizen scientists. The rapid species coverage achieved through this synergy highlights the transformative potential of integrated frameworks. By mobilizing the scalability of citizen science to fill spatial and taxonomic gaps, while leveraging expert precision to ensure rigour, urban biodiversity monitoring can adapt to the rapid pace of ecological change. These findings advocate for collaborative strategies that harness public participation and scientific validation to optimise conservation efforts in data-deficient and highly-stressed ecosystems.
citizen science, iNaturalist, conservation, biodiversity, urban ecosystems
With the acceleration of global urbanisation, cities are increasingly recognised not only as centres of human activity, but also as complex, dynamic ecosystems that harbour significant biodiversity. Biodiversity studies are essential for understanding how species respond to environmental changes and for developing effective strategies for natural resource management and ecosystem sustainability (
Traditionally, expert-led surveys — employing systematic methods like plot sampling, transect walks and expedition trips — have been the cornerstone of biodiversity monitoring (
Despite its promise, CS is not without challenges. Significant biases persist in the geographic and taxonomic coverage of CS data, often reflecting a disparity between urban and natural habitats (
To understand how CS compares with expert-led surveys and contributes to biodiversity monitoring in an urban, non-Western context, we focused on Macao SAR, a city in subtropical Asia which houses one of the world’s densest human populations in a highly urban landscape (
Data Collection
We compiled two comprehensive, but distinct datasets of all insect species occurrences within Macao SAR: a CS dataset and an expert dataset. To compile the CS dataset, CS data were obtained from the iNaturalist platform (www.inaturalist.org), which facilitates public participation in biodiversity monitoring. We extracted all records for the class Insecta within Macao — defined by the geographic bounds 22.1068°N to 22.2335°N and 113.5367°E to 113.5653°E — including all observations recorded up to 31 December 2023. To ensure data quality, we filtered the dataset to include only research-grade observations (i.e. records with majority agreement on identifications and accompanied by photographs, precise georeferences and observation dates). The expert dataset was compiled from a comprehensive expert-derived insect checklist for Macao, assembled from handbooks, monographs, published articles and official records (
Data Analysis
To quantify insect community diversity and assess sampling trends in Macao, we applied rarefaction and species accumulation analyses to the CS dataset.
For comparative analysis, the insect species recorded via iNaturalist were consolidated into a comprehensive list and then compared with the expert checklist. Species were categorised into three groups: those exclusively documented on iNaturalist (CS dataset-only), those unique to the expert dataset (expert dataset-only) and those shared between both sources. This categorisation was performed separately for each of the 14 insect orders recorded to examine differences in total species richness and assess the extent of species overlap. To assess the distribution of species richness data across the three groups (CS dataset-only, expert dataset-only and shared species), we performed a Shapiro-Wilk test for normality using the shapiro.test() function from the stats package (
To assess the overall trend in insect diversity across the two data sources, we conducted a linear regression analysis. To evaluate overall insect diversity trends across the two data sources, we conducted a linear regression analysis using the lm() function from the stats package (
To quantify insect community diversity and assess sampling trends in Macao, we applied rarefaction and species accumulation analyses to the CS dataset. To estimate species coverage for the CS dataset, both overall and for each insect order, we employed the iNEXT model using the iNEXT() function from the iNEXT package (
All statistical analyses were performed in R (version 2023.03.0+384, R 4.2.3;
Overview of data sources
We compiled two distinct datasets to assess urban insect diversity in Macao. The CS dataset, derived from iNaturalist, comprised 23,535 observations recorded between 2010 and 2023 (Fig.
Map showing observations of insect species occurring within the boundaries of Macao (in black) in the iNaturalist.org dataset during the study period of 2010–2023 (A). Plots show cumulative trends over the study period in the number of identifiers which had both a verifiable User ID and a Research Grade identification (B), the number of observations of insects (C) and the number of insect species observed (D).
Insect Diversity in iNaturalist
CS contributions revealed pronounced variation in species documentation across insect orders (Table
Comparison of insect species records in the CS and expert datasets. The table presents the number of species recorded for each insect order in both CS and expert datasets, as well as the number of shared species between the two datasets. The columns "% Shared (Expert = E)" and "% Shared (Citizen Science = CS)" represent the proportion of shared species out of the total recorded species for each order in expert and CS datasets, respectively.
|
Order |
E |
CS |
Shared Sp |
% Shared (E) |
% Shared (CS) |
|
Lepidoptera |
565 |
418 |
219 |
39% |
52% |
|
Coleoptera |
233 |
182 |
83 |
36% |
46% |
|
Hymenoptera |
185 |
97 |
48 |
26% |
49% |
|
Hemiptera |
139 |
142 |
49 |
35% |
35% |
|
Diptera |
94 |
50 |
16 |
17% |
32% |
|
Odonata |
49 |
49 |
33 |
67% |
67% |
|
Orthoptera |
46 |
45 |
11 |
24% |
24% |
|
Blattodea |
14 |
12 |
5 |
36% |
42% |
|
Mantodea |
5 |
6 |
2 |
40% |
33% |
|
Zygentoma |
3 |
1 |
0 |
0% |
0% |
|
Neuroptera |
2 |
3 |
1 |
50% |
33% |
|
Phasmatodea |
1 |
3 |
1 |
100% |
33% |
|
Dermaptera |
1 |
3 |
0 |
0% |
0% |
|
Siphonaptera |
1 |
0 |
0 |
0% |
0% |
|
Thysanoptera |
1 |
0 |
0 |
0% |
0% |
|
Embioptera |
0 |
1 |
0 |
0% |
0% |
|
Total |
1,339 |
1,012 |
462 |
Avg. 29% |
Avg. 28% |
Comparison of citizen science and expert records of insect diversity
To assess the effectiveness of CS versus expert-led surveys of insect biodiversity, we compared the insect species lists from iNaturalist and the expert-derived literature checklist. Table 1 details the number of species per insect order for both datasets, along with the number of shared species. For instance, Lepidoptera were represented by 565 species in the expert dataset and 418 in the CS dataset, while Coleoptera and Hemiptera showed similar patterns, albeit with slightly lower absolute species richness in the CS data. Conversely, the expert dataset documented substantially more species of Hymenoptera (185) and Diptera (94) compared to the CS records (97 and 50, respectively). The percentage of shared species — calculated as the proportion of species common to both sources —varied by taxonomic order; several orders, such as Odonata, exhibited high overlap amongst datasets (67%), whereas others, such as Phasmatodea and Hymenoptera, had substantially fewer shared species. Across all orders, the average proportion of species common to both datasets was approximately 29% for the expert dataset and 28% for the CS dataset, indicating a moderate level of concordance. Since the data for all three groups — CS, expert (E) and shared species richness —were not normally distributed (Shapiro-Wilk test, p < 0.001), we used the non-parametric Friedman test to compare them. The Friedman test results (χ² = 20.237, df = 2, p < 0.001) indicate significant differences in species representation amongst the three groups, suggesting systematic variation in how insect biodiversity is documented across datasets.
Linear regression analysis of insect orders
To assess whether CS data reliably capture overall insect diversity patterns, we performed a linear regression analysis comparing the number of species recorded per insect order by CS and expert surveys (Fig.
Linear regression between values of insect species richness in the CS and expert datasets. The x-axis represents the number of species documented in expert datasets, while the y-axis shows the number of species recorded in the CS dataset. Each point represents an insect order, labelled accordingly. The solid red line is the regression line, showing the overall relationship between the two datasets and the dashed green line represents the 1:1 line, where species richness would be equal between the two sources.
Species Coverage and Temporal Trends
The CS dataset achieved 96.59% estimated species coverage across all insect orders, as indicated by species accumulation and rarefaction curves (Fig.
Sample completeness curves for A Blattodea; B Coleoptera; C Dermaptera; D Diptera; E Embioptera; F Hemiptera; G Hymenoptera; H Lepidoptera; I Mantodea; J Neuroptera; K Odonata; L Orthoptera; M Phasmida; N Zygentoma. Due to only a single species being observed, curves for Zygentoma and Embioptera could not be visualised.
This study provides a detailed assessment of how well CS initiatives document insect diversity in a highly-urbanised city and how they complement traditional expert-derived records. The results present a complex picture: CS data show a strong correlation with expert data in terms of insect orders and diversity (R² = 0.9786, Fig.
First, the rapid accumulation of CS data — evidenced by the marked increase in observations and species richness post-2018 in Macao (Fig.
However, despite these strengths, our analysis also reveals inherent limitations. Notably, certain insect orders — such as Hymenoptera and Diptera — are systematically under-represented in the CS dataset compared to the expert‐derived records. This disparity likely stems from several factors. First, smaller and less conspicuous species are inherently more challenging for non-specialists to detect and identify, leading to a bias towards larger, more visually striking organisms like many Lepidoptera (
Our analysis of species diversity by insect order (Table
From an applied ecology perspective, the implications of our findings extend to urban conservation and management. The high sampling coverage and rapid data accumulation achieved by CS platforms have the potential to revolutionise urban biodiversity monitoring. For instance, the extensive spatial and temporal data provided by CS can provide information for urban planning initiatives by identifying biodiversity hotspots and areas where invasive species are emerging (
In addition, targeted training programmes for citizen scientists could enhance their ability to detect and accurately record under-represented taxa, such as Hymenoptera and Diptera (
In conclusion, while CS data have some limitations — particularly in detecting rare or cryptic species — they serve as an invaluable complement to expert surveys in urban biodiversity monitoring. Our study highlights this by documenting a rare species Mortonagrion hirosei (
This research is supported by the Beijing Normal-Hong Kong Baptist University (UICR0100020, UICR0700050-23, UlCR0600072).