Biodiversity Data Journal :
Research Article
|
Corresponding author: Renate Wöger (renate.nowicki@web.de)
Academic editor: John-James Wilson
Received: 21 Sep 2020 | Accepted: 09 Oct 2020 | Published: 27 Nov 2020
© 2020 Renate Wöger, Roland Wöger, Matthias Nuss
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Wöger R, Wöger R, Nuss M (2020) DNA barcodes for Aotearoa New Zealand Pyraloidea (Lepidoptera). Biodiversity Data Journal 8: e58841. https://doi.org/10.3897/BDJ.8.e58841
|
Identification of pyraloid species is often hampered by highly similar external morphology requiring microscopic dissection of genitalia. This becomes especially obvious when mass samples from ecological studies or insect monitoring have to be analysed. DNA barcode sequences could accelerate identification, but are not available for most pyraloid species from New Zealand. Hence, we are presenting a first DNA-barcode library for this group, providing 440 COI barcodes (cytochrome C oxidase I sequences) for 73 morphologically-identified species, which is 29% of Pyraloidea known from New Zealand. Results are analysed using the Barcode Index Number system (BIN) of BOLD and the Automatic Barcode Gap Discovery method (ABGD).
Using BIN, the 440 barcodes reveal 82 clusters. A perfect match between BIN assignment and morphological identification was found for 63 species (86.3%). Four species (5.5%) share BINs, each with two species in one BIN, of which Glaucocharis epiphaea and Glaucocharis harmonica even share the same barcode. In contrast, six species (8.2%) split into two or more BINs, with the highest number of five BINs for Orocrambus ramosellus. The interspecific variation of all collected specimens of New Zealand Pyraloidea averages 12.54%. There are deep intraspecific divergences (> 2%) in seven species, for instance Orocrambus vulgaris with up to 6.6% and Scoparia ustimacula with 5.5%.
Using ABGD, the 440 barcodes reveal 71 or 88 operational taxonomic units (OTUs), depending on the preferred partition. A perfect match between OTU and morphological identification was found for 56 species (76.7%) or 62 species (84.9%). ABGD delivers four or seven species sharing OTUs and four or ten species split into more than one OTU.
Morphological re-examination, as well as the analysis of a concatenated dataset of COI and the nuclear markers EF1α and GADPH for species split into more than one BIN or OTU, do not support a higher number of species. Likewise, there is no evidence for Wolbachia infection as a trigger for these sequence variations.
Pyralidae, New Zealand, Crambidae, Scopariinae, COI, barcode, BIN, ABGD
The DNA barcode is a 658 bp mitochondrial cytochrome oxidase I gene (COI) sequence (
Even though there has already been a great number of DNA barcode campaigns for Lepidoptera with an increasing number of barcode libraries (e.g.
Taxonomically, the pyraloid fauna of New Zealand is well studied (
We surveyed Pyraloidea in New Zealand during January and February of the years 2017 and 2018. Moths were attracted to artificial UV light for 3–4 hours after nightfall. Each collecting locality has been visited one to six times, depending on travel logistics and weather conditions. The moths were collected at 12 sites, of which three sites are in the Taranaki region on the North Island and nine sites are scattered over the South Island. Specimens studied originate from different ecoregions like Podocarp forests and domains of horticulture on the North Island (Taranaki), as well as beech forest (Karamea), tussock grassland (Central Otago) and coastal shrub (Waikawa) on the South Island. The data record is biased towards man-made habitats, as well as geographically towards the South Island.
At each locality, all attracted pyraloids were collected. Specimens were killed using ammonia or ethyl acetate, pinned and dried for transportation.
Specimens were identified by the authors using the database of Landcare Research Auckland (
Nomenclature and taxonomy are based on the Global Information System on Pyraloidea (GlobIZ) (
After fieldwork, collected moths were labelled and sorted to morpho-species. Species with deep morphological variation were additionally sorted into morpho-groups. One to three specimens, depending on the number of available specimens, of every group of unambiguously-identified species and every morpho-group, were chosen for DNA barcoding. DNA barcodes were obtained from the collected material and additionally from loaned specimens from Landcare Research Auckland, New Zealand.
Genomic DNA was extracted from dried abdomens by using the Genomic DNA from tissue kit (Macherey-Nagel, Germany), following the manufacturer‘s standard protocol for animal tissue.
Specimens older than 20 years were examined following the above-mentioned protocol under UV radiation to avoid DNA contamination.
Extracted DNA was used for amplifying the 5P fragment of the mitochondrial DNA cytochrome C oxidase I gene "barcoding region" (COI Barcode) via PCR with the primer combination HybHCO/HybLCO (
For species split into more than one BIN, we amplified and sequenced the nuclear markers EF1α and GADPH.
We amplified EF1α PCR with the primer combination HybOskar (5' -TAA TAC GAC TCA CTA TAG GG GGC CCA AGG AAA TGG GCA AGG G-3')/HybEFrcM4 (5'-ACA GCV ACK GTY TGY CTC ATR TC-3') and GADPH PCR with the primer combination HybFrigga/Burre (
For sequencing work, we mandated Macrogen Europe, Amsterdam, Netherlands.
Sequences of COI, EF1α and GADPH were aligned manually using BioEdit version 7.2.6.1 (
We analysed our data using the Barcode Index Number system (BIN) (
For species split into more than one BIN, we arranged combined datasets with COI sequences and the nuclear markers EF1α and GADPH. Phylogenetic analysis was made with these concatenated sequences via the Maximum Likelihood method (
Specimen details such as collection sites, DNA-Barcode, GADPH and EF1α sequences were uploaded to the BOLD system and are publicly available in the dataset: NZPYR New Zealand Pyraloidea (also see: Suppl. material
We recovered DNA-barcodes > 500 bp for 440 specimens, with the oldest specimen being from 1993. The number of barcode sequences varies from 1 to 64 sequences per species. BOLD analyses revealed 82 Barcode Index Numbers (BINs) representing 73 morphologically-identified species. These represent 29% of New Zealand Pyraloidea, based on Nuss et. al (2020). For 63 species (86.3%), there was a perfect match between BIN and morphological species identification.
Thirty-four of these BINs already existed on BOLD, with sequences supplied by other BOLD users. We enlarged these BINs with 315 sequences. For six of these BINs, we additionally supplied the species names as they were only identified as Scopariinae. Furthermore, we established 48 new BINs with a total of 125 sequences.
The analysed specimens showed a mean interspecific genetic distance of 12.54% (pairwise analysis, K2P model, n = 61.096 comparisons, SE < 0.01). The mutual comparison of genera showed a mean congenetic distance of 7.99% (pairwise analysis, K2P model, n = 25.274 comparisons, SE < 0.01).
Intraspecific variation showed a mean distance of 0.47%, minimum distance of 0% and a maximum of 6.6% (pairwise analysis, K2P model, comparisons of barcodes with > 500 bp, SE 0.01). The mean distance to the nearest-neighbour (NN) averaged 5.99% with a minimum of 0% and a maximum of 11.04% (pairwise analysis, K2P model, comparisons of barcodes with > 500 bp, SE 0.03) (Tables
Species with a COI pairwise distance < 4 % (Kimura 2 Parameter, sequences > 500 bp) to the nearest-neighbor, N = number of examined specimens.
Species (N) |
Nearest-neighbour species (N) |
COI pairwise distance [%] |
Glaucocharis epiphaea (1) |
Glaucocharis harmonica (1) |
0.0 |
Glaucocharis helioctypa (1) |
Glaucocharis lepidella (5) |
0.67 |
Eudonia axena (1) |
Eudonia submarginalis (64) |
2.66 |
Eudonia diphteralis (3) |
Eudonia submarginalis (64) |
2.76 |
Glaucocharis chrysochyta (2) |
Glaucocharis selenaea (2) |
3.61 |
all other species |
> 4 |
Species with a maximum intraspecific distance > 2.5 % (pairwise distance, Kimura 2 Parameter, sequences > 500 bp), N = number of tested specimens.
Species (N) |
mean intraspecific distance [%] |
max intraspecific distance [%] |
Orocrambus vulgaris (16) |
2.01 |
6.6 |
Orocrambus ramosellus (22) |
1.44 |
5.54 |
Scoparia ustimacula (2) |
5.52 |
5.52 |
Orocrambus apicellus (3) |
3.16 |
4.29 |
Orocrambus vitellus (58) |
0.73 |
3.76 |
Orocrambus ordishi (4) |
2.19 |
3.03 |
Eudonia submarginalis (64) |
0.86 |
2.95 |
all other species |
< 2.5 |
Regarding the two most species-rich subfamilies, the specimens of Scopariinae show a mean distance to the nearest-neighbour of 5.4% (pairwise distance, Kimura 2 Parameter, sequences > 500 bp, SE 0.04) with a maximum of 9.0% between Eudonia trivirgata and Antiscopa elaphra and a minimum of 2.7% between Eudonia axena and Eudonia submarginalis. With a mean distance of 5.6% in Crambinae (pairwise distance, Kimura 2 Parameter, sequences > 500 bp, SE 0.1), there is a maximum of 11.7% between Gadira acerella and Orocrambus cyclopicus and a minimum of 0.0% between Glaucocharis epiphaea and Glaucocharis harmonica.
There are two BIN assignments which contain two different species each: G. epiphaea with G. harmonica and G. helioctypa with G. lepidella. One of these pairs, G. epiphaea and G. harmonica, even share an identical barcode sequence.
Most of the morphologically-identified species show an intraspecific variation of less than 2%, but seven species (9.6%) show deep variations of up to 6.6%. Six species (8.2%) are spread over more than one BIN. Orocrambus apicellus, Scoparia ustimacula and Gadira acerella appeared each with 2 BINs and Orocrambus ordishi and Orocrambus vulgaris each with 3 BINs. Orocrambus ramosellus appeared in 5 BINs.
The specimens of Orocrambus vitellus show a maximum intraspecific distance of 3.76%, but are found in only one BIN. On the contrary, Gadira acerella shows a maximum intraspecific distance of 1.96% and is found in two BINs.
Specimens of Eudonia submarginalis form five clusters in the barcode Neighbour-Joining analysis (Kimura 2 model, sequences > 500 bp, see Suppl. material
The eight specimens of Orocrambus creneus, found near Sutton Salt Lake, form a distinct cluster in the barcode Neighbour-Joining analysis (Kimura 2 model, sequences > 500 bp, see Suppl. material
For the species, which appeared in more than one BIN, the concatenated analysis of COI + EF1α + GADPH revealed mean intraspecific distances from 1.12% (O. ordishi) to up to 2.0% (S. ustimacula) and maximum intraspecific distances from 1.55% (O. ordishi) to up to 3.13% (O. ramosellus) Table
Mean and maximum intraspecific distances (species split into more than one BIN) analysed with EF1α and GADPH and concatenated sequences (pairwise distance, Kimura 2 Parameter, sequences > 500 bp), N = number of specimens. The particular number of BINs is from COI analysis.
Species |
N |
EF1α |
N |
GADPH |
N |
concatenated (COI + EF1 α + GADPH) |
|||
mean intrasp. dist. [%] |
max intrasp. dist. [%] |
mean intrasp. dist. [%] |
max intrasp. dist. [%] |
mean intrasp. dist. [%] |
max intrasp. dist. [%] |
||||
O. apicellus (2 BINs) |
2 |
1.13 |
1.13 |
3 |
0.11 |
0.17 |
3 |
1.44 |
1.75 |
O. ordishi (3 BINs) |
2 |
0.81 |
0.81 |
4 |
0.25 |
0.46 |
4 |
1.12 |
1.55 |
O. ramosellus (5 BINs) |
5 |
0.79 |
1.53 |
6 |
0.39 |
1.07 |
6 |
1.72 |
3.13 |
O. vulgaris (3 BINs) |
3 |
0.62 |
0.81 |
3 |
0.39 |
0.62 |
4 |
1.25 |
1.82 |
S. ustimacula (2 BINs) |
2 |
0.54 |
0.54 |
2 |
0.93 |
0.93 |
2 |
2.00 |
2.00 |
Maximum Likelihood tree using Kimura 2 parameter distance model inferred from EF1α and GADPH sequences (species split into more than one BIN). Bootstrap (1000 replicates) values >= 75% are displayed, branch lengths represent genetic distances between nodes. The scale bar indicates 0.01 K2P distance. The COI BIN number is given for each specimen.
Maximum Likelihood tree using Kimura 2 parameter distance model inferred from COI, EF1α and GADPH sequences (species split into more than one BIN). Bootstrap (1000 replicates) values >= 75% are displayed, branch lengths represent genetic distances between nodes. The scale bar indicates 0.01 K2P distance. The COI BIN number is given for each specimen.
Due to the age of the specimens of Glaucocharis epiphaea and Glaucocharis harmonica (barcode sharing), as well as of Gadira acerella, which is split into two BINs, the amplification and analysis of EF1α and GADPH was not successful.
The automatic barcode gap discovery reveals the presence of a barcode gap at 4% (Fig.
ABGD (Automatic barcode gap discovery) partition analysis of 440 COI sequences of New Zealand Pyraloidea (pairwise distance, Kimura 2 Parameter, sequences > 500 bp, nbr: number of runs) generated via https://bioinfo.mnhn.fr/abi/public/abgd/ (last access: 10.09.2020)
The partition with 88 putative species reveals two OTUs which contain two different species each: G. epiphaea with G. harmonica and G. helioctypa with G. lepidella, which is identical to the BIN assignment. Following the partition with 71 putative species, Eudonia axena, Eudonia diphteralis and Eudonia submarginalis together share one OTU Table
Species split into more than one OTU/BIN (pairwise distance, Kimura 2 Parameter, sequences > 500 bp). BIN assignment in comparison to the number of putative species following ABGD.
Species | Number of BINs (BOLD) | Putative species (ABGD) partition with 88 OTUs | Putative species (ABGD) partition with 71 OTUs |
O. apicellus | 2 | 2 | 2 |
O. ordishi | 3 | 3 | 1 |
O. ramosellus | 5 | 5 | 2 |
O. vulgaris | 3 | 3 | 2 |
S. ustimacula | 2 | 2 | 2 |
G. acerella | 2 | 2 | 1 |
E. leptalea | 1 | 2 | 1 |
E. submarginalis | 1 | 2 | 1 |
O. vitellus | 1 | 4 | 1 |
P. farinaria | 1 | 2 | 1 |
From the 250 pyraloid species known from New Zealand (
Considering the accordance between BIN assignment and morphological species identification, former barcode campaigns showed a success rate of about 90% (e.g.
In our survey, there is a collecting bias towards manmade habitats, like pastures and suburban places. Some common species like Orocrambus flexuosellus and Eudonia submarginalis were found at nearly all study sites. In contrast, uncommon species like Delogenes limodoxa and Glaucocharis elaina were only found as singletons in one or two protected natural habitats. This imbalance influences the arrangement of OTUs and BINs, so that several BINs are represented by only one specimen.
Barcode sharing has been found for many lepidopteran taxa in previous studies (e.g.
In contrast, six species (8.2%) were split into two to five BINs. For the specimens involved in these BIN splits, the Maximum Likelihood analysis of the concatenated sequences of COI, EF1α and GADPH (Figs
Several studies suppose a Wolbachia infection as a trigger for BIN splitting (e.g.
The ABGD method results in two different partitions with 88 and 71 putative species, respectively. Depending on the considered partition, the number of OTU sharing and split species is different. Thus, ABGD delivers diverse outcomes and it remains to the user to select and interpret one or more results. Similar to the results obtained with the BIN assignments, we do not see any morphological delimitation supporting different species in these cases of split OTUs.
Several studies have compared results from BIN assignment and ABGD (e.g.
Seventy-one percent of the New Zealand pyraloid species were not available for study due to a limited collecting effort and a bias towards man-made habitats. Further additions to the DNA barcode library will require research on the species that are largely or exclusively restricted to natural habitats and having a restricted area of distribution like O. sophistis and Gadira leucophthalma (
We thank Robert Hoare from Landcare Research Auckland for his kind support and the loan of specimens of New Zealand Pyraloidea. The Department of Conservation (DOC) New Zealand kindly provided collecting permissions. Special thanks go to the members of Forest and Bird Te Wairoa Reserve and Peter and Margaret from Dolly’s Farm Taranaki for a very cordial welcome, as well as to Taranaki Regional Council for making the Hollard Gardens available for study. We thank our editor John-James Wilson and the reviewers Donald Lafontaine, Bernard Landry and Richard Mally for providing important suggestions to improve the paper. Staff and resources provided by Senckenberg Museum of Zoology, Dresden are gratefully acknowledged.
Senckenberg Museum of Zoology, Dresden, Germany
Sample IDs, species names, collection sites, BOLD accession numbers
Kimura 2 model, sequences > 500 bp, with species names, collecting ID, subfamily, collecting localities, specimen ID in BOLD database, subfamilies are coloured
pairwise distance, Kimura 2 Parameter, sequences > 500 bp; generated via: https://bioinfo.mnhn.fr/abi/public/abgd/