Biodiversity Data Journal :
Research Article
|
Corresponding author: Jiansheng Qiu (qiu1803@qq.com)
Academic editor: Jader Oliveira
Received: 15 Sep 2022 | Accepted: 23 Oct 2022 | Published: 09 Nov 2022
© 2022 Kai Hu, Sanpeng Yu, Niannian Zhang, Maojuan Tian, Qiming Ban, Zhongliang Fan, Jiansheng Qiu
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Hu K, Yu S, Zhang N, Tian M, Ban Q, Fan Z, Qiu J (2022) The first complete mitochondrial genome of Matsucoccidae (Hemiptera, Coccoidea) and implications for its phylogenetic position. Biodiversity Data Journal 10: e94915. https://doi.org/10.3897/BDJ.10.e94915
|
|
The mitochondrial genome (mitogenome) has been extensively used to better understand the phylogenetic relationships within the hemipteran suborder Sternorrhyncha, but sequenced mitogenomes remain unavailable for the entire family Matsucoccidae to date. To address this, here we sequenced the complete mitogenome of Matsucoccus matsumurae; the first for this family. The mitogenome is 15,360 bp in size and comprises the typical set of 37 mitochondrial genes and a large non-coding region (AT-rich region). Gene order, nucleotide composition and codon usage of protein-coding genes (PCGs) of M. matsumurae differ considerably from those of the other two sequenced Coccidae species. All PCGs were initiated by the ATN start codons and ended with the TAA/G or single T-- stop codons. Nine transfer RNA genes could be folded into typical clover-leaf secondary structures. The length and AT content of the ribosomal RNA genes are highly conserved in the Coccoidea mitogenomes. In contrast, the AT-rich control region is highly variable in size and in the number of tandem repeats. The sliding window analysis showed that the cox1 gene is the most conserved amongst the 13 PCGs, while the ratios of non-synonymous to synonymous substitution rates indicated that the evolution of this mitogenome has been dominated by positive selection. Phylogenetic analyses, based on nucleotide sequence data of 37 mitochondrial genes and amino acid sequence data of 13 PCGs using Bayesian Inference and Maximum Likelihood methods, showed that Matsucoccidae diverged before the Coccidae.
mitogenome, scale insect, phylogenetic analysis, secondary structure, Sternorrhyncha
The family Matsucoccidae was erected in 1984 by Koteja (
In insects, the mitochondrial genome (mitogenome) is typically a covalently closed circular double-stranded DNA molecule, usually 15 ~ 18 kb in length, that encodes 37 genes, including 13 protein-coding genes (PCG), two ribosomal RNA genes (rRNA) and 22 transfer RNA genes (tRNA) (
Currently (23 April 2022), there are no available mitogenomes for Matsucoccidae. In this study, we sequenced, annotated and analysed the mitogenome of Matsucoccus matsumurae (the type species of Matsucoccidae) in detail. Furthermore, we sampled all sternorrhynchan superfamilies to investigate the phylogenetic position of Matsucoccidae and also provide insight into the superfamily-level phylogenetic relationships within the suborder Sternorrhyncha.
Adult specimens of M. matsumurae (Fig.
The whole genomic DNA of M. matsumurae was sequenced using next-generation sequencing (Illumina HiSeq X10, Biomarker Technologies Corporation, Beijing, China). About 2.13 Gb clean data were assembled into a complete circular mitogenome using NOVOPlasty v.4.3.1 (
The annotation of mitogenome was conducted using Geneious Prime v.2022.0.1 (Biomatters, Auckland, New Zealand). The locations and sequences of tRNA genes were determined by the MITOS Web Server (http://mitos2.bioinf.uni-leipzig.de/index.py) (
The organisation tables, nucleotide composition and relative synonymous codon usage (RSCU) of the mitogenomes of Coccoidea species were calculated and produced using PhyloSuite v.1.2.2 (
A total of 34 mitogenomes from five superfamilies of Hemiptera were used for the phylogenetic analyses (Table
Superfamily |
Family |
Species |
Accession number |
Reference |
Aleyrodoidea |
Aleyrodidae |
Pealius machili |
|
|
Bemisia tabaci |
|
|||
Aleurocanthus camelliae |
|
|||
Psylloidea |
Calophyidae |
Calophya schini |
|
|
Calophya californica |
Unpublished |
|||
Carsidaridae |
Paracarsidara gigantea |
|
||
Allocarsidara bakeri |
|
|||
Homotomidae |
Mycopsylla gardenensis |
|
||
Homotoma ficus |
|
|||
Liviidae |
Diaphorina citri |
Unpublished |
||
Livia junci |
|
|||
Phacopteronidae |
Pseudophacopteron sp. |
|
||
Psyllidae |
Cyamophila willieti |
|
||
Freysuila caesalpiniae |
|
|||
Triozidae |
Trioza urticae |
|
||
Pariaconus pele |
|
|||
Aphalaridae |
Anoeconeossa unicornuta |
|
||
Blastopsylla occidentalis |
|
|||
Aphidoidea |
Anoeciidae |
Anoecia fulviabdominalis |
Unpublished |
|
Aphididae |
Uroleucon sonchi |
Unpublished |
||
Aphis craccivora |
|
|||
Brevicoryne brassicae |
|
|||
Hormaphididae |
Hamamelistes spinosus |
|
||
Pseudoregma bambucicola |
|
|||
Mindaridae |
Mindarus keteleerifoliae |
Unpublished |
||
Pemphigidae |
Schlechtendalia peitan |
Unpublished |
||
Melaphis rhois |
|
|||
Phloeomyzidae |
Phloeomyzus passerinii |
Unpublished |
||
Thelaxidae |
Kurisakia onigurumii |
Unpublished |
||
Coccoidea |
Coccidae |
Didesmococcus koreanus |
|
|
Saissetia coffeae |
|
|||
Matsucoccidae |
Matsucoccus matsumurae |
This study |
||
Fulgoroidea |
Achilidae |
Betatropis formosana |
Unpublished |
|
Caliscelidae |
Youtuus erythrus |
|
The complete mitogenome of M. matsumurae is a closed circular double-stranded DNA molecule (Fig.
Circular map of the mitogenome of Matsucoccus matsumurae. The outer circle shows the gene map of M. matsumurae, with the genes outside the map encoded on the major strand (H-strand), whereas genes on the inside of the map are encoded on the minor strand (L-strand). Genes are represented by different colour blocks.
Gene orders in Hemiptera mitogenomes. With the exception of Coccoidea (Didesmococcus koreanus, Matsucoccus matsumurae and Saissetia coffeae), each superfamily (plus the infraorder Cimicomorpha) is represented by one species. Genes shown with “-“ signs are located on the minor strand (L-strand), while others are located on the major strand (H-strand).
The overall nucleotide composition of the M. matsumurae mitogenome is 47.2% A, 6.4% C, 0.6% G and 45.8% T. It therefore exhibits a strong AT bias of 91.1%, which is higher than in other Coccoidea insects (82.5% for D. koreanus and 84.7% for S. coffeae) (Table
Nucleotide composition and skewness of mitogenomes of Didesmococcus koreanus, Matsucoccus matsumurae and Saissetia coffeae.
Feature |
Length |
A+T% |
AT-skew |
GC-skew |
D. koreanus / M. matsumurae / S. coffeae |
||||
Whole genome |
15143/15360/15389 |
82.5/91.1/84.7 |
0.142/0.056/0.189 |
-0.357/-0.267/-0.369 |
PCGs |
10599/10626/10632 |
81.9/90.3/84.1 |
-0.084/-0.115/-0.072 |
-0.108/0.016/-0.107 |
tRNAs |
1300/1330/1373 |
87.9/92.3/89.3 |
0.043/0.054/0.054 |
0.019/-0.059/-0.007 |
rRNAs |
1914/1890/1965 |
86.2/92.7/87.2 |
-0.027/-0.062/-0.099 |
0.426/0.304/0.440 |
Control region |
1350/1404/1536 |
77.7/92.9/82.1 |
-0.071/0.015/-0.045 |
-0.252/-0.818/-0.295 |
Nucleotide diversity (Pi value) of 13 PCGs and two rRNAs amongst the three scale insect species is shown in Fig.
The non-synonymous/synonymous (Ka/Ks) substitution ratio can be used to estimate whether a sequence is undergoing purifying, neutral or positive selection. The rates of non-synonymous (Ka) and synonymous substitutions (Ks) and their ratio (Ka/Ks) were calculated for the 13 PCGs of each of the three scale insect species using Aphis craccivora as the reference sequence (Fig.
The 13 PCGs (length: 10,626 bp) account for 69.2% of the complete mitogenome of M. matsumurae. All PCGs were initiated by the typical start codon ATN (ATA/T/G/C) and ended with the TAA/G stop codon or their incomplete form T-. This is almost identical to D. koreanus (
All three sequenced scale insect mitogenomes encode 22 tRNA genes (Suppl. material
The AT nucleotide content of rrnS and rrnL genes of the three scale insects ranges from 86.2% to 92.7% (Table
The AT-rich region is believed to be involved in regulating the transcription and replication of DNA in insects (
Phylogenetic analyses of 32 species of Sternorrhyncha, including two outgroups, based on ML and BI analyses of nucleotide sequence data of 37 mitochondrial genes, yielded largely congruent topologies, with most branches receiving strong support (Figs
All analyses consistently recovered the monophyly of Sternorrhyncha and its four superfamilies (Coccoidea, Aphidoidea, Aleyrodoidea and Psylloidea) with strong support (BS = 86; PP = 1.00) (Figs
This study was supported by the Projects of Guizhou Science and Technology Platform and Talent Team (QKHPTRC [2016] 5669) and Survey and Research on Exotic Species in Guizhou Province (AS [2017] 09, KJZXSA [2018] 016, and KJZXSA [2019] 014).
We thank Ivan Jakovlić (Bio-Transduction Lab, Wuhan, China) for proofreading of the manuscript and giving us valuable advice.
Investigation, formal analysis and writing—original draft, Kai Hu; resources, Sanpeng Yu, Maojuan Tian, Qiming Ban and Zhongliang Fan; writing—review and editing, Kai Hu, Sanpeng Yu, Niannian Zhang and Jiansheng Qiu; and validation, project administration and funding acquisition, Jiansheng Qiu. All authors have read and agreed to the published version of the manuscript.
All authors report no conflicting interests.
The best partitioning schemes and substitution models for PCG123 + tRNA + rRNA dataset comprising 13 PCGs, 22 tRNAs and two rRNAs of 34 species of Hemiptera used for ML phylogenetic analyses
The best partitioning schemes and substitution models for PCG123 + tRNA + rRNA dataset comprising 13 PCGs, 22 tRNAs and two rRNAs of 34 species of Hemiptera used for BI phylogenetic analyses
BI phylogenetic tree for Sternorrhyncha, based on the amino acid sequence data of 13 PCGs from Matsucoccus matsumurae and other 33 species belonging to five superfamilies of Hemiptera. Bayesian posterior probabilities (PP) are indicated on the branches