Biodiversity Data Journal :
Research Article
|
Corresponding author: Badrul Munir Md-Zain (abgbadd1966@yahoo.com)
Academic editor: Anna Sandionigi
Received: 28 Jun 2022 | Accepted: 09 Sep 2022 | Published: 27 Sep 2022
© 2022 Nur Azimah Osman, Muhammad Abu Bakar Abdul-Latiff, Abd Rahman Mohd-Ridwan, Salmah Yaakop, Kayal Vizi Karuppannan, Badrul Munir Md-Zain
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Osman NA, Abdul-Latiff MAB, Mohd-Ridwan AR, Yaakop S, Karuppannan KV, Md-Zain BM (2022) Metabarcoding data analysis revealed the plant dietary variation of long-tailed macaque Macaca fascicularis (Cercopithecidae, Cercopithecinae) living in disturbed habitats in Peninsular Malaysia. Biodiversity Data Journal 10: e89617. https://doi.org/10.3897/BDJ.10.e89617
|
The long-tailed macaque (Macaca fascicularis) has a wide range in both Peninsular Malaysia and Borneo. Although the primates are especially vulnerable to habitat alterations, this primate lives in disturbed habitats due to human-induced land-use. Thus, this study presents a faecal metabarcoding approach to clarify the plant diet of long-tailed macaques from five locations in Peninsular Malaysia to represent fragmented forest, forest edge, island and recreational park habitats. We extracted genomic DNA from 53 long-tailed macaque faecal samples. We found 47 orders, 126 families, 609 genera and 818 species across these five localities. A total of 113 plant families were consumed by long-tailed macaques in Universiti Kebangsaan Malaysia, 61 in the Malaysia Genome and Vaccine Institute, 33 in Langkawi Island, 53 in Redang Island and 44 in the Cenderawasih Cave. Moraceae (33.24%) and Fabaceae (13.63%) were the most common families consumed by long-tailed macaques from the study localities. We found that habitat type impacted diet composition, indicating the flexibility of foraging activities. This research findings provide an understanding of plant dietary diversity and the adaptability of this macaque with the current alteration level that applies to long-tailed macaque conservation management interest in the future.
Malaysian primates, Southeast Asia, DNA metabarcoding, trnL, next-generation sequencing
Macaca fascicularis (Raffles, 1821) is a cercopithecine primate also known as long-tailed macaque, crab-eating macaque or cynomolgus macaque. The native range of this species includes most of mainland Southeast Asia, such as Malaysia, Indonesia, Singapore, Thailand, Laos, Vietnam, Cambodia, Myanmar and the Philippines (
Long-tailed macaques live in several habitats with varying disturbance levels in Malaysia, including degraded and secondary rainforests, lowland primary rainforests, shrubland, mangroves, islands, agricultural areas, recreational parks and human settlements (
Thus, in this study we investigated the plant food diversity of the long-tailed macaques, found in disturbed habitats through the metabarcoding technique. We performed a comprehensive fresh faecal sampling of this macaque across these habitats. Therefore, molecular methods have quantified the diet of long-tailed macaque where feeding is difficult to observe. Knowledge of the fundamental aspects of dietary diversity from various habitats can help identify priority conservation areas and effectively manage these species in the conflict area. Metabarcoding diet data may assist government authorities, the Department of Wildlife and National Parks and non-governmental organisations in improving management plans and conserving long-tailed macaque.
Study area
Analysed samples in this present study were collected from non-invasive faecal material of long-tailed macaques. The samples were obtained from five localities representing four different living environments in Peninsular Malaysia: Universiti Kebangsaan Malaysia, UKM (fragmented forest); Malaysia Genome and Vaccine Institute, MGVI (fragmented forest); Langkawi Island (forest edge); Redang Island (island); and Cenderawasih Cave (recreational park) (Fig.
Other long-tailed macaque samples were collected around Redang Island, located in Kuala Nerus District, Terengganu, Malaysia (
DNA extraction
According to the manufacturer’s protocols, the innuPREP Stool DNA Kit (Analytik Jena, Jena, Germany) extracted DNA from approximately 400 mg of long-tailed macaque faeces. First, the surface and interior of the faecal pellet were sampled for each extraction (
List of samples and localities of M. fascicularis used for plant diet analysis.
# |
Pool samples code |
Number of samples |
Locality |
Type of habitat |
Coordinate |
1 |
FB1 |
12 |
UKM, Selangor |
Fragmented forest |
|
2 |
FB2 |
5 |
UKM, Selangor |
Fragmented forest |
|
3 |
FB3 |
5 |
UKM, Selangor |
Fragmented forest |
|
4 |
FB4 |
5 |
UKM, Selangor |
Fragmented forest |
|
5 |
FB5 |
7 |
UKM, Selangor |
Fragmented forest |
|
6 |
FB6 |
5 |
UKM, Selangor |
Fragmented forest |
|
7 |
FB8 |
5 |
MGVI, Selangor |
Fragmented forest |
|
8 |
FK9 |
3 |
Langkawi Island, Kedah |
Forest edge |
|
9 |
FT10 |
3 |
Redang Island, Terengganu |
Island |
|
10 |
FR11 |
3 |
Cenderawasih Cave, Perlis |
Recreational Park |
|
Total |
10 |
53 |
- |
- |
- |
PCR amplification
The trnL intron was amplified using the previously described primers targeting the P6 loop, creating a single amplicon of approximately 90 bp (
Illumina MiSeq-DNA sequencing of the trnL gene
The amplicons were sent to Apical Scientific Sdn. Bhd. for next generation sequencing (NGS). Dual indices were attached to the amplicon PCR using an Illumina Nextera XT Index Kit v.2 according to the manufacturer’s protocol. The libraries’ quality was measured using the Agilent Bioanalyzer 2100 System by Agilent DNA 1000 Kit and fluorometric quantification by Helixyte GreenTM Quantifying Reagent. Finally, the libraries were normalised and pooled, based on the protocol recommended by Illumina and followed by sequencing with the MiSeq platform using 150 PE (Illumina Inc., San Diego, CA, USA).
Statistical analysis
All next-generation sequence data were deposited into the National Center of Biotechnology Information, under Sequence Read Archive accession numbers; SRR19576857, SRR19577193, SRR19577505, SRR19577582, SRR19577635, SRR19577594, SRR19577595, SRR19577593, SRR19577662 and SRR19577637. The quality filtering and demultiplexing of the resulting sequences were conducted using the CLC Genomic Workbench software v.12.0 (CLC) (Qiagen, Hilden, Germany) at the Genetics Laboratory in the Department of Technology and Natural Resources, Faculty of Applied Sciences and Technology, Universiti Tun Hussein Onn (UTHM), Pagoh, Johor, Malaysia. The Illumina data quality scores were initially assessed using the FASTQ file. The next step is to assign taxonomy to the reads and tally the occurrences of species. A common approach is to cluster reads at some level of similarity into representative sequences of pseudo-species called Operational Taxonomic Units (OTUs), where all reads within 97% similarity are clustered together and represented by a single OTU sequence. This approach is frequently used due to the presence of sequencing errors in the NGS reads. OTUs were aligned using the MUSCLE tool in CLC. Chimera screening and taxonomy assignment is done using the SILVA v.138 database. Rarefaction curves were plotted with the number of OTUs observed with a given sequencing depth using CLC. The alpha diversity used to assess plant species richness in the long-tailed macaque was generated using the PAST software v.4.03. The principal coordinate analysis (PCoA) using the CLC software displayed the relationship between the samples. A Venn diagram was generated to determine the shared and unique OTUs amongst the localities of these macaques at 97% similarity. A phylogenetic dendrogram and heatmap were constructed with 1,000 bootstraps following the Bray-Curtis distance to assess the relationship of the plant species community amongst these samples. Statistical significance was set at p < 0.05.
Fifty-three fresh faecal samples were collected from five localities. However, because of opportunistic collection methods, we could not link the faecal samples to a specific age-sex class or known individual. In addition, samples were easily distinguishable from other primates according to the physical characteristics of the fresh faeces by size, smell, form and colour. The faecal samples were collected in sterile 45 ml tubes and fixed in 95% ethanol for long-term storage (
The NGS produced 595,455 reads from five pooled samples of M. fascicularis in different localities, ranging from 46,736 to 371,005. The final dataset obtained by sequence filtering excluded low-quality sequence reads, chimera and subsequently OTU clustering. A total of 407,354 OTU sequence reads were generated at the 97% similarity cut-off. UKM long-tailed macaque population showed the most OTU sequence reads (272,047) followed by Cenderawasih Cave (47,254), MGVI (43,157) and Redang Island (29,798), whereas the fewest reads were from Langkawi Island (15,098) (Fig.
The number of observed operational taxonomic units, alpha diversity indices for the plant DNA from five localities of M. fascicularis.
Localities |
Sequences |
OTUs |
Shannon (H') |
Chao 1 |
UKM, Selangor |
272047 |
2103 |
3.806 |
2198 |
MGVI, Selangor |
43157 |
523 |
2.897 |
647.5 |
Langkawi Island, Kedah |
15098 |
428 |
2.781 |
562.5 |
Cenderawasih Cave, Perlis |
47254 |
542 |
2.889 |
650.8 |
Redang Island, Terengganu |
29798 |
565 |
3.676 |
702.4 |
Total |
407354 |
4161 |
- |
- |
Diet richness and composition of the M. fascicularis
The trnL intron sequence of the M. fascicularis samples was amplified. The OTUs were assigned to 47 orders, 126 families, 609 genera and 818 species across these five pool samples. A total of 113 families of plants were in UKM, MGVI (61), Langkawi Island (33), Redang Island (53) and Cenderawasih Cave samples (44). Moraceae (33.24%) and Fabaceae (13.63%) were the most common families recorded from all study localities. The unknown family was recorded at 21.22%. Ficus is the most abundant genus recorded in the samples from UKM (41.04%) and Cenderawasih Cave (35.68%). Brosimum (49.75%) is the most abundant genus in MGVI samples (Fig.
Heatmap analysis revealed significant interindividual variability in the plant communities’ consumption in composition level at the five localities foraged by M. fascicularis. The 30 most abundant genera were adopted in the hierarchical clustering using weighted pair clustering based on Bray-Curtis measurements to evaluate the relationships between this macaque (Fig.
Beta-diversity on UniFrac-based principal coordinate analysis (PCoA) showed a dietary relationship amongst the five localities (Fig.
Additionally, these localities had overlapping plant genera (Fig.
This is the first study to report on the diversity of plants consumed by long-tailed macaques in disturbed habitats in Peninsular Malaysia using an advanced approach that combines DNA metabarcoding and Illumina NGS of trnL chloroplast genes. The data obtained from the five localities in the four different environmental settings presented a novel finding for understanding the diet of long-tailed macaques. Generally, long-tailed macaques fed on several food plants belonging to at least 693 species from 113 different families in a mixed landscape consisting of urban, agro-forested areas and forest fragments in Malaysia at UKM rather than in the other localities. These data supported the species richness of the food plants and sampling efforts performed on six macaque groups in UKM. As a lowland secondary dipterocarp fragmented forest, UKM Permanent Forest Reserve harbours important plant resources, including more than 500 species of seed plants surrounding the UKM main campus (
We projected that anthropogenic habitat alteration would lead to lower habitat quality for long-tailed macaques. Although the forests at UKM, MGVI, Langkawi Island and Redang Island have been minimally altered, human interaction still occurs. The disturbance level varied between the study areas. This macaque occupied a heavily altered habitat characterised by scarcity of plant food abundance and frequent human visits to the Cenderawasih Cave. However, we did not survey the availability of anthropogenic food. Thus, it is hard to verify whether this macaque highly depends on anthropogenic foods or its natural food resource. The amount of food from natural sources is consumed more by the long-tailed macaques than that from visitors at the high-altitude rainforest of Telaga Warna, West Java, Indonesia (
Non-invasive fresh faecal sampling was conducted in this study without capturing, touching or restraining the long-tailed macaques (
Our study reveals the dietary variation of long-tailed macaques in disturbed habitats by trnL DNA metabarcoding. Using a non-invasive method allows DNA metabarcoding to reveal the diet of long-tailed macaques where it has always been difficult to obtain using direct observation. Therefore, our results propose that long-tailed macaques consume various food plants that help them survive in disturbed habitats at the edge and centre of the fragmentary and human interference areas. Furthermore, we provide data on these species’ dietary requirements and plant species availability in different habitats. Knowledge of the fundamental aspects of dietary diversity from various habitats is increasingly employed to identify priority conservation areas and effectively manage these species in the conflict area. Other food resources are needed to understand their feeding behaviour since long-tailed macaques are omnivorous. Notably, metabarcoding diet data may assist government authorities, the Department of Wildlife and non-governmental organisations in improving management plans and conservation of long-tailed macaques in the conflict area. Knowing which plants are consumed by cercopithecine primates will guide translocation processes from disturbed habitats to the undisturbed forests that harbour these important resources.
We thank the Department of Wildlife and National Park (DWNP) Peninsular Malaysia especially YBhg. Dato’ Abdul Kadir bin Abu Hashim, Director General of the DWNP. Universiti Kebangsaan Malaysia, Universiti Tun Hussein Onn Malaysia for providing necessary facilities and assistance during field observation and laboratory work. The authors are deeply indebted to the Department of Wildlife and National Parks Peninsular Malaysia for providing research permit: JPHL&TN(IP):100-34/1.24Jld16(14). The authors appreciate the support from Universiti Kebangsaan Malaysia and Universiti Tun Hussein Onn Malaysia for funding the present work through the Grants GUP-2019-037, ST-2021-017, AP-2015-004, ST-2019-008, GPPS-UTHM-2019-H528 and FRGS/1/2018/WAB13/UTHM/03/2 by the Ministry of Education Malaysia.
UKM Universiti Research Grant (GUP) and Ministry of Education Malaysia.
GUP-2019-037, ST-2021-017, AP-2015-004, ST-2019-008, GPPS-UTHM-2019-H528 and FRGS/1/2018/WAB13/UTHM/03/2
Universiti Kebangsaan Malaysia and Universiti Tun Hussein Onn Malaysia.
Research methods, reported in this manuscript, adhered to the legal requirements of Malaysia and were approved by Department of Wildlife and National Parks (PERHILITAN), Peninsular Malaysia, KM10 Jalan Cheras, Kuala Lumpur, Malaysia under research permit JPHL&TN(IP):100-34/1.24Jld16(14).
N.A.O., M.A.B.A.L., A.R.M.R. and B.M.M.Z. designed the project. N.A.O. and M.A.B.A.L. were involved in field sample collection. N.A.O., M.A.B.A.L and A.R.M.R performed the laboratory work. N.A.O., M.A.B.A.L and A.R.M.R wrote the manuscript and analysed the data. M.A.B.A.L., A.R.M.R., B.M.M.Z. and S.Y. reviewed and edited the manuscript. M.A.B.A.L., K.V.K. and B.M.M.Z. provided resources, project administration and funding acquisition. All authors read and approved the final manuscript.
The authors declare that they have no conflict of interests.