Biodiversity Data Journal :
OMIC Data Paper
|
Corresponding author: Christina Pavloudi (christina.pavloudi@embrc.eu), Ioulia Santi (ioulia.santi@embrc.eu)
Academic editor: Lyubomir Penev
Received: 04 Dec 2024 | Accepted: 25 Feb 2025 | Published: 12 Mar 2025
© 2025 Christina Pavloudi, Ioulia Santi, Iñigo Azua, Zuriñe Baña, Mauro Bastianini, Caroline Belser, Jone Bilbao, Julie Bitz-Thorsen, Caroline Broudin, Mathieu Camusat, Ibon Cancio, Louis Caray-Counil, Raffaella Casotti, Jade Castel, Thierry Comtet, Cymon Cox, Claire Daguin, Oihane Díaz de Cerio, Katrina Exter, Cécile Fauvelot, Miguel Frada, Pierre Galand, Laurence Garczarek, Jose González Fernández, Laure Guillou, Pascal Hablützel, Hanneloor Heynderickx, Céline Houbin, Anne Kervella, Apostolos Krystallas, Rune Lagaisse, Arnaud Laroquette, Lyvia Lescure, Eva Lopes, Melina Loulakaki, Bruno Louro, Catarina Magalhaes, Maria Maidanou, Francesca Margiotta, Marina Montresor, Fabrice Not, Estefanía Paredes, Isabella Percopo, Erwan Péru, Julie Poulain, Kim Præbel, Fabienne Rigaut-Jalabert, Sarah Romac, Melanthia Stavroulaki, Jesús Souza Troncoso, Eric Thiébaut, Wilfried Thomas, Andrzej Tkacz, Anna Chiara Trano, Patrick Wincker, Nicolas Pade
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Pavloudi C, Santi I, Azua I, Baña Z, Bastianini M, Belser C, Bilbao J, Bitz-Thorsen J, Broudin C, Camusat M, Cancio I, Caray-Counil L, Casotti R, Castel J, Comtet T, Cox C, Daguin C, Díaz de Cerio O, Exter K, Fauvelot C, Frada MJ, Galand P, Garczarek L, González Fernández J, Guillou L, Hablützel PI, Heynderickx H, Houbin C, Kervella A, Krystallas A, Lagaisse R, Laroquette A, Lescure L, Lopes E, Loulakaki M, Louro B, Magalhaes C, Maidanou M, Margiotta F, Montresor M, Not F, Paredes E, Percopo I, Péru E, Poulain J, Præbel K, Rigaut-Jalabert F, Romac S, Stavroulaki M, Souza Troncoso J, Thiébaut E, Thomas W, Tkacz A, Trano AC, Wincker P, Pade N (2025) First release of the European marine omics biodiversity observation network (EMO BON) shotgun metagenomics data from water and sediment samples. Biodiversity Data Journal 13: e143585. https://doi.org/10.3897/BDJ.13.e143585
|
|
The European Marine Omics Biodiversity Observation Network (EMO BON) is an initiative of the European Marine Biological Resource Centre (EMBRC) to establish a persistent genomic observatory amongst designated European coastal marine sites, sharing the same protocols for sampling and data curation. Environmental samples are collected from the water column and, at some sites, soft sediments and hard substrates (Autonomous Reef Monitoring Structures - ARMS), together with a set of mandatory and discretionary metadata (including Essential Ocean Variables - EOVs). Samples are collected following standardised protocols at regular and specified intervals and sequenced in large six-monthly batches at a centralised sequencing facility. The use of standard operating procedures (SOPs) during data collection, library preparation and sequencing aims to provide uniformity amongst the data collected from the sites. Coupled with strict adherence to open and FAIR (Findable, Accessible, Interoperable, Reusable) data principles, this ensures maximum comparability amongst samples and enhances reusability and interoperability of the data with other data sources. The observatory network was launched in June 2021, when the first sampling campaign took place.
Here we report the first data release from the European Marine Omics Biodiversity Observation Network (EMO BON) (
This dataset includes raw DNA sequences obtained from shotgun metagenomics sequencing of water and sediment samples from 13 selected observatories across Europe and the Red Sea. The raw sequence data are released in the European Nucleotide Archive (ENA) (
This dataset contributes to the ongoing efforts of the Ocean Biodiversity Information System (OBIS), which aims at filling the gaps in our current knowledge on biodiversity of the world's oceans. Processed data will be published in OBIS and in the Global Biodiversity Information Facility (GBIF), using the DNA extension of the Darwin Core format. In addition, processed data, once available in OBIS and GBIF, will be incorporated in the European Marine Observation and Data Network (EMODnet) and the European Digital Twin of the Ocean (European DTO) initiatives.
Sample collection was conducted under the standard operating procedures of the EMO BON Handbook (
Sediment samples were collected and processed, based on all the three proposed protocols. Briefly, observatories NRMCB and RiaFormosa used the 'Soft Substrate Standard Operating Procedures 1 – SoSOP 1 (intertidal sediments)", while OOB and ROSKOGO used the "Soft substrate Standard Operating Procedures 2 – SoSOP 2 (coastal sediments by diving)" and BPNS used "Soft substrate Standard Operating Procedures 3 ‒ SoSOP 3 (coastal sediments by research vessel)". Regardless of the choice of protocol, the steps regarding collection of sediment for microbial community assessment include the use of sediment cores and the subsequent slicing of the top 5 cm layer. As for the water samples, four replicates are collected for the sediment sampling; DNA/RNA shield was added in two of the replicates, which were the ones shipped for sequencing.
The dataset's geographical range includes 14 locations (13 observatories) across eight ecoregions, based on the Marine Ecoregions of the World (MEOW), proposed by
DNA extraction was performed at Genoscope, which was the chosen centralised facility to minimise biases and follow the same standardised procedures. For DNA extraction of the water column filter samples, the same protocol as described by
Sequencing was also performed at Genoscope. Metagenome libraries were constructed according to the available DNA: 10 to 100 ng of genomic DNA were sonicated to obtain fragments of around 350 bp, using the Covaris E220 instrument (Covaris, Woburn, MA, USA). Fragments were repaired, 3'-adenylated and NEXTflex PCR freebarcodes adapters (Bioo Scientific, Austin, TX, USA) were added using the NEBNext® Ultra II DNALibrary prep kit for Illumina (New England Biolabs, Ipswich, MA, USA). Ligation products were purified by AMPure XP beads 0:8 volume (Beckmann Coulter, Brea, CA, USA). DNA fragments (> 200 bp) were amplified by PCR (2 PCR reactions, 14 cycles) using Illumina adapter-specific primers and NEBNext® Ultra II Q5 Master Mix (NEB). All libraries were subjected to size profile analysis conducted by Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA) and to qPCR quantification using the KAPA Library Quantification Kit for Illumina Libraries (KapaBiosystems, Wilmington, MA, USA). All metagenomic libraries validated by the quality-control were sequenced using 151-bp pairwise read chemistry on an Illumina NovaSeq6000 sequencer, using S4 Flowcells (Illumina, San Diego, CA, USA). A minimum of 40,000 million useful paired-end reads were obtained per sample. Short Illumina reads were bioinformatically post-processed sensu
The target of the dataset was to assess prokaryotic and eukaryotic diversity associated with the collected samples.
Archaea, Bacteria, Eukaryota
Details for the samples can be found in Suppl. material
CC BY 4.0
This work used resources provided by the European Marine Omics Biodiversity Observation Network (EMO BON) project, coordinated by the European Marine Biological Resource Centre (EMBRC). For the NRMCB observatory, the captain and crew of RV Vettoria, Ferdinando Tramontano and Carmen Minucci and the NEREA Team (www.nerea-observatory.org) are acknowledged for support to sampling. For the BPNS observatory, captain and crew of RV Simon Stevin are acknowledged for operational support. For the PiEGetxo observatory, we also acknowledge TED2021-132109B-C21 research project funded by MCIN/AEI /10.13039/501100011033 and by the European Union NextGenerationEU/PRTR. For Station Biologique de Roscoff, the captain and crew of RV Neomysis, Stéphanie Cabioch, Noël Guidal, Arnaud Perrey and the Service Mer et Plongée Team are acknowledged for operational support to sampling. This work was supported by the Genoscope, the Commissariat à l'Énergie Atomique et aux Énergies Alternatives (CEA) and France Génomique (ANR-10-INBS-0009).