Biodiversity Data Journal :
Research Article
|
Corresponding author: Ed Baker (edwbaker@gmail.com)
Academic editor: Therese Catanach
Received: 04 Jun 2019 | Accepted: 29 Oct 2019 | Published: 30 Oct 2019
© 2019 Ed Baker, Sarah Vincent
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Baker E, Vincent S (2019) A deafening silence: a lack of data and reproducibility in published bioacoustics research? Biodiversity Data Journal 7: e36783. https://doi.org/10.3897/BDJ.7.e36783
|
A study of 100 papers from five journals that make use of bioacoustic recordings shows that only a minority (21%) deposit any of the recordings in a repository, supplementary materials section or a personal website. This lack of deposition hinders re-use of the raw data by other researchers, prevents the reproduction of a project's analyses and confirmation of its findings and impedes progress within the broader bioacoustics community. We make some recommendations for researchers interested in depositing their data.
bioacoustics, open data, reproducibility, sound libraries, acoustic vouchers
The importance of sharing the datasets used in biological research has been discussed recently by a number of authors, for example, in ecology (e.g.
The datasets used in bioacoustic research vary in scale from a single short recording to continuous recordings at a site over several years. These recordings may be used to identify (e.g.
The twenty most recently published articles (as of the end of 2017) covering bioacoustic topics were selected from each of the journals Bioacoustics, ZooKeys, ZooTaxa, Journal of Animal Behaviour and Marine Mammal Science. Primary research articles were identified using the search terms "acoustics" and "bioacoustics" on the journal's website; only articles making use of recorded sounds were selected. The journals chosen cover a subject-specific journal (Bioacoustics), a modern semantically enhanced (
Both authors independently examined the papers for evidence that the underlying sound recordings were available in either physical or digital repositories. Two scoring systems were used to categorise the papers:
A CSV file of the papers analysed (including their DOIs) and scored values is available in Suppl. materials
Out of all the articles in the study, 79% did not deposit any sound recordings, 12% deposited a sample of the studied data and only 9% deposited the full acoustic dataset (Fig.
Where complete data were deposited, the majority used a repository (8%) or the supplementary materials of the journal article (7%). Only 2% used a personal website (Fig.
It can be seen from these results that only a minority (21%) of the published studies analysed deposit the sound recordings on which their results are based. In addition, even when authors claim deposition, there can still be problems. Issues identified in this study include not depositing recordings, broken URLs and providing the wrong URL for a repository (for a list see Suppl. material
This lack of deposition is potentially problematic for the reproducibility of research and also hinders the re-use of recordings by other researchers. Given widespread lack of deposition described in this paper, below we provide some recommendations which researchers may use to improve the accessibility of their bioacoustic data.
It may be argued for well-studied, easily identified taxa with relatively stable taxonomy, that there is no need to deposit recordings and that a well-documented methodology is sufficient to ensure reproducibility. The reasons why this does not extend to all taxa have been discussed previously in the context of voucher specimens for biodiversity and community ecology by
Dedicated bioacoustic repositories often have the advantage of integrating with other components of the biodiversity informatics landscape, for example using Darwin Core (
Many of the advances in large-scale ecoacoustics will rely on large datasets that are labelled suitably for machine learning algorithms. While these datasets are becoming available for well-studied groups of organisms (
More work needs to be done by the bioacoustics community to create an environment where the data underpinning research are made available, ideally using the FAIR Data Principles of being findable, accessible, interoperable and reusable. A discussion of the FAIR principles for scientific data is provided in
The recommendations below are based on issues we have identified in the research for this paper.
The scoring of the articles used in this study.
The scoring of articles used in this study.
Full metadata for the articles analysed.
Full metadata for the articles analysed.