Biodiversity Data Journal :
Data paper
|
A dataset for examining trends in publication of new Australian insects
Corresponding author:
Academic editor: Pavel Stoev
Received: 19 Jun 2014 | Accepted: 04 Jul 2014 | Published: 07 Jul 2014
© 2014 Robert Mesibov
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Mesibov R (2014) A dataset for examining trends in publication of new Australian insects. Biodiversity Data Journal 2: e1160. https://doi.org/10.3897/BDJ.2.e1160
|
Australian Faunal Directory data were used to create a new, publicly available dataset, nai50, which lists 18318 species and subspecies names for Australian insects described in the period 1961–2010, together with associated publishing data. The number of taxonomic publications introducing the new names varied little around a long-term average of 70 per year, with ca 420 new names published per year during the 30-year period 1981–2010. Within this stable pattern there were steady increases in multi-authored and 'Smith in Jones and Smith' names, and a decline in publication of names in entomology journals and books. For taxonomic works published in Australia, a publications peak around 1990 reflected increases in museum, scientific society and government agency publishing, but a subsequent decline is largely explained by a steep drop in the number of papers on insect taxonomy published by Australia's national science agency, CSIRO.
Taxonomy, Insecta, Australia
This paper examines trends in publication of new Australian insects over the past 50 years. It is based on 'nai50' (Suppl. material
The nai50 dataset is based on a snapshot of AFD insect data in early 2014, and will not be updated. It should therefore not be used as a substitute for the AFD as a source of taxonomic information on Australian insects. Another reason not to use nai50 as a taxonomic resource is that the AFD contains errors and formatting inconsistencies (see below). While I tried to correct as many of these as possible in nai50, I may have inadvertently introduced new errors.
Australia
6.3 MB plain-text, tab-separated table with 22 columns and 18319 rows (including header row)
Column label | Column description |
---|---|
AFD_higher_taxon | AFD names list from which I sourced the species or subspecies name |
Order | Insect order in which the species or subspecies is placed |
Genus | Genus name |
Subgenus | Subgenus name (if any) |
Species | Species name |
Subspecies | Subspecies name (if any) |
Name_type | Valid, synonym or replacement name |
Rank | Species or subspecies |
Orig_combination | Original combination? (Y or N) |
Author | Taxonomic author |
Year | Year of publication of name |
Full_author_name | Taxonomic author(s) for the species or subspecies name, e.g. 'Bellamy, C.L. & Williams, G.A.' |
No_auths | Number of taxonomic authors for the species or subspecies name |
A_in_B | yes/no, for separate taxonomic name authorship within a publication, e.g. 'Peterson, M. in Peterson, M. & Bellamy, C.L.' |
Pub_ID | a unique serial number for the publication in which the species or subspecies was described |
Citation | Full reference citation |
Pub_type | Article in Journal, Book, Chapter in Book or Section in Article |
Pub_country | The country in which the work describing the species or subspecies was published |
Pub_ento | yes/no, for whether the work describing the species or subspecies was published in an entomological journal or book |
J_title | Journal title from Citation field, or 'na' for Book and Chapter in Book |
A_publr_class | For works published in Australia, whether the publisher was a government agency, a museum, a scientific society or 'other', e.g. a private individual; 'na' for works not published in Australia |
A_publr | For works published in Australia, the name of the publisher, e.g. CSIRO; 'na' for works not published in Australia |
At the time of this study, AFD metadata noted that taxonomic information on Chrysomeloidea, Heteroptera, Orthorrhapha, Staphylinoidea and Tenebrionoidea was incomplete. Further, some of the 'complete' insect groups in the AFD were not up to date, although the currency of the lists was not always clear from the metadata. For example, the Aleyrodoidea metadata gives a compilation date of 31 December 2001, yet the names list for this taxon includes species from a 2012 publication.
Another limitation concerns introduced species, which I wanted to exclude from my compilation of new Australian insects. The AFD database can export a list of introduced insects through its 'Advanced Search' function (http://www.environment.gov.au/biodiversity/abrs/online-resources/fauna/afd/search/advanced), but membership of the list depends on compilers annotating a name as 'introduced', and I was advised by AFD staff that not all compilers have done so (C. Geromboux, in litt. 13 April 2014).
I chose the date range 1961–2010 to represent 'the last 50 years'. The slight backdating increased the chances that AFD names lists were up to date for the purposes of this study, and also allowed me to class the data by decades, i.e. 1960s, 1970s etc. (I follow the convention that decades begin with a year ending in '1', since there was no year '0' in the Western calendar.)
In early 2014, the AFD allowed for export of names lists for any higher taxa, e.g an insect order or superfamily, as comma-separated value (CSV) files. A download limit prevented export of a names list for all insects, so I chose the largest higher taxa for which export was possible. I downloaded most of the lists in March 2014, more in April 2014 while AFD staff addressed an access problem on the AFD Web portal, and the remainder in June 2014.
The CSV files were merged, converted to a tab-separated value (TSV) file and processed as described below. Processing was done on the command line using GNU/Linux utilities and the AWK programming language, or in Gnumeric spreadsheet software. The merged AFD names lists were filtered to:
Of the original fields in the AFD names lists, I retained GENUS, SUBGENUS, SPECIES, SUBSPECIES, NAME_TYPE, RANK, AUTHOR, YEAR, ORIG[INAL]_COMBINATION, PUB_PUB_TYPE and PUB_PUB_FORMATTED. Field contents were modified as follows:
In the following list of fields in nai50, the fields I added are marked with an asterisk and described in bracketed notes:
*AFD_higher_taxon (AFD names list from which I sourced the species or subspecies name)
*Order (insect order in which the species or subspecies is placed)
Genus
Subgenus
Species
Subspecies
Name_type
Rank
Orig_combination [Y or N]
Author
Year
*Full_author_name (taxonomic author(s) for the species or subspecies name, e.g. 'Bellamy, C.L. & Williams, G.A.')
*No_auths (number of taxonomic authors for the species or subspecies name)
*A_in_B (yes/no, for separate taxonomic name authorship within a publication, e.g. 'Peterson, M. in Peterson, M. & Bellamy, C.L.')
*Pub_ID (a unique serial number for the publication in which the species or subspecies was described)
Citation
Pub_type (Article in Journal, Book, Chapter in Book or Section in Article)
*Pub_country (the country in which the work describing the species or subspecies was published)
*Pub_ento (yes/no, for whether the work describing the species or subspecies was published in an entomological journal or book)
*J_title (journal title from Citation field, or 'na' for Book and Chapter in Book)
*A_publr_class (for works published in Australia, whether the publisher was a government agency, a museum, a scientific society or 'other', e.g. a private individual; 'na' for works not published in Australia)
*A_publr (for works published in Australia, the name of the publisher, e.g. CSIRO; 'na' for works not published in Australia)
I used programmatic checks on nai50 to find AFD omissions, errors and formatting inconsistencies, which were numerous. Omissions and errors were corrected with the aid of original publications and online resources compiled by taxon specialists. I am not a taxon specialist for any insect group, so the taxonomic errors I corrected were mainly those detected by programmatic checks, such as comparing the AFD fields YEAR and PUB_PUB_YEAR, and AUTHOR and PUB_PUB_AUTHOR for all names. For checking the link between name and citation, the BioNames project (http://bionames.org/) was particularly helpful. Bibliographic data on publications came from Worldcat (https://www.worldcat.org/) and the National Library of Australia (http://trove.nla.gov.au/).
Data summaries and working tables were generated from nai50 using AWK commands. For charting and consistency checks, working tables were imported into Gnumeric spreadsheet software. The statistics in the analysis are purely descriptive.
The dataset 'nai50' is a 6.3 MB plain-text, tab-separated table with 22 columns and 18319 rows (including header row). It contains the names of 17905 species (97.7% of all names) and 413 subspecies (2.3%) of Australian insects described in the period 1961–2010. The 18318 total may include some introduced insects and is likely to omit some recently described species and subspecies, as well as species and subspecies in data gaps in the AFD (see 'AFD limitations' above). However, I regard the total as large enough for the primary purpose of this study, which is to identify trends in publication.
The 18318 species and subspecies in nai50 were described in 3628 publications by ca 1460 taxonomic authors. The latter number is tentative because it is not always clear from a citation alone whether 'Smith, A.' is the same author as 'Smith, A.B.'. For the same reason, I did not identify individual taxonomists in the list of taxonomic authors. Note that the 1460 figure counts 'Smith, A. & Jones, B.' and 'Jones, B. & Smith, A.' as different taxonomic authors, but 'Smith, A.' and 'Smith, A. in Jones, B. & Smith, A.' as the same taxonomic author.
The 3628 taxonomic works were published in 51 countries, with Australia accounting for 1499 of the publications (41.3%) and 9796 of the species and subspecies names (53.5%). Publications in the USA, UK, Germany and New Zealand contributed more than half of the remaining taxonomic works and names (Fig.
Most new names were published in journal articles (16397; 89.5%), followed by books (1652; 9.0%), chapters in books (256; 1.4%) and sections of articles (13; <0.1%).
The AFD recorded 523 of the 17905 species names (2.9%) and 68 of the 413 subspecies names (16.5%) as synonymised. Synonymised names have been included in the trend analysis below.
The 18318 species and subspecies were in 26 orders, with Coleoptera, Diptera, Hymenoptera and Hemiptera contributing more than three-quarters of all names (Fig.
A single publication can contain names with different numbers of taxonomic authors, e.g. 'Smith in Jones & Smith, 1998' for one name and 'Jones & Smith, 1998' for another. The following trends count publications in a year, but nine of the 3628 publications have been double-counted for the reason just mentioned.
There was a marked and steady decline in the proportion of publications with single-authored names (Fig.
The number of publications introducing names with two taxonomic authors increased steadily over the 50-year sampling period, with an increase in three-author names starting in the 1990s (Fig.
Having more taxonomic authors, however, did not lead to correspondingly greater taxonomic productivity, i.e. more new names per publication, and single taxonomic authors consistently introduced the majority of new Australian species and subspecies (Table
Mean new names/publication by decade in nai50 with 1, 2 and 3 taxonomic authors (number of new names).
Decade |
1 author |
2 authors |
3 authors |
---|---|---|---|
1960s |
3.7 (2323) |
2.1 (126) |
1.5 (3) |
1970s |
5.1 (2898) |
2.9 (263) |
1.3 (8) |
1980s |
5.5 (3473) |
5.1 (844) |
1.2 (5) |
1990s |
5.9 (3316) |
4.7 (875) |
2.9 (76) |
2000s |
6.4 (2693) |
4.9 (1099) |
6.5 (300) |
All years |
5.2 (14703) |
4.4 (3207) |
4.7 (392) |
Another strong trend in authorship was a steady increase in the number of publications containing names with 'A in B' authorship, e.g. 'Smith in Jones & Smith, 1998' (Fig.
Following a strong increase at the beginning of the 1980s, the number of new species and subspecies varied around an average of ca 420 per year for the last 30 years of the sampling period, with an average of 430 in the 1980s, 427 in the 1990s and 410 in the 2000s (Fig.
The publication of new names in journals was fairly steady over the 50-year sampling period (Fig.
There was a small increase in the average number of new names per publication (Table
Mean new names/publication by decade in nai50 for Australian, non-Australian and all publications (number of publications).
Decade |
Australian |
non-Australian |
All |
---|---|---|---|
1960s |
5.8 (188) |
2.7 (499) |
3.6 (687) |
1970s |
6.9 (299) |
3.0 (371) |
4.7 (670) |
1980s |
6.2 (385) |
4.6 (419) |
5.4 (804) |
1990s |
6.9 (375) |
4.3 (394) |
5.6 (769) |
2000s |
6.6 (252) |
5.5 (446) |
5.9 (698) |
All years |
6.5 (1499) |
4.0 (2129) |
5.0 (3628) |
The proportion of publications containing only one new Australian insect name declined, but not dramatically, from ca 55% in the 1960s to ca 45% in the 2000s (Fig.
The number of publications containing new Australian insect names varied surprisingly little around a long-term average of ca 70 publications per year (Fig.
Number of publications and number of new names in Australia by publisher class and publisher in nai50, 1961–2010. Publisher abbreviations: AAD = Australian Antarctic Division, ABRS = Australian Biological Resources Study, CSIRO = Commonwealth Scientific and Industrial Research Organisation, QldGovt = Queensland state government, TasGovt = Tasmanian state government, AM= Australian Museum, MAGNT = Museums and Art Galleries of the Northern Territory, NMV = Museum Victoria, QM = Queensland Museum, QVMAG = Queen Victoria Museum and Art Gallery, SAM = South Australian Museum, WAM = Western Australian Museum, AustEntSoc = Australian Entomological Society, BOIC = Butterflies and Other Invertebrates Club, EntSocNSW = Entomological Society of New South Wales, EntSocQld = Entomological Society of Queensland, FieldNatsSA = Field Naturalists Society of South Australia, FieldNatsVic = Field Naturalists Club of Victoria, FieldNatsWA = Western Australian Naturalists Club, RoySocQld = Royal Society of Queensland, RoySocSA = Royal Society of Soyj Australia, RoySocTas = Royal Society of Tasmania, RoySocVic = Royal Society of Victoria, RoySocWA = Royal Society of Western Australia, RoyZooSocNSW = Royal Zoological Society of New South Wales, UniQld = University of Queensland.
Publisher class |
Publisher |
No. of publications |
No. of new names |
Publication |
---|---|---|---|---|
agency |
AAD |
1 |
6 |
Australian New Zealand Antarctic Research Expedition Reports |
agency |
ABRS |
1 |
9 |
book |
agency |
CSIRO |
5 |
18 |
Australian Journal of Marine and Freshwater Research |
agency |
CSIRO |
127 |
1019 |
Australian Journal of Zoology |
agency |
CSIRO |
87 |
1725 |
Australian Journal of Zoology Supplementary Series |
agency |
CSIRO |
21 |
265 |
Invertebrate Systematics |
agency |
CSIRO |
172 |
1679 |
Invertebrate Taxonomy |
agency |
CSIRO |
24 |
660 |
books |
agency |
QldGovt |
5 |
18 |
Queensland Journal of Agricultural and Animal Sciences |
agency |
TasGovt |
1 |
1 |
book |
museum |
AM |
2 |
162 |
Memoirs of the Australian Museum |
museum |
AM |
42 |
273 |
Records of the Australian Museum |
museum |
AM |
1 |
208 |
Records of the Australian Museum, Supplement |
museum |
MAGNT |
1 |
8 |
Occasional Papers of the Northern Territory Museum of Arts and Sciences |
museum |
MAGNT |
14 |
46 |
The Beagle, Records of the Museums and Art Galleries of the Northern Territory |
museum |
NMV |
27 |
284 |
Memoirs of Museum Victoria |
museum |
NMV |
19 |
144 |
Memoirs of the National Museum of Victoria |
museum |
QM |
70 |
444 |
Memoirs of the Queensland Museum |
museum |
QVMAG |
2 |
2 |
Records of the Queen Victoria Museum |
museum |
SAM |
51 |
252 |
Records of the South Australian Museum |
museum |
SAM |
2 |
119 |
Records of the South Australian Museum, Monograph Series |
museum |
WAM |
36 |
122 |
Records of the Western Australian Museum |
museum |
WAM |
2 |
8 |
Records of the Western Australian Museum, Supplement |
museum |
WAM |
1 |
84 |
Special Publications of the Western Australian Museum |
society |
AustEntSoc |
122 |
285 |
Australian Journal of Entomology |
society |
AustEntSoc |
349 |
791 |
Journal of the Australian Entomological Society |
society |
BOIC |
1 |
1 |
Butterfly & Other Invertebrates Club Inc. Newsletter |
society |
EntSocNSW |
18 |
31 |
General and Applied Entomology |
society |
EntSocNSW |
2 |
17 |
Journal of the Entomological Society of Australia |
society |
EntSocQld |
47 |
69 |
Australian Entomological Magazine |
society |
EntSocQld |
51 |
68 |
Australian Entomologist |
society |
EntSocQld |
27 |
57 |
Journal of the Entomological Society of Queensland |
society |
FieldNatsSA |
2 |
2 |
South Australian Naturalist |
society |
FieldNatsVic |
5 |
9 |
Victorian Naturalist |
society |
FieldNatsWA |
1 |
1 |
Western Australian Naturalist |
society |
LinnSocNSW |
60 |
187 |
Proceedings of the Linnean Society of New South Wales |
society |
RoySocQld |
8 |
35 |
Proceedings of the Royal Society of Queensland |
society |
RoySocSA |
60 |
493 |
Transactions of the Royal Society of South Australia |
society |
RoySocTas |
10 |
33 |
Papers and Proceedings of the Royal Society of Tasmania |
society |
RoySocVic |
5 |
13 |
Proceedings of the Royal Society of Victoria |
society |
RoySocWA |
3 |
10 |
Journal of the Royal Society of Western Australia |
society |
RoyZooSocNSW |
3 |
78 |
The Australian Zoologist |
other |
private |
2 |
2 |
Calodema |
other |
private |
5 |
51 |
books |
other |
UniQld |
3 |
6 |
University of Queensland Papers, Department of Entomology |
other |
UniQld |
1 |
1 |
University of Queensland Papers, Department of Zoology |
Five-year moving averages for the three main Australian publisher classes (Fig.
Tracking publications (Fig.
The nai50 dataset provides an objective basis for identifying long-term research and publishing trends in Australian entomology, and is readily extendable. Since the dataset is now in the public domain, interested users are welcome to keep it up to date, extend it backwards in time and add new fields, such as author age and affiliation at time of publication.
Users are also welcome to search for and correct errors, which are undoubtedly still present in nai50. For every hour exploring and analysing nai50, I spent several hours detecting, investigating and correcting omissions, errors and formatting inconsistencies in AFD data. AFD data validation, both at the time of data entry by specialist compilers and later by AFD staff, could usefully be extended and improved. In correspondence with the author, AFD staff have said they are aware of the data cleaning issues in AFD and hope to address them more effectively when additional resources are made available to the project.
For help with AFD data I am very grateful to Alice Wells and Christy Geromboux of the Australian Biological Resources Study, Canberra. This study was funded by the author and was inspired by the occasion of the 50th anniversary conference of the Australian Entomological Society in 2014.
6.3 MB plain-text, tab-separated table with 22 columns and 18319 rows (including header row).
Numbers of publications and new names from the nai50 dataset, by country.
Numbers of new insect names in nai50 dataset, by order.
Percentage of publications by year in nai50 with single-author names.
Number of publications by year in nai50 with two-author (squares) and three-author (triangles) names.
Number of publications by year in nai50 with 'A in B' names.
Number of new species and subspecies names by year in nai50, with 5-year moving average.
Number of new species and subspecies names by year in nai50 published in books and journal articles.
Percentage of publications by year in nai50 in entomology books and journals.
Percentage of publications by year in nai50 containing only one new species or subspecies name.
Number of publications by year in nai50, with 5-year moving average.
Percentage of Australian publications by year in nai50.
Five-year moving average of Australian publications in nai50 by year and publisher class.
Number of publications and new names by year in nai50 in Australian Journal of Zoology, Invertebrate Taxonomy, Invertebrate Systematics and Zootaxa.