Genome sequencing of Rhinorhipus Lawrence exposes an early branch of the Coleoptera
© The Author(s). 2018
Received: 2 February 2018
Accepted: 2 April 2018
Published: 2 May 2018
Rhinorhipidae Lawrence, 1988 is an enigmatic beetle family represented by a single species, Rhinorhipus tamborinensis Lawrence, 1988, from Australia, with poorly established affinities near the superfamily Elateroidea (click beetles, soldier beetles and fireflies) or the more inclusive series (infraorder) Elateriformia. Its evolutionary position may inform the basal relationships of the suborder Polyphaga, the largest clade of Coleoptera.
We analyzed four densely sampled DNA datasets of major coleopteran lineages for mitogenomes, rRNA genes and single copy nuclear genes. Additionally, genome sequencing was used for incorporation of R. tamborinensis into a set of 4220 orthologs for 24 terminals representing 12 polyphagan superfamilies. Topologies differed to various degrees, but all consistently refute the proposed placement of Rhinorhipidae in Elateroidea and instead indicate either sister relationships with other Elateriformia, frequently together with Nosodendridae, another divergent small family hitherto placed in Derodontoidea, or in an isolated position among the deepest lineages of Polyphaga. The phylogenomic analyses recovered Rhinorhipus in a sister position to all other Elateriformia composed of five superfamilies. Therefore, we erect the new superfamily Rhinorhipoidea Lawrence, 1988, stat. Nov., with the type-family Rhinorhipidae. The origins of the Rhinorhipidae were dated to the Upper Triassic/Lower Jurassic at the very early phase of polyphagan diversification.
Thus, Rhinorhipidae adds another example to several recently recognized ancient relict lineages which are interspersed within contemporaneous hugely species-rich lineages of Coleoptera.
Rhinorhipidae is among only eleven out of nearly 200 beetle families for which molecular data have been unavailable in previous studies [1–4]. Rhinorhipus has been found in several localities in Queensland in the 1970′s, even in high numbers in some places, but since then the species has not been seen again despite intensive search efforts (G. Monteith and H. Escalona, pers. comm.). Recently, our expedition to Queensland yielded a fresh specimen suitable for DNA isolation and integration with existing molecular phylogenetic analyses. Extensive DNA datasets have been produced in the last decade for the phylogenetic analysis of the Coleoptera [2–4, 16, 17] and Elateriformia [18–20], and we here incorporate Rhinorhipus sequences into rRNA, mitogenomic and nuclear protein coding genes (PCGs) datasets. Additionally, we obtained shotgun genome data of Rhinorhipus tamborinensis for a phylogenomic analysis using several thousand orthologs available for a growing number of coleopteran lineages [21–29].
The classification of polyphagan beetles employs a hierarchy of series (infraorders) and superfamilies, into which most families can be firmly placed [12–15]. The taxonomic limits of these higher-level groups are increasingly well defined by large-scale molecular studies [1–4]. However, the backbone of the phylogeny specifying the relationships among these lineages emerging in various morphological and molecular analyses still differs in substantive ways, while support is generally low and the presented trees in some cases were constrained to illustrate the most meaningful relationships [1–4, 13, 17]. Equally there is disagreement about the dating of the Coleoptera tree, due to uncertainty about the placement of age-calibrated fossils [2–4, 30]. Resolution of basal relationships in Coleoptera will be improved with the greater completeness of taxon sampling of deep lineages, but this is hampered if DNA-grade specimens are difficult to obtain, which particularly affects the inclusion of species-poor, rare relict lineages. The resulting poor taxon sampling may also be responsible for discrepancies with morphological studies that are less constrained by the availability of specimens . Placement of these lineages is particularly important for studies of diversification which frequently are based on comparisons of species numbers between sister lineages or rate shifts within a tree [31, 32]. Therefore, the aim of this study is to determine the phylogenetic position and taxonomic status of the enigmatic Rhinorhipus, as part of the effort to establish the deep divergences of Coleoptera and an increasingly complete sampling of the major lineages constituting the earliest branches. Additionally, by dating the origin of this unique lineage relative to other lineages representing the early beetle evolution, we obtain a clearer picture of the arrangement of species rich and poor lineages making up the great diversity of Coleoptera.
Further datasets covered all major beetle lineages to investigate the relationships of Rhinorhipus and the polyphagan series. Using the mitogenome 15-gene 83-taxa dataset of broad representation of Coleoptera, the ML analyses of deep relationships within Polyphaga recovered Rhinorhipus as a sister to Nosodendron (BS 95%) and both of them combined as a sister to the superfamily Staphylinoidea (BS 89%, Fig. 2b, Additional file 1: Figure S2) or to a grade of paraphyletic Staphyliniformia of the arrangement (Staphylinoidea (Hydrophiloidea (Scarabaeoidea (Elateriformia)))). The poor recovery of the Staphyliniformia is typical for mitogenome data . The analysis was included to show that the mitogenomic phylogeny does not support the placement of Rhinorhipidae in Elateroidea.
Deep relationships within Polyphaga according to the eight-gene 139-taxa nuclear dataset  recovered Rhinorhipus as a sister to Nosodendron in the ML analysis (BS 99%), but here the clade was in the sister position to Elateriformia (BS 92%; Fig. 2c). The BI analysis recovered Rhinorhipus as a separate deeply branching lineage, as sister to Polyphaga minus Scirtiformia and Derodontiformia (i.e., the core Polyphaga) with high support (BS 100%), although the core Polyphaga clade was supported only with BS 53% (Additional file 1: Figure S4). Nosodendron was recovered as sister to Bostrichiformia + Cucujiformia (BS 53%, Additional file 1: Figure S4).
When included in the 66-gene 376-taxa dataset , both ML analyses of the amino acids (using iQ-TREE and RAxML) recovered Rhinorhipus as an independent deeply rooted lineage in the sister position to core Polyphaga (Fig. 2d, BS 100%) and Nosodendron was recovered as a sister to Bostrichiformia + Cucujiformia (Additional file 1: Figure S5), in agreement with the position obtained by Zhang et al. . Rhinorhipus was recovered as a sister to Nosodendron in the ML analysis of the nucleotide dataset using RAxML (BS 100%) and they combined were the sister to Elateriformia (BS 94%; Additional file 1: Figure S7). High BS values were recovered for alternative relationships in the 66-gene analyses (Additional file 1: Figures S5–S7).
Dating the tree
The overview of current phylogenetic analyses (PCG – nuclear protein coding genes; AA – amino acid dataset; nucl. – nucleotide dataset; ML – maximum likelihood; BI – Bayesian Inference; iQ – iQ-TREE; StaphF, ScarF, ElatF – the series Staphyliniformia, Scarabaeiformia, Elateriformia; core Polyphaga – all polyphagan series except Scirtiformia and Derodontidae)
# of taxa
# of genes
(Rhin. + Nosod.)(StaphF(ScarF(ElatF)))
((Rhin. + Nosod.)StaphF)(ScarF(ElatF))
(Rhinorhipus + Nosodendron)(ElatF)
Rhinorhipus + Nosodendron(ElatF)
PCG AA (#1)
ML (iQ, no filtering)
PCG nucl. (#2)
ML (iQ,1st + 2nd)
PCG AA (#3)
ML (iQ, Aliscore)
PCG AA (#4)
ML (iQ, Aliscore)
The relationships with Nosodendron needs further data to be robustly supported. Nosodendron was unavailable for transcriptomic analysis and only some analyses indicate its sister relationships with Rhinorhipus (Fig. 2b, c but not Fig. 2a–d). Nosodendridae was placed in Bostrichiformia  or Derodontiformia  and its relationships to Elateriformia has never been inferred from morphology. Even if further analyses support the relationships of these taxa, their morphological disparity and inferred ancient origins fully support the superfamily rank for each of these unique lineages [14, 15]. Lawrence  listed the following characters of Rhinorhipus which falsify the relationships with Bostrichoidea including Nosodendron: procoxae conical with fully exposed but immovable trochantins; prosternal process fitting into cavity on mesosternum; metendosternite with well developed ventro-lateral processes; wing with elongate radial cell and serricorn folding type; first three ventrites connate; Malphigian tubules free (cited from ). The relationships of Rhinorhipus and Sandalus as a representative of Dascilloidea which are another ancient lineage of Elateriformia (Fig. 3c, d and Fig. 4) is supported by: the relatively long occipital region, distinctly raised antennal sockets, strongly and abruptly declined fronto-clypeal region without a sharp carina, long mandible, strongly projecting mesocoxae, setose metatrochantin, long anterior process of metendosternite, well developed empodium with three or more setae. However, many of these characters occur commonly in numerous distantly related beetle lineages and cannot be considered as synapomorphies of Sandalus and Rhinorhipus.
The morphological distinctiveness of Rhinorhipus, respectively Rhinorhipoidea, is demonstrated by the combination of the following characters: the head is hypognathous, with long temporal regions, without transverse occipital ridge or epicranial suture; the cranium has a short, median occipital endocarina and raised antennal insertions; the fronto-clypeal region is strongly declined and does not have a fronto-clypeal suture; the clypeus is long and narrow; the corporotentorium is very broad; the oral cavity is blocked by hairs; labrum is membranous, highly reduced, mandibles are long and have a setose dorsal cavity at the base; maxillae are highly reduced, membranous, and setose. The prothorax has a pronotum without lateral carinae, is anteriorly constricted and has a pair of elongate, vertical cavities; procoxae are a slender and conical, trochantins are completely visible, the promeso-thoracic interlocking mechanism is weakly developed. The mesothorax has a moderately developed mesosternal cavity reaching to the middle of the sternum and a pair of well developed procoxal housings on the mesepisterna; metasternum has only a moderately short median suture. The metendosternite has very long, curved lateral arms, an anterior process with a foramen at its base and a pair of expanded, ear-like, ventro-lateral processes. Each elytron has 12 more or less complete rows of deep punctures. The legs have the enlarged and mesally produced metatrochanters, apically expanded hind tibiae, the simple tarsal segments, without any pads, brushes, or membranous lobes, the well developed empodium with two or three setae, and the pectinate tarsal claws. Rhinorhipus has six Malphigian tubes. The detailed information on morphological characters supporting further contradicting morphology-based relationships is given in the Additional file 1: Text.
Multi-gene rRNA and mtDNA phylogenies have become dominant in Coleoptera phylogenetics in the last two decades [e.g., 1, 2, 19, 20], and only the recent datasets of McKenna et al.  and Zhang et al.  analyzed multiple protein coding nuclear markers. All of these datasets, including the most recent of nearly 100 genes, produced topologies sensitive to the data treatment and choice of phylogenetic algorithm, which affected especially the placement of ancient lineages represented by species poor orphan lineages such as Nosodendron and Rhinorhipus (Table 1, Figs. 2 and 5, Additional file 1: Figures S1–S11) [1–4]. These data also run the risk of using non-homologous regions; e.g. at least a quarter of all genes used by Zhang et al.  which originally consisted of 95 loci were affected by apparent paralogy as judged against the existing reference genomes of Coleoptera (see Additional file 1: Table S5) for excluded loci, and even the loci in McKenna et al.  are affected by this problem, which required to make choices about which locus to select as an ortholog. In contrast, the use of shotgun Illumina sequencing produced several thousand one-to-one orthologs that can be recovered consistently from all available Coleoptera genomes and transcriptomes. Shotgun sequencing produced a nearly complete set of these orthologs for Rhinorhipus at an approximately 60× coverage. Bioinformatics pipelines [36, 37] that assemble these short reads against reference sequences can readily create thousands of gene sequences from genomic DNA, without the need for isolating mRNA, which is usually more difficult to obtain. Thus, greater taxon sampling is possible with relatively little effort and we can expect a stabilization of the remaining critical relationships among basal branches of the Coleoptera once more data area available.
The monotypic Rhinorhipus now identified as sister lineage to Elateriformia, or possibly even to the core Polyphaga, represents another example of a species-poor relict beetle lineage, such as Jurodidae and Crowsoniellidae in Archostemata, Aspidytidae and Meruidae in Adephaga, and Decliniidae in Polyphaga. It is perplexing why these lineages persist among others that are extremely species rich. Given its origin at the time where the continents were interconnected, and the habitat and presumed life style of Rhinorhipus, there is no obvious ecological or morphological trait that diminishes their chance of dispersal, diversification and species survival. Nevertheless, Rhinorhipus has a limited distributional range in Australia which harbors many relict lineages and is apparently rare in nature.
With the long-standing mystery about the great variation in species richness of Coleoptera in mind , these lineages are important for establishing the basal relationships and reconstruction of the dominant ecological role of earliest lineages in the Late Permian and Triassic ecosystems. Already, the realization of the ‘ancestral five’ in the earliest full tree of Coleoptera and the recognition of the core Polyphaga  had a major impact on the understanding of beetle relationships and dating. Placing the remaining small groups, including the enigmatic Rhinorhipidae, will be of equal importance for accurate evolutionary relationships and ecological characterization of the oldest beetle lineages. The growing phylogenetic evidence repeatedly indicates the presence of a much higher number of independent ancient beetle lineages than expected just two decades ago [2–4]. These families frequently live in soil, where they depend on organic moist detritus and the presence of molds and fungi and importantly, all these lineages apparently missed opportunities exploited by other lineages, such as the diversification of the angiosperms. They survived harsh conditions following the Permian extinction event , but were not able to diversify at the same pace as competing lineages in the changing world of the late Jurassic and Cretaceous. Thus, Rhinorhipidae stands out as a unique Australian witness of the early phase of beetle diversification in the early Mesozoic when the first extant beetles families evolved.
Material, DNA extraction, and sequencing
A single adult specimens of R. tamborinensis was collected in the Main Range National Park, Goomburra Section (27°59.0’S 152°21.4′E) on Nov. 28., 2014. DNA was extracted using the DNeasy kit (Qiagene Inc.). The dsDNA concentration was measured using a Qubit 2.0 Fluorometer (Life Technologies Corp., Carlsbad, CA). The voucher specimen was deposited in the collection of Department of Zoology, Palacky University, Olomouc.
The SSU and LSU rRNA and cox1 and rrnL mtDNA fragments were obtained using the Sanger method and procedures reported earlier  (GenBank accession numbers for rRNA fragments AB123456–78). The complete mitochondrial genome was sequenced using the mitochondrial metagenomics approach [40, 41]. The extracted raw DNA was pooled in equimolar concentration with DNA from 20 other Coleoptera from distant lineages not connected to the current study. A TruSeq DNA library was constructed with the pooled DNA and sequenced using the Illumina MiSeq platform (Illumina Inc., San Diego, CA) (2 × 300 bp paired-end sequencing; TruSeq library with 800–950 bp insert size) in 45% of an Illumina flow cell. The Illumina output was processed and assembled in three independent assemblers, followed by super-assembly and circularization in Geneious 7.1.9 (Biomatters Ltd., Auckland, New Zealand) as described in . The mitogenome of R. tamborinensis was identified by a Blast match to a cox1 sequence obtained from the same specimen by PCR-Sanger sequencing. The mitochondrial genome was annotated using gene predictions with MITOS  and manually refined in Geneious (GenBank Accession Number KT825140).
Total genomic DNA of R. tamborinensis was shotgun sequenced with Illumina X Ten platform (Illumina Inc., San Diego, CA) for 2 × 150 bp paired-end reads. The sequencing service was provided by Novogene Co., Ltd. (Beijing, China). Raw paired-end reads were filtered using the Novogene pipeline. The filtering steps included the removal of read pairs if either one read contains adapter contamination; if more than 10% of bases are uncertain in either one read; or if the proportion of low quality bases is over 50% in either one read. The quality of reads was visualized with FastQC. The genomic data were deposited in GenBank (Accession Number PRJNA448980).
Construction of data matrices
Various sequences for R. tamborinensis were incorporated in existing datasets. In all cases, protein coding genes (PCGs) were individually aligned using the “Translation Align” option with the FFT-NS-i-× 2 algorithm of MAFFT 7.2 , and ribosomal genes rrnS and rrnL were aligned using the Q-INS-I algorithm in MAFFT 7.2 . The GBlocks masking method for alignment variable regions was not applied, unlike in some of the original data sets. The following matrices were produced:
(1) A four-gene dataset of Elateriformia composed of nuclear complete SSU and partial LSU rRNA and mitochondrial cox1 and rrnL genes was assembled from earlier published data [9, 18–20] (Additional file 1: Table S1). The dataset contained 564 terminals and 4966 homologous positions.
(2) Mitochondrial genomes were retrieved from GenBank to represent all principal beetle lineages (Additional file 1: Table S2). The 15 mitochondrial genes were extracted using Geneious 8.0.5. Individual gene alignments were trimmed and concatenated for a final dataset of 82 taxa (including R. tamborinensis) and 15 genes with 12,940 homologous positions.
(3) An eight-gene nuclear dataset (SSU rRNA, LSU rRNA, wingless, alpha-spectrin; arginine kinase, phosphoenolpyruvate carboxykinase, carbamoyl-phosphate synthase domain, elongation factor-1α) was assembled for 139 taxa selected from a set of 367 taxa reported by McKenna et al. . The homologous reads for R. tamborinensis were extracted from shotgun reads using the HybPiper 1.2 pipeline  and mapped to reference genes in Geneious 7.1.9. The dataset contained 52 taxa from the series Elateriformia and extensive representation of all other polyphagan series and beetle suborders (Additional file 1: Table S3). Intron regions were manually removed and the exons were concatenated for each gene alignment. All sequences were trimmed to contain only complete codons triplets and alignments were concatenated using FASconCAT-G 1.02 (https://www.zfmk.de/en/research/research-centres-and-groups/fasconcat) to produce a super-matrix consisting of 10,999 aligned positions. The partition scheme and selected models are reported in Additional file 1: Table S4.
(4) A 66-gene 376-taxa matrix was assembled from data of Zhang et al.  (Additional file 1: Table S4). The homologous genes for R. tamborinensis were extracted from shotgun genome sequencing as described above. The final dataset excluded 28 genes of Zhang et al.  which were found to contain up to 21 copies in at least some beetle genomes and thus could not homologized confidently (Additional file 1: Table S5). Exons were extracted and concatenated as described above to produce a supermatrix of 56,340 aligned positions.
Transcriptomes (available on Oct. 6, 2017, Additional file 1: Table S4) were downloaded from the NCBI Transcriptome Shotgun Assembly (TSA) database. Additionally, transcripts of Rhagophthalmus sp., Chauliognathus flavipes and Phrixothrix hirtus were downloaded from the SRA archive and assembled using SOAPdenovo-Trans-31mer 1.04 . Lantern and body transcripts of P. hirtus were merged into a single dataset (Additional file 1: Table S4) . Raw data were first examined in FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc) to screen them for irregularities. Removal of low-quality reads and TruSeq adaptor sequences (Illumina Inc., San Diego, CA) was performed with Trimmomatic-0.36 .
The ortholog set was obtained by searching the OrthoDB 9.1 database  for one-to-one orthologs among Coleoptera in available genome sequences of Agrilus planipennis, Anoplophora glabripennis, Dendroctonus ponderosae, Leptinotarsa decemlineata, Onthophagus taurus, and Tribolium castaneum. OrthoDB 9.1 specified 4225 protein coding single copy genes for the above species and the Coleoptera reference node. We used Orthograph 0.6.1  to search the above transcriptomes for the corresponding sequences. Default settings were used except for extend-orf = 1 and orf-overlap-minimum = 0.5. We searched whole genome data of Rhinorhipus with HybPiper 1.2 pipeline  for each of the target orthologs using blastx of the Blast 2.6.0+ software package  with --evalue 1e-5 to search. Reads were then separated into directories by gene and assembled with SPAdes 3.10.1 . The resulting contigs were processed with Exonerate 2.2.0  with the protein2genome model using sequences from reference species with the best cumulative blast score to recover the exon sequences. We then merged the acquired exons of R. tamborinensis at both amino acid and nucleotide level with the corresponding ortholog groups. We removed terminal stop codons and masked internal stop codons at the translational level and nucleotide levels using the perl script summarize_orthograph_results.pl .
Alignments from each orthology group were checked for the presence of outlier sequences using BLOSUM62 distance scores  and following the methods reported by Misof et al. . We used Pal2Nal  to generate multiple sequence alignments of nucleotides corresponding to amino acids and Aliscore 2.0 [57, 58] to identify random similarity within alignments which were removed using Alicut 2.3 (https://github.com/mptrsen/scripts/blob/master/ALICUT_V2.3.pl).
The matrix of all 4220 orthologs and 9,994,362 homologous positions was assembled and derived datasets were used for tree construction. Supermatrix 1 – at amino acid level without any alignment masking (2,308,506 positions); Supermatrix 2 - at nucleotide level, 1st + 2nd codon positions only, outliers filtered out (6,636,362 positions); Supermatrix 3 – at amino acid level after masking with Aliscore (2,100,404 positions); and Supermatrix 4 – using a subset of Supermatrix 3 containing only ortholog alignments with representation of all taxa and after the alignment masking with Aliscore (943 orthologs, 378,949 positions). AliStat 1.3 (https://github.com/thomaskf/AliStat) was used to generate distributions of missing data in the supermatrices.
IQ-TREE 1.5.5  and RaxML  were used to calculate maximum likelihood (ML) trees using the IQ-TREE web server , with partitions identified by the ModelFinder tool of IQ-TREE using the Bayesian Information Criterion [62, 63]. The partitions, models and parameters are listed in Additional file 1: Tables S10–S14. The ultrafast bootstrap option was used with 1000 bootstrap iterations . The iQ analyses were run with the -spp parameter allowing each partition to have its own evolutionary rate. Bayesian inference (BI) was conducted with PhyloBayes  on the CIPRES web portal  using two independent chains under a GTRCAT model. Analyses were run checking for convergence every 1800 s excluding the first 500 cycles, and the runs were stopped when the maxdiff value was lower than 0.1. A consensus tree was obtained discarding the first 10,000 cycles as a burn-in fraction and taking 1 tree every 10 cycles for the remaining (bpcomp options -× 10,000 10 –c 0).
Gene tree incongruence was tested for Supermatrices 3 (all gene trees) and 4 (gene trees with representation of all taxa) by visualizations of the dominant bipartitions among individual loci based on the individual IQ-TREE ML gene topologies by constructing supernetworks using the SuperQ method implemented in Spectre selecting the ‘balanced’ edge-weight with ‘JOptimizer’ optimization function, and applying no filter [67, 68]. This methodology decomposes all gene trees into quartets to build supernetworks where edge lengths correspond to quartet frequencies. Resulting supernetworks were visualized in SplitsTree 4.14.6 .
Bayesian dating analyses
Dating analyses were calibrated using fossils relevant to the origin of Dytiscoidea and Geadephaga, Elateriformia and Scarabaeiformia [30, 70, 71]. The earliest fossils of Elateroidea date back to the Hettangian and Sinemurian deposits (190.8–201 mya; Elateridae, Elaterophanes; ); the fossil of Scarabaeiformia to the Jurassic Formation of Switzerland (196.5–201.6 mya; Aphodiites ; and the fossil of Dytiscoidea to the Hassberge formation in Germany (221.5 mya; Protonectes germanicus . We applied a truncated normal distribution allowing a soft tail to the past as recommended for fossil calibration  with a minimum age hard bound at 190.8 mya and a 95% range of 192.4–239.8 mya (mean = 190.8 mya stdev = 25.0 mya) as a prior for the node representing the split of Elateroidea from other Elateriformia; a minimum age hard bound at 196 mya and a 95% range of 197.6–245 mya (mean = 196 mya stdev = 25.0 mya) for the split between Scarabaeoidea and its sister taxon; and a minimum age hard bound at 221.5 mya and a 95% range of 223.1–270.5 mya (mean = 221.5 mya stdev = 25.0 mya) for the origin of Adephaga.
Divergence times at nodes were estimated with BEAST 1.8.4  on the fixed topologies from ML and BI analyses, applying the best-fit substitution model and partition scheme as estimated in PartitionFinder 2 . For the molecular clock settings, the dataset was partitioned in 3 partitions (PCGs, rrnL and rrnS for mitochondrial genomes; rRNA genes, 1st + 2nd and 3rd codon positions of PCGs for the 8-gene dataset) applying an uncorrelated lognormal clock to each partition and a Yule speciation prior. Analyses were run twice in parallel with 50 million generations sampling one tree every 5000 generations. Consensus trees were estimated with TREEANNOTATOR  combining both runs and discarding the 50% initial trees as burn-in after checking the ESS of the tree likelihood and ensuring that values had reached a plateau in TRACER 1.6 (http://beast.bio.ed.ac.uk/Tracer). The inferred diversification events were verified against the set of 19 fossils listed by Toussaint et al. . No prior was set for the root age.
The authors are obliged to G. Monteith and C. Lambkin (Queensland Museum, Brisbane), S. L. Cameron (Queensland University of Technology, Brisbane), A. Keto (Australia Rainforest Conservation Society, Springbrook), A. Slipinski and H. Escalona (CSIRO, Canberra) for their support during our trip to Queensland and to R. Bilkova for technical assistance. The Department of Environment and Heritage Protection, Queensland Government and the Wildlife Trade and Biosecurity Branch of the Department of Environment, Australia are acknowledged for export and research permits.
The study was funded by the IGA and GACR projects (PrF2018, 18-14942S to L.B., D.K., M.Mo. and Ma.B.), Leverhulme Trust (F/00696/P to A.P.V. and L.B.) and the NHM Biodiversity Initiative.
Availability of data and materials
The DNA sequences reported in this article can be accessed in GenBank under accessions AB123456–9, KT82514 and AB123456.
LB, APV and MB participated in the study design, DK, MMo, CA, Ma.B and FK carried out data analyses, MMa, KS and LB collected material in the field, all authors contributed to the draft of the manuscript and discussed results. All authors gave final approval for publication.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Bocak L, Barton C, Crampton-Platt A, Chesters D, Ahrens D, Vogler AP. Building the Coleoptera tree-of-life for >8000 species: composition of public DNA data and fit with Linnaean classification. Syst Entomol. 2014;39:97–110. https://doi.org/10.1111/syen.12037.View ArticleGoogle Scholar
- Hunt T, Bergsten J, Levkanicova Z, Papadopoulou A, John OS, Wild R, et al. A comprehensive phylogeny of beetles reveals the evolutionary origins of a superradiation. Science. 2007;318:1913–6.View ArticlePubMedGoogle Scholar
- McKenna DD, Wild AL, Kanda K, Bellamy CL, Beutel RG, Caterino MS, et al. The beetle tree of life reveals that Coleoptera survived end-Permian mass extinction to diversify during the cretaceous terrestrial revolution. Syst Entomol. 2015;40:835–80. https://doi.org/10.1111/syen.12132.View ArticleGoogle Scholar
- Zhang SQ, Che LH, Li Y, Dan L, Pang H, Ślipiński A, Zhang P. Evolutionary history of Coleoptera revealed by extensive sampling of genes and species. Nat Commun. 2018;205:2041–1723. https://doi.org/10.1038/s41467-017-02644-4.Google Scholar
- Nikitsky NB, Lawrence JF, Kirejtshuk AG, Gratshev VGA. New beetle family, Decliniidae fam. N., from the Russian far east and its taxonomic relationships (Coleoptera, Polyphaga). Russ Entomol J. 1993;2:3–10.Google Scholar
- Lawrence JF, Nikitsky NB, Kirejtshuk AB. Phylogenetic position of Decliniidae (Coleoptera: Scirtoidea) and comments on the classification of Elateriformia (sensu lato). In: Pakaluk J, Ślipiński SA, editors. Biology, phylogeny, and classification of Coleoptera: papers celebrating the 80th birthday of Roy a. Crowson. Warszawa: Muzeum i Instytut Zoologii PAN; 1995. p. 373–410.Google Scholar
- Ribera I, Beutel RG, Balke M, Vogler AP. Discovery of Aspidytidae, a new family of aquatic Coleoptera. Proc Biol Sci. 2002;269:2351–6. https://doi.org/10.1098/rspb.2002.2157.View ArticlePubMedPubMed CentralGoogle Scholar
- Spangler PJ, Steiner WE Jr. A new aquatic beetle family, Meruidae, from Venezuela (Coleoptera: Adephaga). Syst Entomol. 2005;30:339–57. https://doi.org/10.1111/j.1365-3113.2005.00288.x.View ArticleGoogle Scholar
- Bocak L, Kundrata R, Andújar CF, Vogler AP. The discovery of Iberobaeniidae (Coleoptera: Elateroidea): a new family of beetles from Spain, with immatures detected by environmental DNA sequencing. Proc Biol Sci. 2016;283:20152350. https://doi.org/10.1098/rspb.2015.2350.View ArticlePubMedPubMed CentralGoogle Scholar
- Lawrence JF. Rhinorhipidae, a new beetle family from Australia, with comments on the phylogeny of the Elateriformia. Invertebr Taxon. 1988;2:1–53.View ArticleGoogle Scholar
- Lawrence JF, Newton AF. Families and subfamilies of Coleoptera (with selected genera, notes, references and data on family-group names). In: Pakaluk J, Ślipiński SA, editors. Biology, phylogeny, and classification of Coleoptera: papers celebrating the 80th birthday of Roy a. Crowson. Warszawa: Muzeum i Instytut Zoologii PAN; 1995. p. 779–1006.Google Scholar
- Leschen RAB, Beutel RG, Lawrence JF. Handbook of zoology, Arthropoda: Insecta. Coleoptera, beetles; volume 2: morphology and systematics (Elateroidea, Bostrichiformia, Cucujiformia partim). In: Kristensen NP, Beutel RG, editors. Handbook of zoology, Arthropoda: Insecta. Berlin and New York: Walter de Gruyter GmbH & Co. KG; 2010.Google Scholar
- Bouchard P, Bousquet Y, Davies AE, Alonso-Zarazaga MA, Lawrence JF, Lyal CHC, et al. Family-group names in Coleoptera (Insecta). ZooKeys. 2011;88:1–972. https://doi.org/10.3897/zookeys.88.807.View ArticleGoogle Scholar
- Lawrence JF, Ślipiński SA, Seago AE, Thayer MK, Newton AF, Marvaldi AE. Phylogeny of the Coleoptera based on morphological characters of adults and larvae. Annal Zool. 2011;61:1–217. https://doi.org/10.3161/000345411X576725.View ArticleGoogle Scholar
- Beutel RG, Leschen RAB. Coleoptera, beetles; volume 1: morphology and systematics (Archostemata, Adephaga, Myxophaga, Polyphaga partim). In: Kristensen NP, Beutel RG, editors. Handbook of zoology, Arthropoda: Insecta. 2nd ed. Berlin and Boston: Walter de Gruyter GmbH & Co. KG; 2016.Google Scholar
- Timmermans MJTN, Dodsworth S, Culverwell CL, Bocak L, Ahrens D, Littlewood DTJ, et al. Why barcode? High-throughput multiplex sequencing of mitochondrial genomes for molecular systematics. Nucleic Acids Res. 2010;38(21):e197. https://doi.org/10.1093/nar/gkq807.View ArticlePubMedPubMed CentralGoogle Scholar
- Timmermans MJTN, Barton C, Haran J, Ahrens D, Ollikainen A, Culverwell L, et al. Family-level sampling of mitochondrial genomes in Coleoptera: compositional heterogeneity and Phylogenetics. Genome Biol Evol. 2016;8:161–75. https://doi.org/10.1093/gbe/evv241.View ArticleGoogle Scholar
- Bocakova M, Bocak L, Hunt T, Teravainen M, Vogler AP. Molecular phylogenetics of Elateriformia (Coleoptera): evolution of bioluminescence and neoteny. Cladistics. 2007;23:477–96. https://doi.org/10.1111/j.1096-0031.2007.00164.x.View ArticleGoogle Scholar
- Kundrata R, Bocakova M, Bocak L. The comprehensive phylogeny of the superfamily Elateroidea (Coleoptera: Elateriformia). Mol Phylogenet Evol. 2014;76:162–71. https://doi.org/10.1016/j.ympev.2014.03.012.View ArticlePubMedGoogle Scholar
- Kundrata R, Jäch M, Bocak L. Molecular phylogeny of the Byrrhoidea-Buprestoidea complex (Coleoptera, Elateriformia). Zool Scr. 2017;46(2):150–64. https://doi.org/10.1111/zsc.12196.View ArticleGoogle Scholar
- Sharkey CR, Fujimoto MS, Lord NPSS, Mckenna DD, Suvorov A, et al. Overcoming the loss of blue sensitivity through opsin duplication in the largest animal group, beetles. Sci Rep. 2017;7:8. https://doi.org/10.1038/s41598-017-00061-7.View ArticlePubMedPubMed CentralGoogle Scholar
- Wang K, Hong W, Jiao H, Zhao H. Transcriptome sequencing and phylogenetic analysis of four species of luminescent beetles. Sci Rep. 2017;7:1814. https://doi.org/10.1038/s41598-017-01835-9.View ArticlePubMedPubMed CentralGoogle Scholar
- Parker DJ, Cunningham CB, Walling CA, Stamper CE, Head ML, Roy-Zokan EM, et al. Transcriptomes of parents identify parenting strategies and sexual conflict in a subsocial beetle. Nat Commun. 2015;6:8449. https://doi.org/10.1038/ncomms9449.View ArticlePubMedPubMed CentralGoogle Scholar
- Fallon TR, Li FS, Vicent MA, Weng JK. Sulfoluciferin is biosynthesized by a specialized luciferin sulfotransferase in fireflies. Biochemistry. 2016;55(24):3341–4. https://doi.org/10.1021/acs.biochem.6b00402.View ArticlePubMedGoogle Scholar
- Peters RS, Meusemann K, Petersen M, Mayer C, Wilbrandt J, Ziesmann T, et al. The evolutionary history of holometabolous insects inferred from transcriptome-based phylogeny and comprehensive morphological data. BMC Evol Biol. 2014;14(1):52. https://doi.org/10.1186/1471-2148-14-52.View ArticlePubMedPubMed CentralGoogle Scholar
- Poelchau M, Childers C, Moore G, Tsavatapalli V, Evans J, Lee CY, et al. The i5k workspace@NAL—enabling genomic data access, visualization and curation of arthropod genomes. Nucleic Acids Res. 2014;43:(D1) https://doi.org/10.1093/nar/gku983.
- McKenna DD, Scully ED, Pauchet Y, Hoover K, Kirsch R, Geib SM, et al. Genome of the Asian longhorned beetle (Anoplophora glabripennis), a globally significant invasive species, reveals key functional and evolutionary innovations at the beetle–plant interface. Genome Biol. 2017;17(1):227. https://doi.org/10.1186/s13059-016-1088-8.View ArticleGoogle Scholar
- Keeling CI, Yuen MM, Liao NY, Docking TR, Chan SK, Taylor GA, et al. Draft genome of the mountain pine beetle, Dendroctonus ponderosae Hopkins, a major forest pest. Genome Biol. 2013;14(3) https://doi.org/10.1186/gb-2013-14-3-r27.
- Richards S, et al. (Tribolium genome sequencing consortium). The genome of the model beetle and pest Tribolium castaneum. Nature. 2008;452(7190):949–55. https://doi.org/10.1038/nature06784.View ArticlePubMedGoogle Scholar
- Toussaint E, Seidel M, Arriaga-Varela E, Hájek J, Král D, Sekerka L, et al. The peril of dating beetles. Syst Entomol. 2017;42:1–10. https://doi.org/10.1111/syen.12198.View ArticleGoogle Scholar
- Slowinski JB, Guyer C. Testing whether certain traits have caused amplified diversification: an improved method based on a model of random speciation and extinction. Am Nat. 1993;142:1019–24. https://doi.org/10.1086/285586.View ArticlePubMedGoogle Scholar
- Rabosky DL, Grundler M, Anderson C, Title P, Shi JJ, Brown JW, et al. BAMMtools: an R package for the analysis of evolutionary dynamics on phylogenetic trees. Methods Ecol Evol. 2014;5(7):701–7. https://doi.org/10.1111/2041-210X.12199.View ArticleGoogle Scholar
- Wolfe JM, Daley AC, Legg D, Edgecombe GD. Fossil calibrations for the arthropod tree of life. Earth Sci Rev. 2016;160:43–110. https://doi.org/10.1016/j.earscirev.2016.06.008.View ArticleGoogle Scholar
- Labandeira CC, Eble GJ. The fossil record of insect diversity and disparity. In: Anderson J, Thackeray F, Van Wyk B, De Wit M, editors. Gondwana alive: biodiversity and the evolving biosphere. Johannesburg: Witwatersrand University Press; 2000.Google Scholar
- Kirejtshuk AG, Poschmann M, Prokop J, Garrouste R, Nel A. Evolution of the elytral venation and structural adaptations in the oldest Palaeozoic beetles (Insecta: Coleoptera: Tshekardocoleidae). J Syst Palaeontol. 2014;12(5):575–600. https://doi.org/10.1080/14772019.2013.821530.View ArticleGoogle Scholar
- Johnson MG, Gardner EM, Liu Y, Medina R, Goffinet B, Shaw AJ, et al. HybPiper: extracting coding sequence and introns for Phylogenetics from high-throughput sequencing reads using target enrichment. Appl Plant Sci. 2016;4(7):1600016. https://doi.org/10.3732/apps.1600016.View ArticleGoogle Scholar
- Allen JM, Boyd B, Nguyen NP, Vachaspati P, Warnow T, Huang D, et al. Phylogenomics from whole genome sequences using aTRAM. Syst Biol. 2017;66:786–98. https://doi.org/10.1093/sysbio/syw105.PubMedGoogle Scholar
- Gould SJ. A special fondness for beetles. Nat Hist. 1993;102:4–6.Google Scholar
- Holz M. Mesozoic paleogeography and paleoclimates – a discussion of the diverse greenhouse and hothouse conditions of an alien world. J S Am Earth Sci. 2015;61:91–107. https://doi.org/10.1016/j.jsames.2015.01.001.View ArticleGoogle Scholar
- Andújar C, Arribas P, Ruzicka F, Crampton-Platt A, Timmermans MJTN, Vogler AP. Phylogenetic community ecology of soil biodiversity using mitochondrial metagenomics. Mol Ecol. 2015;24(14):3603–17. https://doi.org/10.1111/mec.13195.View ArticlePubMedGoogle Scholar
- Crampton-Platt A, Timmermans MJTN, Gimmel ML, Kutty SN, Cockerill TD, Vun Khen C, Vogler AP. Soup to tree: the phylogeny of beetles inferred by mitochondrial metagenomics of a Bornean rainforest sample. Mol Biol Evol. 2015;32(9):2302–16. https://doi.org/10.1093/molbev/msv111.View ArticlePubMedPubMed CentralGoogle Scholar
- Andújar C, Arribas P, Linard B, Kundrata R, Bocak L, Vogler AP. The mitochondrial genome of Iberobaenia (Coleoptera: Iberobaeniidae): first rearrangement of protein-coding genes in the beetles. Mitochondrial DNA A DNA Mapp Seq Anal. 2017;28(2):156–8. https://doi.org/10.3109/19401736.2015.1115488.PubMedGoogle Scholar
- Bernt M, Donath A, Jühling F, Externbrink F, Florentz C, Fritzsch G, et al. MITOS: Improved de novo metazoan mitochondrial genome annotation. Mol Phylogenet Evol. 2013;69(2):313–9. https://doi.org/10.1016/j.ympev.2012.08.023.View ArticlePubMedGoogle Scholar
- Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80. https://doi.org/10.1093/molbev/mst010.View ArticlePubMedPubMed CentralGoogle Scholar
- Katoh K, Toh H. Improved accuracy of multiple ncRNA alignment by incorporating structural information into a MAFFT-based framework. BMC Bioinformatics. 2008;9:212. https://doi.org/10.1186/1471-2105-9-212.View ArticlePubMedPubMed CentralGoogle Scholar
- Xie Y, Wu G, Tang J, Luo R, Patterson J, Liu S, et al. SOAPdenovo-trans: de novo transcriptome assembly with short RNA-Seq reads. Bioinformatics. 2014;30(12):1660–6. https://doi.org/10.1093/bioinformatics/btu077.View ArticlePubMedGoogle Scholar
- Amaral DT, Silva JR, Viviani VR. Transcriptional comparison of the photogenic and non-photogenic tissues of Phrixothrix hirtus (Coleoptera: Phengodidae) and non-luminescent Chauliognathus flavipes (Coleoptera: Cantharidae) give insights on the origin of lanterns in railroad worms. Gene Rep. 2017;7:78–86. https://doi.org/10.1016/j.genrep.2017.02.004.View ArticleGoogle Scholar
- Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. https://doi.org/10.1093/bioinformatics/btu170.View ArticlePubMedPubMed CentralGoogle Scholar
- Zdobnov EM, Tegenfeldt F, Kuznetsov D, Waterhouse RM, Simão FA, Ioannidis P, et al. OrthoDB v9.1: cataloging evolutionary and functional annotations for animal, fungal, plant, archaeal, bacterial and viral orthologs. Nucleic Acids Res. 2016;45:(D1) https://doi.org/10.1093/nar/gkw1119.
- Petersen M, Meusemann K, Donath A, Dowling D, Liu S, Peters RS, et al. Orthograph: a versatile tool for mapping coding nucleotide sequences to clusters of orthologous genes. BMC Bioinformatics. 2017;18(1) https://doi.org/10.1186/s12859-017-1529-8.
- Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421. https://doi.org/10.1186/1471-2105-10-421.View ArticlePubMedPubMed CentralGoogle Scholar
- Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19(5):455–77. https://doi.org/10.1089/cmb.2012.0021.View ArticlePubMedPubMed CentralGoogle Scholar
- Slater G, Birney E. Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics. 2005;6(1):31. https://doi.org/10.1186/1471-2105-6-31.View ArticlePubMedPubMed CentralGoogle Scholar
- Henikoff S, Henikoff JG. Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A. 1992;89(22):10915–9. https://doi.org/10.1073/pnas.89.22.10915.View ArticlePubMedPubMed CentralGoogle Scholar
- Misof B, Liu S, Meusemann K, Peters RS, Donath A, Mayer C, et al. Phylogenomics resolves the timing and pattern of insect evolution. Science. 2014;346:763. https://doi.org/10.1126/science.1257570.View ArticlePubMedGoogle Scholar
- Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34(Web Server https://doi.org/10.1093/nar/gkl315.
- Misof B, Misof KA. Monte Carlo approach successfully identifies randomness in multiple sequence alignments: a more objective means of data exclusion. Syst Biol. 2009;58(1):21–34. https://doi.org/10.1093/sysbio/syp006.View ArticlePubMedGoogle Scholar
- Kück P, Meusemann K, Dambach J, Thormann B, von Reumont BM, Wägele JW, Misof B. Parametric and non-parametric masking of randomness in sequence alignments can be improved and leads to better resolved trees. Front Zool. 2010;7(1):10. https://doi.org/10.1186/1742-9994-7-10.View ArticlePubMedPubMed CentralGoogle Scholar
- Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum likelihood phylogenies. Mol Biol Evol. 2015;32:268–74. https://doi.org/10.1093/molbev/msu300.View ArticlePubMedGoogle Scholar
- Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–90.View ArticlePubMedGoogle Scholar
- Trifinopoulos J, Nguyen LT, von Haeseler A, Minh BQ. W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 2016;44(W1):W232–5. https://doi.org/10.1093/nar/gkw256.View ArticlePubMedPubMed CentralGoogle Scholar
- Kalyaanamoorthy S, Minh BQ, Wong TK, Haeseler AV, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14(6):587–9. https://doi.org/10.1038/nmeth.4285.View ArticlePubMedPubMed CentralGoogle Scholar
- Chernomor O, von Haeseler A, Minh BQ. Terrace aware data structure for phylogenomic inference from supermatrices. Syst Biol. 2016;65:997–1008. https://doi.org/10.1093/sysbio/syw037.View ArticlePubMedPubMed CentralGoogle Scholar
- Hoang DT, Chernomor O, von Haeseler A, Minh BQ, Vinh LS. UFBoot2: improving the ultrafast bootstrap approximation. Mol Biol Evol. 2018;35(2):518–22. https://doi.org/10.1093/molbev/msx281.View ArticlePubMedGoogle Scholar
- Lartillot N, Philippe H. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol Biol Evol. 2004;21(6):1095–109. https://doi.org/10.1093/molbev/msh112.View ArticlePubMedGoogle Scholar
- Miller MA, Pfeiffer W, Schwartz T. Creating the CIPRES science gateway for inference of large phylogenetic trees. In: 2010 gateway computing environments workshop, GCE, vol. 2010. https://doi.org/10.1109/GCE.2010.5676129.
- Grunewald S, Spillner A, Bastkowski S, Bogershausen A, Moulton V. SuperQ: computing Supernetworks from quartets. IEEE/ACM Trans Comput Biol Bioinform. 2013;10(1):151–60. https://doi.org/10.1109/tcbb.2013.8.View ArticlePubMedGoogle Scholar
- Bastkowski S, Mapleson D, Spillner A, Wu T, Balvociute M, Moulton V, SPECTRE. A suite of PhylogEnetiC tools for reticulate evolution. 2017; https://doi.org/10.1101/169177.
- Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2005;23(2):254–67. https://doi.org/10.1093/molbev/msj030.View ArticlePubMedGoogle Scholar
- Rasnitsyn AP, Quicke DLJ. History of insects. Dordrecht, The Netherlands: Kluwer Academic Publisher; 2002.View ArticleGoogle Scholar
- Behrensmeyer AK, Turner A. Taxonomic occurrences of Elateriformia and Scarabaeiformia recorded in the Paleobiology database Fossilworks. 2015. See. http://fossilworks.org
- Whalley PES. The systematics and palaeogeography of the lower Jurassic insects of Dorset, England. Bull Br Mus Nat Hist. 1985;39:107–89.Google Scholar
- Heer O. Die Urwelt der Schweiz. Friedrich Schulthess: Zürich; 1865.Google Scholar
- Prokin AA, Makarov KV, Ponomarenko AG, Bashkuev AS. New beetle larvae (Coleoptera: Coptoclavidae, Caraboidea, Polyphaga) from the upper Triassic of Germany. Russ Entomol J. 2013;22(4):259–74.Google Scholar
- Ho SYW, Phillips MJ. Accounting for calibration uncertainty in phylogenetic estimation of evolutionary divergence times. Syst Biol. 2009;58(3):367–80. https://doi.org/10.1093/sysbio/syp035.View ArticlePubMedGoogle Scholar
- Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012;29(8):1969–73. https://doi.org/10.1093/molbev/mss075.View ArticlePubMedPubMed CentralGoogle Scholar
- Lanfear R, Frandsen PB, Wright AM, Senfeld T, Calcott B. PartitionFinder 2: new methods for selecting partitioned models of evolution for molecular and morphological phylogenetic analyses. Mol Biol Evol. 2017;34:772–3. https://doi.org/10.1093/molbev/msw260.PubMedGoogle Scholar