Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 May 24;12(5):577.
doi: 10.3390/v12050577.

Comparative Genomics Unveils Regionalized Evolution of the Faustovirus Genomes

Affiliations

Comparative Genomics Unveils Regionalized Evolution of the Faustovirus Genomes

Khalil Geballa-Koukoulas et al. Viruses. .

Abstract

Faustovirus is a recently discovered genus of large DNA virus infecting the amoeba Vermamoeba vermiformis, which is phylogenetically related to Asfarviridae. To better understand the diversity and evolution of this viral group, we sequenced six novel Faustovirus strains, mined published metagenomic datasets and performed a comparative genomic analysis. Genomic sequences revealed three consistent phylogenetic groups, within which genetic diversity was moderate. The comparison of the major capsid protein (MCP) genes unveiled between 13 and 18 type-I introns that likely evolved through a still-active birth and death process mediated by intron-encoded homing endonucleases that began before the Faustovirus radiation. Genome-wide alignments indicated that despite genomes retaining high levels of gene collinearity, the central region containing the MCP gene together with the extremities of the chromosomes evolved at a faster rate due to increased indel accumulation and local rearrangements. The fluctuation of the nucleotide composition along the Faustovirus (FV) genomes is mostly imprinted by the consistent nucleotide bias of coding sequences and provided no evidence for a single DNA replication origin like in circular bacterial genomes.

Keywords: Asfarvirus; Faustovirus; genome evolution; nucleo-cytoplasmic large DNA virus.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Venn diagram of the numbers of protein families shared between FV clades. For each area of the diagram, the number of protein families ubiquitously conserved in all of the FVs of the corresponding group is indicated. Numbers in brackets indicate additional protein families that were only conserved in a subset of FVs from the relevant group. The names of the FV strains contained in each clade are detailed in Figure 2.
Figure 2
Figure 2
Phylogenetic relationships between FVs and virus relatives. (A) Unrooted phylogenetic tree of FVs and virus relatives reconstructed using the viral DNA polymerase as a marker. FV clades have been collapsed for clarity. Details of the clade compositions are given in B. SH-like local supports for branches are indicated beside nodes. The scale bar indicates the number of amino-acid substitutions per site. (B) Phylogenetic tree of FVs reconstructed from the concatenation of the alignments of 267 single-copy core FV proteins. All branches received maximal SH-like local support. Names of the newly sequenced FV strains are shown in blue.
Figure 3
Figure 3
Evolution of the major capsid protein (MCP) gene structure in FVs and Kaumoebavirus (KV). The figure shows a graphical representation of the FV and KV MCP gene alignment. Exons and introns of the MCP genes are shown by red rectangles and 2-colour segments, respectively. Introns represented by the same 2-colour code are orthologous (i.e., share sequence similarity and position relative to the MCP coding sequence). A circled “A” marks orthologous introns that were present in the last FV common ancestor (i.e., introns shared by any virus from clade M/L and any virus from clades D and/or E9). Inversely circled “N”s indicate introns that have no evidence of being present in the last FV common ancestor. Light and dark grey areas indicate significant nucleotide similarity between introns and between exons, respectively. Predicted ORFs with significant protein similarity to group I intron endonucleases are shown with blue arrows with a number inside indicating the ORF id in the respective genome annotation.
Figure 4
Figure 4
Sequence conservation along viral chromosomes. The graphs represent sequence conservation along reference genomes of FVs (i.e., F-M6 for clade E9, F-D3 for clade D, and F-E12 for clade M/L) and African swine fever viruses (ASFVs) (strain BA71G). Each reference FV genome was aligned against the other FV genomes of the same clade using BLASTN (evalue < 1E-15). The reference ASFV genome was aligned against 12 other sequenced ASFV genomes available in Genbank. The resulting alignments were parsed to compute various statistics within 10 Kb windows slid along the genomes with a 1 Kb step. The dark, medium and light blue areas represent maximal, average and minimal levels of within-clade sequence conservation (global identity) within windows. The mauve and black curves represent the frequencies of positions containing a nucleotide substitution or a gap, respectively, in any alignment within windows. Open red rectangles indicate the positions of the MCP genes, with individual exons shown with shaded boxes below the x-axis.
Figure 5
Figure 5
Complimentary nucleotide composition bias along viral genomes. Each graph represents the G-C walk (black), A-T walk (blue) and CDS walk (kaki – y-axis not shown). Open red rectangles indicate the position of the MCP genes, with individual exons shown with shaded boxes below the x-axis. The x-axis units are base pairs. PV: Pacmanvirus A19; ASFV: Asfarvirus BA71V; KV: Kaumoebavirus Sc.

Similar articles

Cited by

References

    1. Iyer L.M., Aravind L., Koonin E.V. Common origin of four diverse families of large eukaryotic DNA viruses. J. Virol. 2001;75:11720–11734. doi: 10.1128/JVI.75.23.11720-11734.2001. - DOI - PMC - PubMed
    1. Brandes N., Linial M. Giant Viruses-Big Surprises. Viruses. 2019;11:404. doi: 10.3390/v11050404. - DOI - PMC - PubMed
    1. Iyer L.M., Balaji S., Koonin E.V., Aravind L. Evolutionary genomics of nucleo-cytoplasmic large DNA viruses. Virus Res. 2006;117:156–184. doi: 10.1016/j.virusres.2006.01.009. - DOI - PubMed
    1. Guglielmini J., Woo A.C., Krupovic M., Forterre P., Gaia M. Diversification of giant and large eukaryotic dsDNA viruses predated the origin of modern eukaryotes. Proc. Natl. Acad. Sci. USA. 2019;116:19585–19592. doi: 10.1073/pnas.1912006116. - DOI - PMC - PubMed
    1. Khalil J.Y.B., Andreani J., La Scola B. Updating strategies for isolating and discovering giant viruses. Curr. Opin. Microbiol. 2016;31:80–87. doi: 10.1016/j.mib.2016.03.004. - DOI - PubMed

Publication types

LinkOut - more resources