Endogenization

Endogenization is the evolutionary process by which viral genetic material becomes stably integrated into the germline of a host organism and inherited by offspring through Mendelian inheritance. The integrated sequences become endogenous viral elements, which encompasses all inheritable virus-derived sequences regardless of virus type.[1]
The molecular mechanisms driving this process vary among virus and host types. It can more commonly occur from the direct use of integrases, or through other means such as the incidental hijacking of host DNA repair pathways.
Retroviruses are the most common source of endogenization, as genomic integration is an obligatory step in their replication, and can create endogenous retroviruses. Nevertheless, every category of eukaryotic virus can contribute sequences to host germlines, including RNA viruses with no obligatory DNA stage in their life cycle.[2][3]
Criteria
[edit]Endogenization in multicellar life usually requires that a viral integration occur in a germ cell, such as a sperm or egg cell, or in an early embryo before the somatic and germline lineages separate, so that the viral sequence is transmitted to all cells of the resulting organism and becomes available for vertical transmission.[4] Integration into normal somatic cells, no matter how widespread during an acute infection, will not produce a inheritable endogenous virus in animals, but may in other kingdoms.[1][2] Because germ cells are a numerically small and often physically isolated target, germline integration is exceedingly rare relative to infection, as such many endogenous viral elements (EVEs) appear to descend from single ancient integration events rather than repeated independent endogenizations.[5]
Once endogenized as a heterozygous insertion in one individual, an EVE is subject to the same forces as any new allele. Neutral or only slightly deleterious EVEs may drift to fixation, while those that are beneficial can spread by selection.[5][2]
Mechanism
[edit]Retroviruses
[edit]Retroviruses encode reverse transcriptase, which converts the RNA genome into double-stranded DNA, and integrase, which inserts that DNA into the host chromosome as a provirus. Because integration is an obligatory step in retroviral replication, occasional germline integration is a normal feature of retroviral biology, making endogenous retroviruses (ERVs) the most abundant class of EVE in vertebrate genomes,[6] including humans where HERVs make up approximately 8% of the human genome.[7]
RNA viruses
[edit]The RNA viruses which encode no reverse transcriptase and whose life cycle is entirely cytoplasmic do not undergo direct or purposeful endogenization of their genes as a retrovirus might, however since the discovery and study of EVEs, it has been demonstrated that they can become endogenized through different means. Bornavirus-like elements from the Bornaviridae family exist in the genomes of humans, other primates, rodents, elephants and others, with some primate integrations dated to over 40 million years ago.[8] Sequences related to Ebolavirus and Marburgvirus (Filoviridae family) were found endogenized in the genomes of rodents, bats and marsupials, indicating that filovirus-like viruses have been contributing germline sequences across mammalian lineages for tens of millions of years.[9]
The endogenization mechanism differs by virus genome type. For ssRNA viruses in animals, the encoded reverse transcriptase of the host genome's LINE-1 elements can act on cytoplasmic viral mRNAs via "target-primed" reverse transcription (TPRT), many resulting EVEs carry poly(A) tails and target-site duplications consistent with this route. However some integration events lack these hallmarks, indicating additional pathways.[8] dsRNA viruses are unsuitable for TPRT, yet totivirus-derived EVEs are widespread across insect genomes, demonstrating that dsRNA endogenization occurs altough the underlying route is uncharacterised. [10]
In insects, non-retroviral integrated RNA virus sequences (NIRVS) are particularly enriched in genomic regions populated by LTR and LINE retrotransposons, suggesting this host machinery serves as the primary vehicle for endogenizing viral RNA within the germline. In the Aedes aegypti and Aedes albopictus mosquitoes, NIRVS derived from Flaviviridae and Rhabdoviridae viruses are abundant and produce Piwi-interacting RNA that seem to participate in antiviral defense.[11] These same virus families have a strong presence in insects but have been endogenized across plant species as well.[12]
In single-celled organisms such as dinoflagellates, the frequency of EVEs correlates across genera with host LINE retroelement abundance, and LINE-encoded reverse transcriptase sequences occur in close genomic proximity to insertion sites, attesting to retroelement-driven reverse transcription as the primary endogenization mechanism. DNA repair processes and co-infection with different viruses may also be alternative routes.[13]
DNA viruses
[edit]DNA virus families have undergone endogenization by distinct routes. Hepadnaviridae, which replicate through an RNA intermediate using their own reverse transcriptase, have given rise to endogenous elements in birds, snakes, and other vertebrates.[14] A near-complete hepadnaviral genome has been identified at syntenic chromosomal position across multiple bird lineages, dating the endogenization to the Mesozoic. Researchers have used shared hepadnaviral elements these lineages to calibrate a long-term molecular clock for the family.[15] ssDNA viruses Circoviridae and Parvoviridae which lack reverse transcriptase have also been endogenized across mammals, seemingly through DNA repair.[2]
A widespread endogenization of Caulimoviridae and Geminiviridae sequences has been observed in plants genomes.[16][17] Neither family encodes an integrase, so integration occurs through illegitimate recombination at double-strand breaks.[2] Plants lack a strict germ/soma differentiation as their meristematic cells remain pluripotent and can give rise to gamete-producing tissue, thus an infection in the right developmental context can produce lead to endogenization.[18][19][12]
Giant viruses
[edit]
The giant viruses of nucleocytoviricota can endogenize into eukaryotic host genomes, seemingly an incidental post-infection occurrence, as with other non-retrovial endogenization events.[20][21][22] An exception to this is are the viruses of Phycodnaviridae, who encode integrase recombinases and follow a lysogenic life cycle within brown algae.[22]
Epigenetic silencing of endogenized viral DNA can reduce fitness cost to the host, with 5-methylcytosine DNA methylation serving as a primary mechanism in eukaryotic lineages retaining 5mC. Their sequences tend to be hypermethylated, and experimental removal of 5mC is sufficient to reactivate viral gene transcription.[21] Where 5mC is absent, histone methylation can serve an equivalent function, as observed in EsV-1 Ectocarpus provirus.[21][22] Correspondingly, giant virus endogenizations in eukaryotes are primarily found in lineages with these mechanisms.
On the other hand, giant viruses appear to carry countermeasures against silencing, including viral-encoded DNA methyltransferases and histone demethylases that may protect the endogenized viruses from the host chromatin.[23]
Once endogenized, a viral sequence may undergo progressive genomic erosion. Spliceosomal introns accumulate in viral genes, transposable elements invade the inserted region, and core viral genes may become lost or duplicated through chromosomal rearrangements.[24][23] Retained genes exhibit high Ka/Ks ratios ratios compared to free-living homologs, whereas introns and protein-coding genes can be shared with flanking genomic regions, and can lead to gradual assimiliton through negative selection[24][24]
Despite this, the amount of genetic material introduced by the endogenization of giant viruses in eukaryotes (such as Amoebidium)[25] can become a potential source of evolutionary novelty, just as with more commonly endogenized viruses. Giant virus-derived genes have been identified outside their endogenous viral regions while simultaneously exhibiting normal methylation patterns, having presumably been co-opted by their hosts.[21] Study of the endogenization of giant viruses, combined with their unique properties, has helped lend credence the hypothesis of viral eukaryogenesis.[26]
Virophages
[edit]Virophages are small dsDNA viruses that parasitize giant viruses. Virophage genomes can undergo endogenization into those of single-celled eukaryotes, and then reactivate upon infection by a giant virus, thereby providing a form of inducible antiviral defense.[27] They encode integrase related to those of Polinton elements, integration is non site-specific and can occur in the nuclear genome of their host in various chromosomal positions, where they produce silent provirophages.[28] Some virophages carry a second integration enzyme, a tyrosine recombinase, suggesting independent acquisitions of integration capacity across lineages.[29]
Effect
[edit]Endogenization of viral elements typically creates non-functional relics that accumulate mutations and persist by neutral drift, with their original coding capacity eroded.[5][1] Still, a subset has been exapted for host functions, usually retroviral sequences. endogenous bornavirus-like nucleoproteins have been proposed to interfere with replication of exogenous bornaviruses, representing a form of inheritable antiviral immunity created by past infections.[2][8] Aberrant reactivation of silenced EVEs has been associated with autoimmune and inflammatory conditions in vertebrates, although establishing certain causality remains a challenge in medical research.[5][30]
See also
[edit]References
[edit]- ^ a b c Katzourakis, Aris; Gifford, Robert J. (2010). "Endogenous viral elements in animal genomes". PLOS Genetics. 6 (11) e1001191. doi:10.1371/journal.pgen.1001191. PMC 2987831. PMID 21124940.
- ^ a b c d e f Feschotte, Cedric; Gilbert, Clement (2012). "Endogenous viruses: insights into viral evolution and impact on host biology". Nature Reviews Genetics. 13 (4): 283–296. doi:10.1038/nrg3199. PMID 22421730.
- ^ Aiewsakun, Pakorn; Katzourakis, Aris (2015). "Endogenous viruses: connecting recent and ancient viral evolution". Virology. 479–480 (5): 26–37. doi:10.1016/j.virol.2015.02.011. PMID 25771503.
- ^ "Koala study reveals clues about origins of the human genome". ScienceDaily. Institute for Genomic Biology, University of Illinois at Urbana-Champaign. 6 November 2014. Retrieved 30 May 2026.
- ^ a b c d Holmes, Edward C. (2011). "The evolution of endogenous viral elements". Cell Host and Microbe. 10 (4): 368–377. doi:10.1016/j.chom.2011.09.002. PMC 7172163. PMID 22018237.
- ^ Stoye, Jonathan P. (2012). "Studies of endogenous retroviruses reveal a continuing evolutionary saga". Nature Reviews Microbiology. 10 (6): 395–406. doi:10.1038/nrmicro2783. PMID 22565131.
- ^ Burn, Aidan (18 October 2022). "Humans are 8% virus – how the ancient viral DNA in your genome plays a role in human disease and development". The Conversation. Retrieved 22 May 2026.
- ^ a b c Horie, Masayuki; Honda, Tomoyuki; Suzuki, Yoshihiro; Kobayashi, Yuki; Daito, Takuji; Oshida, Tatsuo; Ikuta, Kazuyoshi; Jern, Patric; Gojobori, Takashi; Coffin, John M.; Tomonaga, Keizo (2010). "Endogenous non-retroviral RNA virus elements in mammalian genomes". Nature. 463 (7277): 84–87. Bibcode:2010Natur.463...84H. doi:10.1038/nature08695. PMC 2818285. PMID 20054395.
- ^ Belyi, Vladimir A.; Levine, Arnold J.; Skalka, Anna Marie (2010). "Unexpected inheritance: multiple integrations of ancient bornavirus and ebolavirus/marburgvirus sequences in vertebrate genomes". PLOS Pathogens. 6 (7) e1001030. doi:10.1371/journal.ppat.1001030. PMC 2912400. PMID 20686665.
- ^ Huang, Hai-Jian; Li, Yi-Yuan; Ye, Zhuang-Xin (2023). "Co-option of a non-retroviral endogenous viral element in planthoppers". Nature Communications. 14 (1) 7264. Bibcode:2023NatCo..14.7264H. doi:10.1038/s41467-023-43186-2. PMC 10636211. PMID 37945658.
- ^ Palatini, Umberto; Miesen, Pascal; Carballar-Lejarazu, Rebeca; Ometto, Lino; Rizzo, Elisa; Tu, Zhijian; van Rij, Ronald P.; Bonizzoni, Mariangela (2017). "Comparative genomics shows that viral integrations are abundant and express piRNAs in the arboviral vectors Aedes aegypti and Aedes albopictus". BMC Genomics. 18 (1) 512. doi:10.1186/s12864-017-3903-3. PMC 5496407. PMID 28676109.
- ^ a b {{cite journal |last1=Chiba |first1=Sotaro |last2=Kondo |first2=Hideki |last3=Tani |first3=Akio |last4=Saisho |first4=Daisuke |last5=Sakamoto |first5=Wataru |last6=Kanematsu |first6=Satoko |last7=Suzuki |first7=Nobuhiro |title=Widespread endogenization of genome sequences of non-retroviral RNA viruses into plant genomes |journal=PLOS Pathogens |year=2011 |volume=7 |issue=7|article-number=e1002146 |doi=10.1371/journal.ppat.1002146 |pmid=21779172 |pmc=3136472 |doi-access=free}
- ^ Veglia, Alex J.; Bistolas, Kalia S. I.; Vega Thurber, Rebecca L. (2023). "Endogenous viral elements reveal associations between a non-retroviral RNA virus and symbiotic dinoflagellate genomes". Communications Biology. 6 (1) 566. doi:10.1038/s42003-023-04917-9. PMC 10235124. PMID 37264063.
- ^ Gilbert, Clement; Feschotte, Cedric (2010). "Genomic fossils calibrate the long-term evolution of hepadnaviruses". PLOS Biology. 8 (9) e1000495. doi:10.1371/journal.pbio.1000495. PMC 2950124. PMID 20927357.
- ^ Suh, Alexander; Brosius, Jürgen; Schmitz, Jürgen; Kriegs, Jan Ole (2013). "The genome of a Mesozoic paleovirus reveals the evolution of hepatitis B viruses". Nature Communications. 4 1791. Bibcode:2013NatCo...4.1791S. doi:10.1038/ncomms2798. PMID 23653203.
- ^ Vassilieff, Héléna; Geering, Andrew D. W.; Choisne, Nathalie; Teycheney, Pierre-Yves; Maumus, Florian (3 July 2023). "Endogenous Caulimovirids: Fossils, Zombies, and Living in Plant Genomes". Biomolecules. 13 (7): 1069. doi:10.3390/biom13071069. PMC 10377300. PMID 37509105.
- ^ Geering, Andrew D. W.; Choisne, Nathalie; Vezzulli, Silvia; Sharahskin, Tanya; Velasco, Riccardo; Quesneville, Hadi; Teycheney, Pierre-Yves. "Endogenous caulimovirid sequences are widespread in plant genomes" (PDF).
- ^ Staginnus, Christina; Richert-Pöggeler, Katja R. (2006). "Endogenous pararetroviruses: two-faced travelers in the plant genome". Trends in Plant Science. 11 (10): 485–491. Bibcode:2006TPS....11..485S. doi:10.1016/j.tplants.2006.08.008. PMID 16949329.
- ^ Richert-Pöggeler, Katja R.; Vijverberg, Kitty; Alisawi, Osamah; Chofong, Gilbert N.; Heslop-Harrison, J. S.; Schwarzacher, Trude (2021). "Participation of multifunctional RNA in replication, recombination and regulation of endogenous plant pararetroviruses (EPRVs)". Frontiers in Plant Science. 12 689307. Bibcode:2021FrPS...1289307R. doi:10.3389/fpls.2021.689307. PMC 8256270. PMID 34234799.
- ^ "Lurking in genomic shadows: How giant viruses fuel the evolution of algae". Phys.org. Virginia Tech. 18 November 2020.
- ^ a b c d Sarre, Luke A.; Kim, Iana V.; Ovchinnikov, Vladimir; Olivetta, Marine; Suga, Hiroshi; Dudin, Omaya; Sebé-Pedrós, Arnau; de Mendoza, Alex (2024). "DNA methylation enables recurrent endogenization of giant viruses in an animal relative". Science Advances. 10 (28) eado6406. Bibcode:2024SciA...10O6406S. doi:10.1126/sciadv.ado6406. PMC 11244446. PMID 38996012.
- ^ a b c Mckeown, Dean; Cormier, Alexandre; Schroeder, Declan; Couloux, Arnaud; Tadrent, Nachida; Cock, J. Mark; Corre, Erwan (2025). "Multiple, diverse endogenous giant virus elements within the genome of a brown alga". Virus Evolution. 11 (1) veaf009. doi:10.1093/ve/veaf009. PMC 11890097. PMID 40061209.
- ^ a b Jivaji, Abdeali M.; Karki, Sangita; Franken, Emma; Erazo-Garcia, Maria P.; Barth, Zach K.; Aylward, Frank O. (2025). "Giant endogenous viral elements in the genome of the model protist Euglena gracilis reveal past interactions with giant viruses". Journal of Virology. 99 (9) e00713-25. doi:10.1128/jvi.00713-25. PMC 12455990. PMID 40856505.
- ^ a b c Moniruzzaman, Mohammad; Weinheimer, Alaina R.; Martinez-Gutierrez, Carolina A.; Aylward, Frank O. (2020). "Widespread endogenization of giant viruses shapes genomes of green algae". Nature. 588 (7836): 141–145. Bibcode:2020Natur.588..141M. doi:10.1038/s41586-020-2924-2. PMID 33208937.
- ^ "Biologists Find Ancient Giant Virus Insertions in Genome of Unicellular Parasite". Sci.News. 16 July 2024. Retrieved 24 May 2026.
- ^ Forterre, Patrick; Gaïa, Morgan (2021). "Giant DNA viruses make big strides in eukaryote evolution". Cell Host and Microbe. 29 (2): 152–154. doi:10.1016/j.chom.2021.01.008. PMID 33571441.
- ^ Fischer, Matthias G.; Hackl, Thomas (2016). "Host genome integration and giant virus-induced reactivation of the virophage mavirus". Nature. 540 (7632): 288–291. Bibcode:2016Natur.540..288F. doi:10.1038/nature20593. PMID 27929021.
- ^ Koslová, Anna; Hackl, Thomas; Fischer, Matthias G. (2024). "Endogenous virophages are active and mitigate giant virus infection in the marine protist Cafeteria burkhardae". Proceedings of the National Academy of Sciences. 121 (11) e2314606121. Bibcode:2024PNAS..12114606K. doi:10.1073/pnas.2314606121. PMC 10945749. PMID 38446847.
- ^ Hackl, Thomas; Duponchel, Sarah; Barenhoff, Karina; Weinmann, Alexa; Fischer, Matthias G. (2021). "Virophages and retrotransposons colonize the genomes of a heterotrophic flagellate". eLife. 10 e72674. doi:10.7554/eLife.72674. PMC 8547959. PMID 34698016.
- ^ Stetson, Daniel B. (10 October 2012). "Endogenous retroelements and autoimmune disease". Current Opinion in Immunology. 24 (6): 692–697. doi:10.1016/j.coi.2012.09.007. PMC 4005353. PMID 23062469.