Service interruption on Monday 11 July from 12:30 to 13:00: all the sites of the CCSD (HAL, EpiSciences, SciencesConf, AureHAL) will be inaccessible (network hardware connection).
Skip to Main content Skip to Navigation
Journal articles

Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences

Abstract : Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as Zipf's law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes.
Keywords : noncoding DNA
Document type :
Journal articles
Complete list of metadata

Cited literature [73 references]  Display  Hide  Download
Contributor : Lauriane Pillet Connect in order to contact the contributor
Submitted on : Thursday, May 28, 2020 - 2:16:08 PM
Last modification on : Tuesday, July 20, 2021 - 5:20:05 PM


2016_Sheinman_Scientific Repor...
Publisher files allowed on an open archive


Distributed under a Creative Commons Attribution 4.0 International License




M. Sheinman, A. Ramisch, F. Massip, P.F. Arndt. Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences. Scientific Reports, Nature Publishing Group, 2016, 6, pp.30851. ⟨10.1038/srep30851⟩. ⟨hal-02049888⟩



Record views


Files downloads