Scientists Uncover a Surprising Connection Between Number Theory & Evolutionary Genetics

ReachMD Healthcare Image


The robustness of every component of every folded phenotype for both the RNA and HP GP maps (of various lengths) is plotted against the frequency (fraction of vertices f = |V(G)|/k in the entire Hamming graph) alongside the continuous interpolation of the maximum robustness curve from equation (3.6)) (black line). The minimum robustness of a neutral component, equation (4.1), is given by the blue line, and the random null expectation, equation (1.2), is plotted as the red line. Note that for these small neutral components, the minimum component robustness exceeds the random null expectation since, by definition, the neutral component consists of vertices which are a connected component. Such a collection of vertices will naturally have more edges than the average collection of vertices randomly selected to form a neutral network in the random null model. The data for these curves can be generated by using the RoBound Calculator [52] tool we have introduced. The natural GP maps all contain neutral components which attain the bricklayer’s graph bound (as well as some very low robustness components that attain the minimum bound). The unfolded (trivial) phenotype is omitted from each of these plots. The minimum robustness appears to be larger than the bricklayer’s graph line for low frequencies; but, this only happens for non-integer values of the number of vertices. Of course, any graph will have an integer number of vertices; in all of those cases, the bricklayer’s graph robustness will be greater than or equal to the minimum robustness. Credit: Journal of The Royal Society Interface (2023). DOI: 10.1098/rsif.2023.0169

Number theory, the study of the properties of positive integers, is perhaps the purest form of mathematics. At first sight, it may seem far too abstract to apply to the natural world. In fact, the influential American number theorist Leonard Dickson wrote, "Thank God that number theory is unsullied by any application."

And yet, again and again, number theory finds unexpected applications in science and engineering, from leaf angles that (almost) universally follow the Fibonacci sequence, to modern encryption techniques based on factoring prime numbers. Now, researchers have demonstrated an unexpected link between number theory and evolutionary genetics. Their work is published in the Journal of The Royal Society Interface.

Specifically, the team of researchers (from Oxford, Harvard, Cambridge, GUST, MIT, Imperial, and the Alan Turing Institute) have discovered a deep connection between the sums-of-digits function from number theory and a key quantity in genetics, the phenotype mutational robustness. This quality is defined as the average probability that a point mutation does not change a phenotype (a characteristic of an organism).

The discovery may have important implications for evolutionary genetics. Many genetic mutations are neutral, meaning that they can slowly accumulate over time without affecting the viability of the phenotype. These neutral mutations cause genome sequences to change at a steady rate over time. Because this rate is known, scientists can compare the percentage difference in the sequence between two organisms and infer when their latest common ancestor lived.

But the existence of these neutral mutations posed an important question: what fraction of mutations to a sequence are neutral? This property, called the phenotype mutational robustness, defines the average amount of mutations that can occur across all sequences without affecting the phenotype.

Professor Ard Louis from the University of Oxford, who led the study, said, "We have known for some time that many biological systems exhibit remarkably high phenotype robustness, without which evolution would not be possible. But we didn't know what the absolute maximal robustness possible would be, or if there even was a maximum."

It is precisely this question that the team has answered. They proved that the maximum robustness is proportional to the logarithm of the fraction of all possible sequences that map to a phenotype, with a correction which is given by the sums of digits function sk(n), defined as the sum of the digits of a natural number n in base k. For example, for n = 123 in base 10, the digit sum would be s10(123) = 1 + 2 + 3 = 6.

Another surprise was that the maximum robustness also turns out to be related to the famous Tagaki function, a bizarre function that is continuous everywhere, but differentiable nowhere. This fractal function is also called the blancmange curve, because it looks like the French dessert.

First author Dr. Vaibhav Mohanty (Harvard Medical School) added, "What is most surprising is that we found clear evidence in the mapping from sequences to RNA secondary structures that nature in some cases achieves the exact maximum robustness bound. It's as if biology knows about the fractal sums-of-digits function."

Professor Ard Louis added, "The beauty of number theory lies not only in the abstract relationships it uncovers between integers, but also in the deep mathematical structures it illuminates in our natural world. We believe that many intriguing new links between number theory and genetics will be found in the future."

More information: Vaibhav Mohanty et al, Maximum mutational robustness in genotype–phenotype maps follows a self-similar blancmange-like curve, Journal of The Royal Society Interface (2023). DOI: 10.1098/rsif.2023.0169

Citation: Scientists uncover a surprising connection between number theory and evolutionary genetics (2023, August 1) retrieved 1 August 2023 from

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Facebook Comments


We’re glad to see you’re enjoying Prova Education…
but how about a more personalized experience?

Register for free