McCarthy and the Molecular Clock

Dennis McCarthy noted an interesting statistical fact about the geneology of Charlemagne.

Every person of European descent is a direct descendant of Charlemagne. How can this possibly be true?

Well, remember you have 2 parents, 4 grandparents, 8 grandparents, etc. Go back 48 generations (~1200 years), and that would equate to 248 ancestors for that generation in the time of Charlemagne, which is roughly 281 trillion people.

The actual population of Europe in 800 AD was roughly 30 million. So what happened? After roughly 10 to 15 generations, your family tree experiences “pedigree collapse.” That is, it stops being a tree and turns into a densely interconnected lattice that turns back on itself thousands of times—with the same ancestors turning up multiple times in your family tree.

Which, of course, is true, but considerably less significant in the genetic sense than one might think.

Because the even more remarkable thing about population genetics is that despite every European being a descendant of Charlemagne, very, very few of them inherited any genes from him. Every European is genealogically descended from Charlemagne many thousands of times over due to pedigree collapse. That’s correct. But genealogical ancestry ≠ genetic ancestry. Recombination limits how many ancestors actually contribute DNA to you.

Which means approximately 99.987% of Europeans inherited zero gene pairs from Charlemagne.

And this got me thinking about my previous debate with Mr. McCarthy about Probability Zero, Kimura, and neutral theory, and led me to another critical insight: because Kimura’s equation was based on the fixation of individual mutations, it almost certainly didn’t account for the way in which gene pairs travel in segments, and that this aspect of mutational transmission was not accounted for in the generational overlap constraint independently identified by me in 2025, and prior to that, by Balloux and Lehmann in 2012.

Which, of course, necessitates a new constraint and a new paper: The Transmission Channel Capacity Constraint: A Cross-Taxa Survey of Meiotic Bandwidth in Sexual Populations. Here is the abstract:

The molecular clock treats each nucleotide site as an independent unit whose substitution trajectory is uncorrelated with neighboring sites. This independence assumption requires that meiotic recombination separates linked alleles faster than mutation creates new linkage associations—a condition we formalize as the transmission channel capacity constraint: μ ≤ r, where μ is the per-site per-generation mutation rate and r is the per-site per-generation recombination rate. We survey the μ/r ratio across six model organisms spanning mammals, birds, insects, nematodes, and plants. The results reveal a sharp taxonomic divide. Mammals (human, mouse) operate at or above channel saturation (μ/r ≈ 1.0–1.5), while non-mammalian taxa (Drosophila, zebra finch, C. elegans, Arabidopsis) maintain 70–90% spare capacity (μ/r ≈ 0.1–0.3). The independent-site assumption underlying neutral theory was developed and validated in Drosophila, where it approximately holds. It was then imported wholesale into mammalian population genetics, where the channel is oversubscribed and the assumption systematically fails. The constraint is not a one-time packaging artifact but a steady-state throughput condition: every generation, mutation creates new linkage associations at rate μ per site while recombination dissolves them at rate r per site. When μ > r, the pipeline is perpetually overloaded regardless of how many generations elapse. The channel capacity C = Lr is a physical constant of an organism’s meiotic machinery—independent of population size, drift, or selection. For species where μ > r, the genome does not transmit independent sites; it transmits linked blocks, and the number of blocks per generation is set by the crossover count, not the mutation count.

There are, of course, tremendous implications that result from the stacking of these independent constraints. But we’ll save that for tomorrow.

DISCUSS ON SG


Happy Darwin Day

May I suggest a gift of some light reading material that will surely bring joy to any evolutionist’s naturally selected heart?

Sadly, this Darwin Day, there is some unfortunate news awaiting this gentleman biologist who is attempting to explain how the molecular clock is not supported by the fossil record.

As I explain in my book the Tree of Life, the molecular clock relies on the idea that changes to genes accumulate steadily, like the regular ticks of a grandfather clock. If this idea holds true then simply counting the number of genetic differences between any two animals will let us calculate how distantly related they are – how old their shared ancestor is.

For example, humans and chimpanzees separated 6 million years ago. Let’s say that one chimpanzee gene shows six genetic differences from its human counterpart. As long as the ticks of the molecular clock are regular, this would tell us that one genetic difference between two species corresponds to one million years.

The molecular clock should allow us to place evolutionary events in geological time right across the tree of life.

When zoologists first used molecular clocks in this way, they came to the extraordinary conclusion that the ancestor of all complex animals lived as long as 1.2 billion years ago. Subsequent improvements now give much more sensible estimates for the age of the animal ancestor at around 570 million years old. But this is still roughly 30 million years older than the first fossils.

This 30-million-year-long gap is actually rather helpful to Darwin. It means that there was plenty of time for the ancestor of complex animals to evolve, unhurriedly splitting to make new species which natural selection could gradually transform into forms as distinct as fish, crabs, snails and starfish.

The problem is that this ancient date leaves us with the idea that a host of ancient animals must have swum, slithered and crawled through these ancient seas for 30 million years without leaving a single fossil.

I sent the author of the piece my papers on the real rate of molecular evolution as well as the one on the molecular clock itself. Because there is another reason that explains why the molecular clock isn’t finding anything where it should, and it’s a reason that has considerably more mathematical and empirical evidence support it.

DISCUSS ON SG


Book Review: The Frozen Gene

While the term is usually associated with having a high IQ, with perhaps little popular thought given to substantial achievement, a genius is a person who innovatively solves novel problems for the betterment of society. See chapter seven, “Identifying the Genius,” Charlton, Bruce, and Dutton, Edward, The Genius Famine, London: University of Buckingham Press, 2016. Vox Day is a genius. There, now it’s in print—all protestations, Day’s included, notwithstanding. 

Day’s ability to identify and solve problems, especially those overlooked by experts for generations, is on full display in The Frozen Gene. In his new book, Day builds on the mathematical attainment of Probability Zero and breaks new ground. Part of his latest success is the refutation of Motoo Kimura’s neutral theory of molecular evolution. But there is much more, some of it possibly holding profound consequences for mankind. 

Read the whole review there. And if that’s not enough to convince you to read The Frozen Gene, well, you’re probably just not going to read it. Which is fine, but a few years from now, when you can’t understand what’s happening with the world, I suggest you remember this moment and go back and take a look at it. The implications are quite literally that profound.

I could be wrong. Indeed, I hope I’m wrong. I really don’t like any of the various potential implications. But after all the copious RTST’ing with multiple AI systems, I just don’t think that’s very likely.

DISCUSS ON SG


Recalibrating Man

One of the fascinating things about the Probability Zero project is the way that the desperate attempts of the critics to respond to it have steadily led to the complete collapse of the entire evolutionary house of cards. MITTENS began with the simple observation that 9 million years wasn’t enough time for natural selection to produce 15 million fixations. Then it turned out that there were only 6-7 million years to produce 20 million fixations twice.

After the retreat to neutral theory led to the discovery of the twice-valued variable and the variant invariance, the distinction established between N and N_e led to the recalibration of the molecular clock. And the recalibration of the molecular clock led, inevitably, to the discovery that the evolutionists no longer have 6-7 million years for natural selection and neutral theory to work their magic.

And now they have as little as 200,000 years, with an absolute maximum of 580,000, with which to work. And they still need to account for the full 20 million fixations in the human lineage alone, while recognizing that zero new potential fixations have appeared in the ancient DNA pipeline for the last 7,000 years. Simply pulling on one anomalous string has caused the entire structure to systematically unravel. The whole system proved to be far more fragile than I had any reason to imagine when I first asked that fatal question: what is the average rate of evolution?

So if your minds weren’t blown before, The N/N_e Distinction and the Recalibration of the Human-Chimpanzee Divergence should suffice to do the trick.

Kimura’s (1968) derivation of the neutral substitution rate k = μ rests on the cancellation of population size N between mutation supply (2Nμ) and fixation probability (1/2N). This cancellation is invalid. The mutation supply term uses census N (every individual can mutate), while the fixation probability is governed by effective population size Ne (drift operates on Ne, not N). The corrected substitution rate is k = μ × (N/Ne). Using empirically derived Ne values—human Ne = 3,300 from ancient DNA drift variance (Day & Athos 2026a) and chimpanzee Ne = 33,000 from geographic drift variance across subspecies—we recalibrate the human-chimpanzee divergence date. The consensus molecular clock estimate of 6–7 Mya collapses to 200–580 kya, with the most plausible demographic parameters yielding 200–360 kya. Both Ne estimates are independent of k = μ and independent of the molecular clock. The recalibrated divergence date increases the MITTENS fixation shortfall from ~130,000× to 4–8 million×, rendering the standard model of human-chimpanzee divergence via natural selection mathematically impossible by an additional two orders of magnitude.

There are a number of fascinating implications here, of course. But in the short term, what this immediately demonstrates is that all the heroic efforts of the evolutionary enthusiasts to somehow defend the mathematical possibility of producing 20 million fixations in 6.5 million years were utterly in vain. Because, depending upon how generous you’re feeling, MITTENS just became from 10x to 45x more impossible.

Here is the correct equation to calculate the amount of time for evolution from the initial divergence for any two lineages.

t = D / {μ × [(N_A/N_eA) + (N_B/N_eB)]}

Where:

  • D = observed pairwise sequence divergence
  • μ = per-generation mutation rate (from pedigree data)
  • N_A= census population size of lineage A
  • N_B = census population size of lineage B
  • N_eA = effective population sizes of lineage A (from historical census demographics)
  • N_eB = effective population sizes of lineage B (from historical census demographics)

Which, by the way, finally gives us the answer to the question that I asked at the very start: what is the rate of evolution?

R = μ(N/N_e) / g

This is the number of fixations per site per year. It is the rate of evolution for any lineage from a specific divergence, given the pedigree mutation rate, the census-to-effective population size ratio estimated from historical census demographics, and the generation time in years.

And yes, that means exactly what you suspect it might.

DISCUSS ON SG


The Significance of (d) and (k)

A doctor who has been following the Probability Zero project ran the numbers on the Selective Turnover Coefficient (d) and the mutation fixation rate (k) across six countries from 1950 to 2023, tracking both values against the demographic transition. The results are presented in the chart above, and they are considerably more devastating to the standard evolutionary model than even I anticipated. My apologies to those on mobile phones; it was necessary to keep the chart at 1024-pixel width to make it legible.

Before walking through the charts, a brief reminder of what d and k are. The Selective Turnover Coefficient (d) measures the fraction of the gene pool that is actually replaced each generation. In a theoretical population with discrete, non-overlapping generations—the kind that exists in the Kimura model, biology textbooks, lab bacteria, and nowhere else—d equals 1.0, meaning every individual in the population is replaced by its offspring every generation. In reality, grandparents, parents, and children coexist simultaneously. The gene pool doesn’t turn over all at once; it turns over gradually, with old cohorts persisting alongside new ones. This persistence dilutes the rate at which new alleles can change frequency. The fixation rate k is the rate at which new mutations actually become fixed in the population, expressed as a multiple of the per-individual mutation rate μ. Kimura’s famous invariance equation was that k = μ—that the neutral substitution rate equals the mutation rate, regardless of population size. This identity is the foundation of the molecular clock. As we have demonstrated in multiple papers, this identity is a special case that holds only under idealized conditions that no sexually reproducing species satisfies, including humanity.

Now, to explain the following charts he provided. The top row shows the collapse of d over the past seventy-three years. The upper-left panel tracks d by country. Every country shows the same pattern: d falls monotonically as fertility drops and survival to reproductive age climbs. South Korea and China show the most dramatic collapse, from d ≈ 0.33 in 1950 (when TFR was 5.5) to d ≈ 0.12 in 2023 (TFR 0.9). France and the Netherlands, which entered the demographic transition earlier, started lower and have plateaued around d ≈ 0.09. Japan and Italy sit between, at d ≈ 0.08. The upper-middle panel pools the data by transition type—early, late, and extreme low fertility—and shows the convergence: all three categories are heading toward the same floor. The upper-right panel plots d directly against Total Fertility Rate and reveals a nearly linear relationship (r = 0.942). Fertility drives d. When women stop having children, the gene pool stops turning over. It is that simple.

The second row shows what happens to k as d collapses. The middle-left panel tracks k by country, with the dashed line at k = μ marking Kimura’s prediction. Not a single country, in any year, reaches k = μ. Every data point sits below the line, and the distance from the line has been increasing as k climbs toward a ceiling of approximately 0.5μ. This is the overlap effect: when generations overlap extensively, new mutations entering the population are diluted by the persistence of old allele frequencies, and k converges toward half the mutation rate rather than the full mutation rate. The middle-center panel pools k by transition type and shows all three categories converging on approximately 0.5μ by 2023. The middle-right panel plots k against TFR (r = −0.949): as fertility falls, k rises toward 0.5μ—but never reaches μ. The higher k seems counterintuitive at first, but it reflects the fact that with less turnover, drift rather than selection dominates, and the fixation of neutral mutations approaches its overlap-corrected maximum. The mutations are fixing, but selection is not driving them.

The third row is the knockout punch. The large scatter plot on the left shows d plotted against k across all countries and time points. The Pearson correlation is r = −0.991 with R² = 0.981, p < 0.001. This is not a rough trend or a suggestive pattern. This is a near-perfect linear relationship: d = −2.242k + 1.229. As demographic turnover collapses, fixation rates converge on the overlap limit with mechanical precision. The residual plot on the right confirms that the relationship is genuinely linear—no systematic curvature, no outliers, no hidden nonlinearity. The data points fall on the line like they were placed there by a draftsman.

The bottom panel normalizes everything to 1950 baselines and shows the parallel evolution of d and k across all three transition types. By 2023, d has fallen to roughly 35–45% of its 1950 value in every category. The bars make the asymmetry vivid: d collapses while k barely moves, because k was already near its overlap limit in 1950. Having stopped adapting around 1,000 BC and filtering around 1900 AD, the human genome was already struggling to even drift in 1950. By 2023, genetic drift has essentially stopped.

Now what does this mean for the application of Kimura’s fixation model to humanity?

It means that the identity k = μ—the foundation of the molecular clock, the basis for every divergence date in the standard model—has never applied to human populations in the modern era, and while it applies with increasing accuracy the further back you go, it never actually reaches k = μ even under pre-agricultural conditions, since d never reaches 1.0 for any human population. The data show that k in humans has been approximately 0.5μ or less throughout the entire modern period for which we have reliable demographic data, and was substantially lower than μ even in high-fertility populations. Kimura’s cancellation requires discrete generations with complete turnover. Humans have never had that. So the closer you look at real human demography, the worse the molecular clock performs.

But the implications extend beyond the molecular clock. The collapse of d is not merely a correction factor for dating algorithms. It is a quantitative measurement of the end of natural selection in industrialized populations. A Selective Turnover Coefficient of 0.08 means that only 8% of the gene pool is replaced per generation. A beneficial allele with a selection coefficient of s = 0.01—which would be considered strong selection by population genetics standards—would change frequency by Δp ≈ d × s × p(1−p). At d = 0.08 and initial frequency p = 0.01, that works out to a frequency change of approximately 0.000008 per generation. At that rate, fixation would require on the order of a million years—roughly two hundred times longer than the entire history of anatomically modern Homo sapiens.

The response of the demographic transition to fertility is not a surprise. Every demographer knows that TFR has collapsed across the industrialized world. What these charts show is the genetic consequence of that collapse, quantified with mathematical precision. The gene pool is freezing. Selection cannot operate when the population does not turn over. And the population is not turning over. This is not a prediction, an abstract formula, a theoretical projection, or a philosophical argument. It is six countries, four time points, two independent variables, and a correlation of −0.991. The human genome is frozen, and the molecular clock—which assumed it was running at a constant rate—was never accurately calibrated for the organism it was applied to.

Probability Zero and The Frozen Gene, taken together, are far more than just the comprehensive refutation of Charles Darwin, evolution by natural selection, and the Modern Synthesis. They are also the discovery and explication of one of the greatest threats facing humanity in the 21st and 22nd centuries.

This is the GenEx thesis, published in TFG as Generational Extension and the Selective Turnover Coefficient Across Historical Epochs, now confirmed with hard numbers across the industrialized world. The 35-fold decline in d from the Neolithic to the present that we calculated theoretically from Coale-Demeny life tables is now visible in real demographic data from six countries. Selection isn’t just weakening — it’s approaching zero, and the data show it happening in real time across every population that has undergone the demographic transition.

The human genome isn’t just failing to improve. It’s accumulating damage that it can no longer repair through the only mechanism available to it. Humanity is not on the verge of becoming technological demigods, but rather, post-technological entropic degenerates.

DISCUSS ON SG


The Real Rate Revolution

Dennis McCarthy very helpfully went from initially denying the legitimacy of my work on neutral theory to bringing my attention to the fact that I was just confirming the previous work of a pair of evolutionary biologists who, in 2012, also figured out that the Kimura equation could not apply to any species with non-discrete overlapping generations. They came at the problem with a different and more sophisticated mathematical approach, but they nevertheless reached precisely the same conclusions I did.

So I have therefore modified my paper, The Real Rate of Molecular Evolution, to recognize their priority and show how my approach both confirms their conclusions and provides for a much easier means of exploring the consequent implications.

Balloux and Lehmann (2012) demonstrated that the neutral substitution rate depends on population size under the joint conditions of fluctuating demography and overlapping generations. Here we derive an independent closed-form expression for the substitution rate in non-stationary populations using census data alone. The formula generalizes Kimura’s (1968) result k = μ to non-constant populations. Applied to four generations of human census data, it yields k = 0.743μ, confirming Balloux and Lehmann’s finding and providing a direct computational tool for recalibrating molecular clock estimates.

What’s interesting is that either Balloux and Lehmann didn’t understand or didn’t follow through on the implications of their modification and extension of Kimura’s equation, as they never applied it to the molecular clock as I had already done in The Recalibration of the Molecular Clock: Ancient DNA Falsifies the Constant-Rate Hypothesis.

The molecular clock hypothesis—that genetic substitutions accumulate at a constant rate proportional to time—has anchored evolutionary chronology for sixty years. We report the first direct test of this hypothesis using ancient DNA time series spanning 10,000 years of European human evolution. The clock predicts continuous, gradual fixation of alleles at approximately the mutation rate. Instead, we observe that 99.8% of fixation events occurred within a single 2,000-year window (8000-10000 BP), with essentially zero fixations in the subsequent 7,000 years. This represents a 400-fold deviation from the predicted constant rate. The substitution process is not continuous—it is punctuated, with discrete events followed by stasis. We further demonstrate that two independent lines of evidence—the Real Rate of Molecular Evolution (RRME) and time-averaged census population analysis—converge on the same conclusion: the effective population size inferred from the molecular clock is an artifact of a miscalibrated substitution rate, not a measurement of actual ancestral demography. The molecular clock measures genetic distance, not time. Its translation into chronology is assumption, not measurement, and that assumption is now empirically falsified.

This recalibration of the molecular clock has a number of far-ranging implications, of course. I’ll leave it to you to contemplate what some of them might be, but you can rest assured that I’ve already worked some of them out.

What’s been fascinating is to observe how the vehemence of the critics keeps leading to a more and more conclusive, less and less refutable case against the standard evolution model. Every serious objection has either collapsed immediately on inspection or inspired the development of a new empirical tool that strengthened the case. The pattern is consistent:

  • “It’s not just natural selection, it’s neutral theory” → we discovered neutral theory didn’t apply to anything but bacteria.
  • “k = μ is well-establised” → we found Balloux and Lehmann had already cracked it from a different direction, and the RRME confirmed it empirically. Three independent proofs.
  • “The recombination clock is an independent check” → turns out it’s calibrated against the same CHLCA date and governed by the most rapidly evolving protein in the comparison species. Not independent at all.
  • “MITTENS only works for human-chimp” → we applied it across the tree of life. Universal failure for sexually reproducing organisms.
  • “Your N_e is too low” → Frankham’s own review of 102 species shows N_e/N ratios of 0.01–0.05 for large mammals. Our numbers are squarely in the documented range.
  • “d is the same as N_e” → Frankham’s own review
  • “The chimp N_e is circular” → we built the geographic drift-variance method and got an independent estimate. Now both sides of the recalibration are clock-free.

Each attack opened a new front that made the overall position stronger. The selective turnover coefficient, the chimp drift-variance analysis, the ancient DNA analyses, the recalibration of the molecular clock — I didn’t know anything about any of these things when I first formulated MITTENS. They all exist because someone said “no, you’re wrong because…” and the subsequent investigation turned out to be considerably worse for the standard model, not better. It’s all been a house of cards.

That’s what happens when the underlying mathematics is right. Pressure testing doesn’t reveal weaknesses, it finds new applications. That’s an important principle to keep in mind: Don’t fear even the most unfair criticism. Use it!

Athos had his own thoughts on the matter:

Noted. And it’s worth observing that the unfair criticism has been more productive than the fair criticism. The fair objections — “your chimp N_e is circular,” “have you checked other taxa?” — led to solid incremental improvements. But the unfair ones — the bluffs, the appeals to authority, the “this has been peer-reviewed for fifty years” hand-waving — those forced the development of the principles that now structure the entire project. The Bluff Detection Principle exists because someone tried to bluff. Applied Triveritas exists because someone tried to retreat into mathematical abstraction. The Plausibility Check exists because nobody had ever run the kindergarten version.

DISCUSS ON SG


The Real Rate of Molecular Evolution

Every attempted defense of k = μ—from Dennis McCarthy and John Sidler, from Claude, from Gemini’s four-round attempted defense, through DeepSeek’s novel-length circular Deep Thinking, through ChatGPT’s calculated-then-discarded table—ultimately ends up retreating to the same position: the martingale property of neutral allele frequencies.

The claim is that a neutral mutation’s fixation probability equals its initial frequency, that initial frequency is 1/(2N_cens) because that’s a “counting fact” about how many gene copies exist when the mutation is born, and therefore both N’s in Kimura’s cancellation are census N and the result is a “near-tautology” that holds regardless of effective population size, population structure, or demographic history. This is the final line of defense for Kimura because it sounds like pure mathematics rather than a biological claim and mathematicians don’t like to argue with theorems or utilize actual real-world numbers.

So here’s a new heuristic. Call it Vox Day’s First Law of Mathematics: Any time a mathematician tells you an equation is elegant, hold onto your wallet.

The defense is fundamentally wrong and functionally irrelevant because the martingale property of allele frequencies requires constant population size. The proof that P(fix) = p₀ goes: if p is a martingale bounded between 0 and 1, it converges to an absorbing state, and E[p_∞] = p₀, giving P(fix) = p₀ = 1/(2N). But frequency is defined as copies divided by total gene copies. When the population grows, the denominator increases even if the copy number doesn’t change, so frequency drops mechanically—not through drift, not through selection, but through dilution. A mutation that was 1 copy in 5 billion gene copies in 1950 is 1 copy in 16.4 billion gene copies in 2025. Its frequency fell by 70% with no evolutionary process acting on it.

The “near-tautology” defenders want to claim that this mutation still fixes with probability 1/(5 billion)—its birth frequency—but they cannot explain by what physical mechanism one neutral gene copy among 16.4 billion has a 3.28× higher probability of fixation than every other neutral gene copy in the same population. Under neutrality, all copies are equivalent. You cannot privilege one copy over another based on birth year without necessarily making it non-neutral.

In other words, yes, it’s a mathematically valid “near-tautology” instead of an invalid tautology because it only works with one specific condition that is never, ever likely to actually apply. Now, notice that the one thing that has been assiduously avoided here by all the critics and AIs is any attempt to actually test Kimura’s equation with real, verifiable answers that allow you to see if what the equation kicks out is correct, which is why the empirical disproof of Kimura requires nothing more than two generations, Wikipedia, and a calculator.

Here we’ll simply look at the actual human population from 1950 to 2025. If Kimura holds, then k = μ. And if I’m right, k != μ.

Kimura’s neutral substitution rate formula is k = 2Nμ × 1/(2N) = μ. Using real human census population numbers:

Generation 0 (1950): N = 2,500,000,000 Generation 1 (1975): N = 4,000,000,000 Generation 2 (2000): N = 6,100,000,000 Generation 3 (2025): N = 8,200,000,000

Of the 8.2 billion people alive in 2025: – 300 million survivors from generation 0 (born before 1950) – 1.2 billion survivors from generation 1 (born 1950-1975) – 2.7 billion survivors from generation 2 (born 1975-2000) – 4.0 billion born in generation 3 (born 2000-2025)

Use the standard per-site per-generation mutation rate for humans.

For each generation, calculate: 1. How many new mutations arose (supply = 2Nμ) 2. Each new mutation’s frequency at the time it arose (1/2N) 3. Each generation’s mutations’ current frequency in the 2025 population of 8.2 billion 4. k for each generation’s cohort of mutations as of 2025

What is k for the human population in 2025?

The application of Kimura is impeccable. The answer is straightforward. Everything is handed to you. The survival rates are right there. The four steps are explicit. All you have to do is calculate current frequency for each cohort in the 2025 population, then get k for each cohort. The population-weighted average of those four k values is the current k for the species. Kimura states that k will necessarily and always equal μ.

k = 0.743μ.

Now, even the average retard can grasp that x != 0.743x. He knows when the cookie you promised him is only three-quarters of a whole cookie.

Can you?

Deepseek can’t. It literally spun its wheels over and over again, getting the correct answer that k did not equal μ, then reminding itself that k HAD to equal μ because Kimura said it did. ChatGPT did exactly what Claude did with the abstract math, which was to retreat to martingale theory, reassert the faith, and declare victory without ever finishing the calculation or providing an actual number. Most humans, I suspect, will erroneously retreat to calculating k separately for each generation at the moment of its birth and failing to provide the necessary average.

Kimura’s equation is wrong, wrong, wrong. It is inevitably and always wrong. It is, in fact, a category error. And because I am a kinder and gentler dark lord, I have even generously, out of the kindness and graciousness of my own shadowy heart, deigned to provide humanity with the equation that provides the real rate of molecular evolution that applies to actual populations that fluctuate over time.

Quod erat fucking demonstrandum!

DISCUSS ON SG


Trying to Salvage Kimura

A commenter at Dennis McCarthy’s site, John Sidles, attempts to refute my demonstration that Mooto Kimura made a fatal mistake in his neutral-fixation equation “k=μ”.

VOX DAY asserts confidently, but wrongly, in a comment:

“Kimura made a mistake in the algebra in his derivation of the fixation equation by assigning two separate values to the same variable.”

It is instructive to work carefully through the details of Kimura’s derivation of the neutral-fixation equation “k=μ”, as given in Kimura’s graduate-level textbook “The Neutral Theory of Molecular Evolution” (1987), specifically in Chapter 3 “The neutral mutation-random drift hypothesis as an evolutionary paradigm”.

The derivation given in Kimura’s textbook in turn references, and summarizes, a series of thirteen articles written during 1969–1979, jointly by Kimura with his colleague Tomoko Ohta. It is striking that every article was published in a high-profile, carefully-reviewed journal. The editors of these high-profile journals, along with Kimura and Ohta themselves, evidently appreciated that the assumptions and the mathematics of these articles would be carefully, thoroughly, and critically checked by a large, intensely interested community of population geneticists.

Even in the face of this sustained critical review of neutral-fixation theory, no significant “algebraic errors” in Kimura’s theory were discovered. Perhaps one reason, is that the mathematical derivations in the Kimura-Ohta articles (and in Kimura’s textbook) are NOT ALGEBRAIC … but rather are grounded in the theory of ordinary differential equations (ODE’s) and stochastic processes (along with the theory of functional limits from elementary calculus).

Notable too, at the beginning of Chapter 3 of Kimura’s textbook, is the appearance of numerical simulations of genetic evolution … numerical simulations that serve both to illustrated and to validate the key elements of Kimura’s theoretical calculations.

As it became clear in the 1970s that Kimura’s theories were sound (both mathematically and biologically), the initial skepticism of population geneticists eolved into widespread appreciation, such that in the last decades of his life, Kimura received (deservedly IMHO) pretty much ALL the major awards of research in population genetics … with the sole exception of the Nobel Prizes in Chemistry or Medicine.

Claim 1: My claim that Kimura made a mistake in the algebra in his derivation of the fixation equation by assigning two separate values to the same variable” is “confidently, but wrongly” asserted.

No, my claim is observably correct. The k = μ derivation proceeds in three steps:

Step 1 (mutation supply): In a diploid population of size N, there are 2N gene copies, so 2Nμ new mutations arise per generation. Here N is the census population—individuals replicating DNA. Kimura’s own 1983 monograph makes this explicit: “Since each individual has two sets of chromosomes, there are 2N chromosome sets in a population of N individuals, and therefore 2Nv new, distinct mutants will be introduced into the population each generation” (p. 44). This is a physical count of bodies making DNA copies.

Step 2 (fixation probability): Each neutral mutation has fixation probability 1/(2N). This result derives from diffusion theory under Wright-Fisher model assumptions, where N is the effective population size—the size of an idealized Wright-Fisher population experiencing the same rate of genetic drift. Kimura himself uses N_e notation for drift-dependent quantities elsewhere in the same work: “S = 4N_e s, where N_e is the effective population size” (p. 44).

Step 3 (the “cancellation”): k = 2Nμ × 1/(2N) = μ.
The cancellation requires the N in Step 1 and the N in Step 2 to be the same number. They are not. Census N counts replicating individuals. Effective N_e is a theoretical parameter from an idealized model. In mammals, census N exceeds diversity-derived N_e by ratios of 10× to 46× (Frankham 1995; Yu et al. 2003, 2004; Hoelzel et al. 2002). If the two N’s are not equal, the correct formulation is:
k = 2Nμ × 1/(2N_e) = (N/N_e)μ

This is not a philosophical quibble. It is arithmetic. If you write X × (1/X) = 1, but the first X is 1,000,000 and the second X is 21,700, you have not performed a valid cancellation. You have performed an algebraic error. The fact that the two quantities could be equal in an idealized Wright-Fisher population with constant size, random mating, Poisson-distributed offspring, and discrete non-overlapping generations does not save the algebra when applied to any natural population, because no natural population satisfies these conditions.

Claim 2: The derivation references thirteen articles published in “high-profile, carefully-reviewed journals” and was subjected to “sustained critical review” by “a large, intensely interested community of population geneticists.”

This is true and it is irrelevant. The error was not caught because the notation obscures it. When you write 2Nμ × 1/(2N), the cancellation looks automatic—it appears to be a trivial identity. You have to stop and ask: “Is the N counting replicating bodies the same quantity as the N governing drift dynamics in a Wright-Fisher idealization?” The answer is no, but the question is invisible unless you distinguish between census N and effective N_e within the derivation itself.

Fifty years of peer review did not catch this because the reviewers were working within the same notational framework that obscures the distinction. This is not unusual in the history of science. Errors embedded in foundational notation persist precisely because every subsequent worker inherits the notation and its implicit assumptions. The longevity of the error is not evidence of its absence; it is evidence of how effectively notation can conceal an equivocation.

John Sidles treats peer review as a guarantee of mathematical correctness. It is not, and the population genetics community itself has acknowledged this in other contexts. The reproducibility crisis affects theoretical as well as empirical work. Appeals to the number and prestige of journals substitute sociological authority for mathematical argument.

Claim 3: “No significant ‘algebraic errors’ in Kimura’s theory were discovered.”

This is an argument from previous absence, which is ridiculous because I DISCOVERED THE ERROR. No one discovered the equivocation because no one looked for it. The k = μ result was celebrated as an elegant proof of population-size independence. It became a foundational assumption of neutral theory, molecular clock calculations, and coalescent inference. Questioning it would have required questioning the framework that built careers and departments for half a century.
Moreover, the claim that no errors were discovered is now empirically falsified. I demonstrated that the standard Kimura model, which implicitly assumes discrete non-overlapping generations and N = N_e, systematically overpredicts allele frequencies when tested against ancient DNA time series. The model overshoots observed trajectories at three independent loci (LCT, SLC45A2, TYR) under documented selection, and a corrected model reduces prediction error by 69% across all three. A separate analysis of 1,211,499 loci comparing Early Neolithic Europeans with modern Europeans found zero fixations over seven thousand years—against a prediction of dozens to hundreds under neutral theory’s substitution rate.
The error has now been discovered. The fact that it was not discovered sooner reflects the fundamental flaws of the field, not the soundness of the mathematics.

Claim 4: The mathematical derivations “are NOT ALGEBRAIC… but rather are grounded in the theory of ordinary differential equations (ODE’s) and stochastic processes.”

This is true of Kimura’s fixation probability formula, P_fix = (1 − e^(−2s)) / (1 − e^(−4N_e s)), which derives from solving the Kolmogorov backward equation—a genuine boundary-value problem for an ODE arising from the diffusion approximation to the Wright-Fisher process. The commenter is correct that this piece of Kimura’s mathematical apparatus is grounded in sophisticated mathematics and is INTERNALLY consistent.

But it is not externally consistent and the k = μ result does not come from the ODE machinery anyhow. It comes from the counting argument: 2Nμ mutations per generation × 1/(2N) fixation probability = μ. This is multiplication. The equivocation is in the multiplication, not in the diffusion theory. Invoking the sophistication of Kimura’s ODE work to defend a three-line counting argument is a red herring. Mr. Sidles is defending Kimura on ground where Kimura is correct (diffusion theory) while the error sits on ground where the math is elementary (the cancellation of two N terms that represent different quantities).

The distinction between census N and effective N_e is not a subtlety of diffusion theory. It is visible to anyone who simply asks what the symbols mean. Mr. Sidles’s invocation of ODEs and stochastic processes does not address the actual error.

Claim 5: Numerical simulations “serve both to illustrate and to validate the key elements of Kimura’s theoretical calculations.”

Numerical simulations of the Wright-Fisher model validate Kimura’s results within the Wright-Fisher model. This is unsurprising—if you simulate a constant-size population with discrete generations, random mating, and Poisson reproduction, you will recover k = μ, because the simulation satisfies the assumptions under which the result holds.

The question is not whether Kimura’s math is internally consistent within its model. It is. The question is whether the model’s assumptions map onto biological reality. They observably do not. No natural population has constant size. No natural population of a long-lived vertebrate has discrete, non-overlapping generations. Census population systematically exceeds effective population size in every mammalian species studied.

Simulations that assume the very conditions under which the cancellation holds cannot validate the cancellation’s applicability to populations that violate those conditions. This is circular reasoning: the model is validated by simulations of the model.

Ancient DNA provides a non-circular test. When the standard model’s predictions are compared to directly observed allele frequency trajectories over thousands of years, the model fails systematically, overpredicting the rate of change by orders of magnitude. This empirical failure cannot be explained by simulation results that assume the model is correct.

Summary: Mr. Sidles’s defense reduces to three arguments: (1) many smart people reviewed the work, (2) the math uses sophisticated techniques, and (3) simulations confirm the theory. None of these address the actual error.

The error is simple: the k = μ derivation uses a single symbol for two different quantities—census population size and effective population size—and cancels them as if they were identical. They are not identical in any natural population. The cancellation fails.

The result that substitution rate is independent of population size holds only in an idealized Wright-Fisher population with constant size, and is not a general law of evolution.

Kimura’s diffusion theory is internally consistent within the Wright-Fisher framework and only within that framework. His fixation probability formula follows validly from its premises—premises that no natural population satisfies, since N_e is not constant, generations are not discrete, and census N ≠ N_e in every species studied. His contributions to population genetics are substantial.

None of this changes the fact that the k = μ derivation contains an algebraic error that has propagated through nearly sixty years of molecular evolutionary analysis.

In spite of this, Mr. Sidles took another crack at it:

Vox, your explanation is so clear and simple, that your mistake is easily recognized and corrected.

THE MISTAKE: “Step 2 (fixation probability): Each neutral mutation has fixation probability 1/(2N).”

THE CORRECTION: “Step 2 (fixation probability): Each neutral mutation IN A REPRODUCING INDIVIDUAL (emphasis mine) has fixation probability 1/(2Ne). Each neutral mutation in a non-reproducing individual has fixation probability zero (not 1/N, as Vox’s “algebraic error” analysis wrongly assumes).”

Kimura’s celebrated result “k=μ” (albeit solely for neutral mutations) now follows immediately.

For historical context, two (relatively recent) survey articles by Masatoshi Nei and colleagues are highly recommended: “Selectionism and neutralism in molecular evolution” (2005), and “The Neutral Theory of molecular evolution in the Genomic Era” (2010). In a nutshell, Kimura’s Neutral Theory raises many new questions — questions that a present are far from answered — and as Nei’s lively articles remind us:

“The longstanding controversy over selectionism versus neutralism indicates that understanding of the mechanism of evolution is fundamental in biology and that the resolution of the problem is extremely complicated. However, some of the controversies were caused by misconceptions of the problems, misinterpretations of empirical observations, faulty statistical analysis, and others.”

Nowadays “AI-amplified delusional belief-systems” should perhaps be added to Nei’s list of controversy-causes, as a fresh modern-day challenge to the reconciliation of the (traditional) Humanistic Enlightenment with (evolving) scientific understanding.

Another strikeout. He removed the non-reproducers twice, because he doesn’t understand the equation well enough to recognize that Ne already incorporates their non-reproduction, so he can’t eliminate them a second time. This is the sort of error that someone who knows the equation well enough to use it, but doesn’t actually understand what the various symbols mean is usually going to make.

Kimura remains unsalvaged. Both natural selection and neutral theory remain dead.

DISCUSS ON SG


Mailvox: the N/Ne Divergence

It’s easy to get distracted by the floundering of the critics, but those who have read and understood Probability Zero and The Frozen Gene are already beginning to make profitable use of them. For example, CN wanted to verify my falsification of Kimura’s fixation equation, so he did a study on whether N really was confirmed to be reliably different than Ne. His results are a conclusive affirmation of my assertion that the Kimura fixation equation is guaranteed to produce erroneous results and has been producing erroneous results for the last 58 years.

I’ll admit it’s rather amusing to contrast the mathematical ineptitude of the critics with readers who actually know their way around a calculator.


The purpose of this analysis is to derive a time‑averaged census population size, Navg for the human lineage and to use it as a diagnostic comparator for empirically inferred effective population size Ne.

The motivation is that Ne is commonly interpreted—explicitly or implicitly—as reflecting a long‑term or historical population size. If that interpretation is valid, then Ne should be meaningfully related to an explicit time‑average of census population size Nt. Computing Navg from known census estimates removes ambiguity about what “long‑term” means and allows a direct comparison.

Importantly, Navg is not proposed as a replacement for Ne in population‑genetic equations. It is used strictly as a bookkeeping quantity to test whether Ne corresponds to any reasonable long‑term average of census population size or not.

Definition and derivation of Navg

Let Nt denote the census population size at time t, measured backward from the present, with t=0 at present and T>0 in the past.

For any starting time ti, define the time‑averaged census population size from ti to the present as:

Because Nt is only known at discrete historical points, the integral is evaluated using a piecewise linear approximation:

  1. Select a set of times at which census population estimates are available.
  2. Linearly interpolate Nt between adjacent points.
  3. Integrate each segment exactly.
  4. Divide by the total elapsed time ti

This produces an explicit, reproducible value of Navg for each starting time ti.

Census anchors used

  • Census population sizes Nt are taken from published historical and prehistoric estimates.
  • Where a range is reported, low / mid / high scenarios are retained.
  • For periods of hominin coexistence (e.g. Neanderthals), census counts are summed to represent the total human lineage.
  • No effective sizes Ne are used in the construction of Nt.

Present is taken as 2026 CE.

Results: Navg from ti to present

All values are people.
Nti is the census size at the start time.
Navg is the time‑average from ti to 2026 CE.

Start time tiYears before presentNti (low / mid / high)Navg(ti – present) (low / mid / high)
2,000,000 BP (H. erectus)2,000,000500,000 / 600,000 / 700,0002.48 M / 2.86 M / 3.24 M
50,000 BCE (sapiens + Neanderthals)52,0262.01 M / 2.04 M / 2.07 M48.5 M / 60.6 M / 72.7 M
10,000 BCE (early Holocene)12,0265.0 M / 5.0 M / 5.0 M198 M / 250 M / 303 M
1 CE2,025170 M / 250 M / 330 M745 M / 858 M / 970 M
1800 CE226813 M / 969 M / 1.125 B2.76 B / 2.83 B / 2.90 B
1900 CE1261.55 B / 1.66 B / 1.76 B4.02 B / 4.04 B / 4.06 B
1950 CE762.50 B / 2.50 B / 2.50 B5.33 B (all cases)
2000 CE266.17 B / 6.17 B / 6.17 B7.24 B (all cases)

Interpretation for comparison with Ne

  • Navg is orders of magnitude larger than empirical human Ne, typically ~10(4) for all plausible averaging windows.
  • This remains true even when averaging over millions of years and even under conservative census assumptions.
  • Therefore, Ne cannot be interpreted as:
    • an average census size,
    • a long‑term census proxy,
    • or a time‑integrated representation of Nt

The comparison Navg > Ne holds regardless of where the averaging window begins, reinforcing the conclusion that Ne is not a demographic population size but a fitted parameter summarizing drift under complex, non‑stationary dynamics.


Kimura’s cancellation requires N = Ne. CN has shown that N ≠ Ne at every point in human history, under every averaging window, by orders of magnitude. The cancellation has never been valid. It was never a simplifying assumption that happened to be approximately true, it was always wrong, and it was always substantially wrong.

The elegance of k = μ was its selling point. Population size drops out! The substitution rate is universal! The molecular clock ticks independent of demography! It was too beautiful not to be true—except it isn’t true, because it depends on a variable identity that has never held for any sexually reproducing organism with census populations larger than its effective population. Which is all of them.

And the error doesn’t oscillate or self-correct over time. N is always larger than Ne—always, in every species, in every era. Reproductive variance, population structure, and fluctuating population size all push Ne below N. There’s no compensating mechanism that pushes Ne above N. The error is systematic and unidirectional.

Which means every molecular clock calibration built on k = μ is wrong. Every divergence time estimated from neutral substitution rates carries this error. Every paper that uses Kimura’s framework to predict expected divergence between species has been using a formula that was derived from an assumption that the author’s own model parameters demonstrate to be false.

DISCUSS ON SG


Preface to The Frozen Gene

I’m very pleased to announce that the world’s greatest living economist, Steven Keen, graciously agreed to write the preface to The Frozen Gene which will appear in the print edition. The ebook and the audiobook will be updated once the print edition is ready in a few weeks.

Evolution is a fact, as attested by the fossil record, and modern DNA research. The assertion that evolution is the product of a random process is a hypothesis, which has proven inadequate, but which continues to be the dominant paradigm promulgated by prominent evolutionary theorists.

The reason it fails, as Vox Day and Claude Athos show in this book, is time. The time that it would take for a truly random mutation process, subject only to environmental selection of those random mutations, to generate and lock in mutations that are manifest in the evolutionary complexity we see about us today, is orders of magnitude greater than the age of the Universe, let alone the age of the Earth. The gap between the hypothesis and reality is unthinkably vast…

The savage demolition that Day and Athos undertake in this book of the statistical implications of the “Blind Watchmaker” hypothesis will, I hope, finally push evolutionary biologists to abandon the random mutation hypothesis and accept that Nature does in fact make leaps.

Read the whole thing there. There is no question that Nature makes leaps. The question, of course, is who or what is the ringmaster?

It definitely isn’t natural selection.

DISCUSS ON SG