top of page

Ian Williams - Another new Race Equivalency Calculator

Ian Williams: An Updated Race Equivalency Calculator Attempt

Ian Williams: An updated formula for marathon-running success

This is another attempt by someone in trying to optimize the marathon race equivalency calculation portion. There's little doubt that the 5k, 10k, and HM relationships are strong. But the M sits atop the mountain with a difficulty unlike the others in properly estimating finish time from other race distances previously completed. Why is it even important to have a good race equivalency going into a race day? Well, running a marathon can literally come down to a few seconds per mile vs best performance and literal blow-up. It all comes down to the physiological difference between the M and the other races. Once you pass a certain threshold the ticking time bomb that is pace will starting counting down. And unless you pace perfectly, things can go haywire quick. So a good race equivalency or honest assessment of race day goal pace can be extremely beneficial. The classic formula used in most online calculators is Peter Rigel's formula:

M = HM x 2^1.06

Which means your M is 2.08 times slower than your HM.

I've previously reviewed a new-age calculation from Vickers (LINK TO COME).

So let's dive right into Ian Williams attempt at adjusting the classic marathon race equivalency calculator.

Sample size - 1071 different HM to M relationships. Good, but about half the size of Vickers data set (although Vickers used 5k, 10k, and HM performances). Williams did cut the data set to runners who had completed at least 5 HM and Ms, thus more experienced runners who knew what they were getting themselves into.

Sample collection - An internet "logging system" open to anyone using fetcheveryone.com to find participants. The article does not speak to potential issues of representativeness and selection bias. I'm not terribly concerned about the selection bias. There is literally no data as to whether this data set resembles a normal population set (male/female/age/training history/representative finishing times, etc.). I have reason to believe that the majority of William's data set is from runners at 2:00 half marathon or less (based on the displayed data and groups he chooses to display). The male median time in this study was UNK for the marathon versus 4:11 for NYC marathon, and 4:16 for Running in the USA. The female median time in this study was UNK, 4:38 in NYC, and 4:41 in Running in the USA. So my best guess on what I can surmise from the data set is that while the median national time is close to 4:16-4:41 in the US, very little of this data set (if at all) was based on runners around or slower than the national average.

I can't tell initially from the article whether the data is logged daily or just once at the end. That would call into question the chance for error. More measurements would reduce the chance for error. If you've got the entire data set (like a Strava history), then everything is there. But if the dataset Williams used relied solely on self-reporting, then it could make for a much higher chance for error.

Also, I can't tell if this is recent HM vs recent M. Or if it is PR HM vs recent M.

Alight, so let's dive in!

As previously stated, Rigel is:

M = HM x 2^R

where R=1.06

Williams sets out to redefine R with a new value that makes the calculator more accurate for more people.

Williams starts by using his dataset of 1071 runners to define the relationship between their HM and M performances.


The very first thing that sticks out to me - no y-axis defined. What exactly am I looking at here? It would appear to be a histogram or distribution plot of the relationship of the 1071 runners HM to M. 1.06 represents the current Rigel. Williams proposed 1.15 is a better R value since it falls further towards the middle. I would not deny that either based on the graph. It certainly appears the 1.15 falls much closer to the middle than 1.06. And if being conservative on pacing for the marathon is an important variable (which I believe it is), then being on the slower side for predicting won't prevent a great marathon performance (because you can negative split the back half of the race). But I wasn't satisfied having no y-xais. So I made one for him:


I actually used photoshop to measure the height of each of his bars. Then I assumed this shown data set represented the whole 1071 runners. Which may or may not be the case. I don't believe anyone is faster than 1.01, but slower than 1.30 is certainly possible. Although, I certainly don't know. I feel relatively confident because the total height of the bars added together was 49.79 or very very close to a whole number of 50. That means I could calculate the number of runners per bar:


So when I look back at the 1.01 bar, it really represents 0.25% of the population or a guess of 2.7 runners. Makes sense. Only 3 runners out of 1071 were able to hit a 1.01 R value. So, does my data extraction work? Well Williams states in the article that less than 5% of the runners had a R of 1.06. His other linked article says 49 total runners at 1.06 or less. That jives closely with what I've got. Remember mine are in bars of 1.06. But that probably really means 1.055 to 1.064. So the numbers will be off slightly, but not terribly. So keep in mind when the data set talks about runners at exactly 1.06, it's really only talking about 29 total runners. A much much much smaller data set suddenly.

But what does that mean in actual time conversions?


So for example, someone with a R of 1.01 with a HM time of 2:00:00 was able to run a M in 4:01:40. For someone after 5 HM/Ms to run a virtual identical pace between their HM and M is astounding. Almost too astounding... That brings up another question about the dataset. The relationship between HM and M can't be viewed under a microscope. There are variables of race day that matter so much for performance. Race crowding, elevation, and weather just to name a few. If someone is running a uphill HM in hot weather in 2:00:00 and then a downhill cold weather M in 4:01:40, then the data starts making more sense. Regardless, it's another reason to cast question on this. Vickers did a better job attempting to correct this. So since Vickers is such a great guy and released his dataset to the public we can map Vickers dataset in the same manner as Williams. Vickers has a total of 862 runners in his dataset (including what I believe is a slower median population meaning it is more representative of the US population of marathon runners) that have matching HM and M condition races (and if not matching than an adjustment was used).


Hooray! I'd say for the most part the datasets follow a similar trend. Not the same, but similar.

So the initial conclusion was 1.15 is a better predictor R for HM to M than is 1.06. It does split the middle of the data set (with 47% on both sides). So better. Williams dataset says the midpoint is 1.15 with a 25-75% range of 1.10 to 1.19 and Vickers dataset says the midpoint is 1.13 with a 25-75% range of 1.09-1.17.

So for a 2:00 HM runner, what does that mean?

Rigel - traditional calculator (1.06) = M of 4:10:12 Williams - 1.15 = M of 4:26:18 (range of 4:17-4:33) Vickers - 1.13 = M of 4:22:38 (range of 4:15-4:30)

Since you are likely to see a better performance in the marathon with a conservative start, this new value of around 1.13-1.15 looks good to me. Slower is better at the beginning so you can leave some room for error in the second half of the race. Go out too fast in the beginning and the risk of blowing up is much much higher.

The problems start to arise when he starts to parce the data apart to make other conclusions about training in general that leads to performance.

Does gender matter?

Matches what I've read before. Women are better pacers during a marathon (more even/negative splits and less positive splits, (or faster at the end)), hypothesized that women are better at burning fat then men, and hypothesized that women are better at dissipating heat than men. So if a woman and a man have equal HM times going into the M, the woman is more often than not going to beat the man.

So I agree with the conclusion.

Are faster runners better?


The bottom grey line represents the top 10% of runners with that HM time in each subset of data. So Williams pieced apart the dataset into secondary pools with HM times of 1:20, 1:25, 1:30, 1:35, 1:40, etc. Given the relative smoothness of the line we can tell this is the case. Remembering back, there are only 67 total runners with a 1.06 or less in the dataset of 1071. There are only 256 with a 1.10 or less. There appear to be 9 subsets of data. As would make sense, there are likely fewer runners in the dataset at 1:20-1:30, then there is at 1:50-2:00 (if this dataset is anything like a normal population of HM runners). So the data at the beginning of the line is probably based off very few runners.

The first thing that jumps out to me is that the relationship between HM time and R (for M) is pretty equal for the top 10% across all HM times. A 1:20 10% runner is around 1.06, but so is a 1:55 runner. And the difference between the two is quite small anywhere in-between.

So the variation of the mean is not coming from the top 10% becoming worse converters, but the bottom portion of the population as the HM time slows are getting worse at being converters. So the question would follow, what are the top 10% runners doing that are all near 1.06 across all HM times that the bottom 10% are not? Seems to suggest that regardless of HM time you can be a good converter if you're doing the right things in training. And those in the slower HM times tend to have more runners doing the wrong thing in training (hence bad converters).

What about training mileage?


So per Williams this graph is the "typical" amount of miles run by experienced marathon runners (not their first) going for a PR marathon attempt. This does not have to be the same dataset he used to create the previous graph, but rather a measuring stick he created. So this original dataset doesn't have to be correlated with success in any way or being a good converter.

So the graph on the surface tells a story that most of us know. The people with faster marathon finishing times run more miles. But you know me, I don't like to look at miles, I like duration. So if I were to standardize these mileages across each subset by either Marathon Pace or EB pace (which tends to be the average pace I schedule runners at or 1.12 times slower than MP), then what does the dataset look like?


A 2:20 runner runs 1200 miles in 16 weeks. The MP of 2:20 is a 5:21 min/mile. If the 2:20 runner were to average MP for the 16 weeks of training, then they would do 6:40 hours of training per week (or 106 hours total). If we instead used EB, then the 2:20 runner averages 7:28 hours per week. The 2:20 is clearly the outlier, because look at the other subsets of data. The 2:40, 3:00, 3:20, 3:40, 4:00, 4:20, and 4:40 all run about 5:00 hours (if at MP) or 5:30 hours (if at EB) per week. So on the surface the 2:40 to 4:40 runners would appear different, but when taking into account their relative training pace, they're all actually very similar. This comes down to training load and why I like to evaluate training plans by time moreso than mileage. Two runners doing 80% of training at easy with 9 hours of total running per week will be reaping similar training benefits regardless if one runs a 2:20 M and the other a 4:40 M.

For reference, the marathon training plans I write tend to be in the 7-8 hours average range for 16 weeks. So my plans are like the outliers in the 2:20 M time group.