As a pretext for writing a meaningful blog post, I wanted to go back to one of my recent little hacks.
There was a race, MalmöMilen, which took place on June 14, 2014. But before actually going to the race, I wanted to know how the participants were distributed across the different start groups.
I had registered in the 50-55 minutes start group, but I knew I would finish in under 44 minutes (yes, that was a stupid choice). The question then was how many runners (roughly) would start ahead of me, since I would have to overtake them.
The simple solution was to use a Jupyter Notebook and do some data scraping, using the data directly from their website. The code (pretty short and simple) and the graph follow.
This is how we get the number of participants and group them by start time.
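The original notebook cell is not reproduced here, so what follows is only a minimal sketch of the idea. It assumes the start list is an HTML table with one row per runner and the start group in the second column. The sample markup, the group labels, and the names `StartListParser` and `count_by_start_group` are all made up for illustration; the real page layout may well differ.

```python
from collections import Counter
from html.parser import HTMLParser

# Hypothetical markup standing in for the downloaded start list page.
SAMPLE_HTML = """
<table>
  <tr><td>Runner A</td><td>40-45 min</td></tr>
  <tr><td>Runner B</td><td>45-50 min</td></tr>
  <tr><td>Runner C</td><td>45-50 min</td></tr>
  <tr><td>Runner D</td><td>50-55 min</td></tr>
</table>
"""


class StartListParser(HTMLParser):
    """Collect the text of every <td> cell, grouped by table row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td" and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_endtag(self, tag):
        if tag == "td" and self._in_cell:
            # Trim whitespace once the whole cell has been read.
            self._row[-1] = self._row[-1].strip()
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None

    def handle_data(self, data):
        if self._in_cell:
            self._row[-1] += data


def count_by_start_group(html, group_column=1):
    """Return a Counter mapping start group label -> number of runners."""
    parser = StartListParser()
    parser.feed(html)
    return Counter(
        row[group_column] for row in parser.rows if len(row) > group_column
    )


counts = count_by_start_group(SAMPLE_HTML)
print(counts)
```

With the real page, the HTML string would come from an HTTP request instead of `SAMPLE_HTML`, and `group_column` would need to match wherever the start group actually sits in the table.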
The following illustrates how to visualize the distribution on a nice graph.
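As a rough sketch of that step (not the original cell), a bar chart with matplotlib could look like this. The counts below are placeholder numbers, not the real MalmöMilen figures, and `plot_start_groups` is a name chosen for the example. It also assumes the group labels sort correctly as plain strings.

```python
import matplotlib.pyplot as plt

# Placeholder counts per start group, for illustration only.
counts = {"40-45 min": 1, "45-50 min": 2, "50-55 min": 1}


def plot_start_groups(counts):
    """Draw one bar per start group and return the matplotlib Axes."""
    groups = sorted(counts)  # assumes labels sort correctly as strings
    sizes = [counts[g] for g in groups]
    positions = list(range(len(groups)))

    fig, ax = plt.subplots()
    ax.bar(positions, sizes)
    ax.set_xticks(positions)
    ax.set_xticklabels(groups, rotation=45, ha="right")
    ax.set_xlabel("Start group")
    ax.set_ylabel("Number of runners")
    ax.set_title("Runners per start group")
    fig.tight_layout()
    return ax


ax = plot_start_groups(counts)
```

In a notebook with inline plotting the figure shows up right under the cell; in a plain script, a call to `plt.show()` would be needed.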
Which plots the following graph:
Now that the race is finished and results are out, I really want to play with the available data and make it say something.