Analytics engineering case study

Three F1 questions, one pipeline

A portfolio project about turning messy, biased sports data into fairer comparisons, reproducible data products, and long-form visual stories that a non-technical reader can still follow.

Methodology  ·  Source on GitHub  ·  Data: Jolpica–F1 (1950–present) & FastF1 (2018–present)

01

Who's really the GreatestOfAllTime?

Start with the hardest comparison in the sport: driver ability across eras. The model only compares teammates, because same team, same car, same weekend is the cleanest fairness constraint Formula 1 offers.

02

OK, but who's the LuckiestOfAllTime?

Talent is only half the story. The companion question is who benefited most from circumstance: reliability luck, inherited positions, and safety-car timing, each measured against the other side of the same garage.

03

Has F1 gotten boring?

The history chapter. Have regulation resets ever changed who won? How did reliability transform the sport? Has this cleaner, safer era made races too predictable?

What this case study demonstrates

The site is designed to show judgment as much as implementation: fair comparisons, reproducible data products, and methodology that stays visible instead of hiding behind the charts.

Judgement calls and tradeoffs are written up on the methodology page — because the modelling choices matter at least as much as the final leaderboard.