Why we standardise using training statistics when doing Machine Learning

This post explains why new data passed to a Machine Learning model should be preprocessed using statistics calculated from the training data, rather than from the new data itself.

February 4, 2022 · 6 min · Ed

Generating fake personal data in Python with faker

This post gives a quick example of using the faker package in Python to generate fake customer data.

December 11, 2021 · 3 min · Ed