Need to standardise on words like the "recipes", what's a single "run", etc. Maybe a `run` should be a single execution of `Workload.run()` against a specific `Dataset`.