This repository was archived by the owner on Nov 15, 2023. It is now read-only.

[benchmark-cli] Wrong proof_size calculation leading to exaggerated weights #13765

@agryaznov

Description

This was found while investigating a 5x-inflated proof_size weight for one of the pallet_contracts API functions.
This bug was introduced in #11637.

Context

Benchmark results are processed by writer::process_storage_results() in the following way.

It loops through all the storage keys of all results:

for result in results.iter().rev() {
    for (key, reads, writes, whitelisted) in &result.keys {

and duplicates each benchmark result for every key, adjusting each copy's proof_size per key, since it depends on:

  • the PoV estimation mode
  • the single-read PoV overhead
  • the number of reads

// Add the additional trie layer overhead for every new prefix.
if *reads > 0 {
    prefix_result.proof_size += 15 * 33 * additional_trie_layers as u32;
}
storage_per_prefix.entry(prefix.clone()).or_default().push(prefix_result);
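
Putting the two snippets together, a minimal self-contained sketch of this grouping (with simplified, hypothetical types, not the actual writer.rs code) looks like this; note how the whole-run proof_size is cloned once per touched prefix, so storage_per_prefix ends up holding a near-identical copy of it under every prefix:

use std::collections::HashMap;

#[derive(Clone)]
struct BenchmarkResult {
    proof_size: u32,
    // (prefix, reads, writes, whitelisted), mirroring `result.keys` above
    keys: Vec<(Vec<u8>, u32, u32, bool)>,
}

// Group the whole-run results per storage prefix, as process_storage_results() does.
fn group_per_prefix(
    results: &[BenchmarkResult],
    additional_trie_layers: u8,
) -> HashMap<Vec<u8>, Vec<BenchmarkResult>> {
    let mut storage_per_prefix: HashMap<Vec<u8>, Vec<BenchmarkResult>> = HashMap::new();
    for result in results.iter().rev() {
        for (prefix, reads, _writes, _whitelisted) in &result.keys {
            // The proof_size measured for the *entire run* is cloned for this prefix...
            let mut prefix_result = result.clone();
            // ...and then bumped by the single-key trie-layer overhead.
            if *reads > 0 {
                prefix_result.proof_size += 15 * 33 * additional_trie_layers as u32;
            }
            storage_per_prefix
                .entry(prefix.clone())
                .or_default()
                .push(prefix_result);
        }
    }
    storage_per_prefix
}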

The result is the storage_per_prefix data, which was originally used only to generate comments listing the storage keys touched during each benchmark.

The Bug

In PR #11637 this data started to be used to calculate the resulting proof_size formula written into weights.rs, but in a wrong way:

Step 1 (almost right). We find an average base proof_size value and a slope for each component by running a regression analysis over the benchmark results collected for that prefix in storage_per_prefix (see above).

let proof_size_per_components = storage_per_prefix
    .iter()
    .map(|(prefix, results)| {
        let proof_size = analysis_function(results, BenchmarkSelector::ProofSize)
            .expect("analysis function should return proof sizes for valid inputs");

(This is only almost right, because the values we run the regression on are not per-key benchmark results: they originate from benchmark results covering all the keys and were then adjusted for a single key; see below for details.)
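
For reference, the regression here conceptually fits proof_size ≈ base + slope * component over the results collected for one prefix. A standalone ordinary-least-squares sketch (hypothetical, not the benchmarking analysis module):

// Fit proof_size ≈ base + slope * component_value over the
// (component_value, proof_size) pairs collected for one prefix.
fn fit_line(points: &[(f64, f64)]) -> (f64, f64) {
    let n = points.len() as f64;
    let mean_x = points.iter().map(|(x, _)| x).sum::<f64>() / n;
    let mean_y = points.iter().map(|(_, y)| y).sum::<f64>() / n;
    let cov: f64 = points.iter().map(|(x, y)| (x - mean_x) * (y - mean_y)).sum();
    let var: f64 = points.iter().map(|(x, _)| (x - mean_x).powi(2)).sum();
    let slope = if var == 0.0 { 0.0 } else { cov / var };
    let base = mean_y - slope * mean_x;
    // Returns (base, slope) for this prefix.
    (base, slope)
}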

Step 2 (wrong). The resulting base_calculated_proof_size and component_calculated_proof_size values are calculated as a simple sum of these per-prefix values over all prefixes.

for (_, slope, base) in proof_size_per_components.iter() {
    base_calculated_proof_size += base;
    for component in slope.iter() {
        let mut found = false;
        for used_component in used_calculated_proof_size.iter_mut() {
            if used_component.name == component.name {
                used_component.slope += component.slope;

But wait a minute. To recap, the path of these proof_size values is:

  1. They were first calculated over all keys in each benchmark run.
  2. Then we duplicated them for each key, adjusting each copy by the read overhead of that key
    (which is odd, because we add the PoV overhead of a single key to the proof_size measured for all the keys).
  3. Then we summed up those duplicated values.

And this is wrong: it leads to exaggerated weights once they are written into weights.rs:

{{#each benchmark.component_calculated_proof_size as |cp|}}
.saturating_add(Weight::from_parts(0, {{cp.slope}}).saturating_mul({{cp.name}}.into()))
{{/each}}
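
To put made-up numbers on it: suppose a benchmark run touches 3 storage prefixes and measures a total proof_size of about 1000 bytes. Each prefix's regression base then comes out at roughly 1000 (plus the per-key trie-layer overhead), and summing the three per-prefix bases reports about 3000 bytes, roughly 3x the real value:

fn main() {
    // Made-up numbers, only to illustrate the over-counting:
    let per_run_proof_size: u64 = 1_000; // proof_size measured for the whole run
    let touched_prefixes: u64 = 3;       // each prefix's regression base is ~per_run_proof_size
    let summed_base = per_run_proof_size * touched_prefixes;
    assert_eq!(summed_base, 3_000);      // ~3x what a single run actually needs
}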

Suggested Fix

Instead of a simple sum, base_calculated_proof_size and component_calculated_proof_size should be calculated as an average or a median across all keys.
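
A minimal sketch of that direction (hypothetical helper, not a patch to the actual writer.rs):

// Combine the per-prefix regression bases with a mean or a median instead of a
// plain sum, so the reported base reflects one run rather than one run
// multiplied by the number of touched prefixes.
fn combine_bases(mut per_prefix_bases: Vec<u32>) -> (u32, u32) {
    assert!(!per_prefix_bases.is_empty(), "need at least one prefix");
    let mean = (per_prefix_bases.iter().map(|b| *b as u64).sum::<u64>()
        / per_prefix_bases.len() as u64) as u32;
    per_prefix_bases.sort_unstable();
    let median = per_prefix_bases[per_prefix_bases.len() / 2];
    (mean, median)
}

A median would be more robust against a single prefix with an unusually large overhead; a mean is simpler and matches the "average base proof_size" wording from Step 1. The same treatment would apply to the per-component slopes.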
