This repository was archived by the owner on Nov 15, 2023. It is now read-only.

Conversation

@gui1117
Contributor

@gui1117 gui1117 commented Nov 22, 2019

DONE:

  • a new trait StoragePrefixedMap is introduced and implemented for map, linked_map and double_map. It allows enumeration and removal of all keys under the map's prefix.
  • ChangeSetOverlay is now using BTreeMap instead of HashMap.
  • InMemory backend is now using BTreeMap instead of HashMap.
  • sr-io has a new function next_key(key) -> Option<key> returning the next key in storage.
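For illustration, the semantics of next_key can be modeled on a plain BTreeMap (a sketch, not the actual sr-io implementation): it returns the first key strictly greater than the one given, which is what makes repeated calls enumerate the storage.

```rust
use std::collections::BTreeMap;
use std::ops::Bound::{Excluded, Unbounded};

// Sketch of `next_key` semantics over an in-memory BTreeMap:
// return the first key strictly greater than `key`, if any.
fn next_key(storage: &BTreeMap<Vec<u8>, Vec<u8>>, key: &[u8]) -> Option<Vec<u8>> {
    storage
        .range::<[u8], _>((Excluded(key), Unbounded))
        .next()
        .map(|(k, _)| k.clone())
}

fn main() {
    let mut storage = BTreeMap::new();
    storage.insert(b"balance:alice".to_vec(), vec![1u8]);
    storage.insert(b"balance:bob".to_vec(), vec![2u8]);

    // Enumerate every key by repeatedly asking for the next one.
    assert_eq!(next_key(&storage, b"balance:"), Some(b"balance:alice".to_vec()));
    assert_eq!(next_key(&storage, b"balance:alice"), Some(b"balance:bob".to_vec()));
    assert_eq!(next_key(&storage, b"balance:bob"), None);
    println!("ok");
}
```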

TODO:

  • cache the backend next_key in Ext:
    every time next_key is called we fetch the next key in the overlay and the next key in the backend, but the backend doesn't implement any cache for key iteration. A quick cache could be implemented in the Ext implementation of Externalities, but if a user iterates over two structures at the same time this cache is useless. A proper cache should be implemented in a lower layer.

  • the sr-io API could include the suffix, like next_key_with_suffix(key, suffix) -> Option<key>, so that if we have to implement a cache we could limit the number of keys cached in advance.

    note: in the current architecture, with BTreeMap for the changeset, this is not needed.
    note: if we have to change this overlay in the future, we can still provide a new call with this suffix as an optimized variant.

  • benchmark to see if BTreeMap is OK; otherwise we can try another solution with some cache on the HashMap. Some benchmarks between BTreeMap and HashMap are in #4185 (comment)

Fix #3883

@rphmeier rphmeier added the A3-in_progress Pull request is in progress. No review needed at this stage. label Nov 22, 2019
@gui1117
Contributor Author

gui1117 commented Nov 22, 2019

Iteration on overlay by changing HashMap to BTreeMap:

Early benchmark between BTreeMap and HashMap:
each map is filled with random keys of 32 bytes; we then remove 100 existing keys, insert 100 random keys, or get 100 existing keys. The results show:

if the maps contain fewer than 10_000 key/values, performance is about the same.
if the maps contain 100_000 key/values, BTreeMap is almost 2 times slower than HashMap.
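A minimal version of such a comparison can be sketched as follows (hypothetical code, not the attached benchmark.txt; the key generator is a deterministic stand-in for random 32-byte keys, and actual timings depend on the machine, so none are asserted):

```rust
use std::collections::{BTreeMap, HashMap};
use std::time::Instant;

// Deterministic stand-in for the random 32-byte keys described above.
fn key(i: u64) -> Vec<u8> {
    let mut k = vec![0u8; 32];
    k[..8].copy_from_slice(&i.wrapping_mul(0x9E37_79B9_7F4A_7C15).to_be_bytes());
    k
}

fn main() {
    const N: u64 = 100_000;
    let btree: BTreeMap<Vec<u8>, u64> = (0..N).map(|i| (key(i), i)).collect();
    let hash: HashMap<Vec<u8>, u64> = (0..N).map(|i| (key(i), i)).collect();

    // Time 100 gets of existing keys in each map.
    let t = Instant::now();
    for i in 0..100 {
        assert!(btree.get(&key(i)).is_some());
    }
    println!("BTreeMap, 100 gets: {:?}", t.elapsed());

    let t = Instant::now();
    for i in 0..100 {
        assert!(hash.get(&key(i)).is_some());
    }
    println!("HashMap, 100 gets: {:?}", t.elapsed());
}
```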

benchmark.txt
result.txt

Iteration on overlay with cache.

If ext_call specifies the range on which it will iterate

for example get_in_range(from_key, until_key, number_of_keys_to_return) -> Vec<Key>,
then we can cache all the keys from the last returned one (excluded) up to until_key. Then, if the cached range is touched (e.g. a key is inserted or removed), we can either reduce the cache to the clean part (but this cache-reducing operation then happens on every write), or simply remove the cache entirely when it is touched.

If ext_call doesn't specify the range on which it will iterate

then the only way to make a useful cache is to reduce the cache to the non-dirty part whenever the storage gets dirty.
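One way to picture this "keep only the clean part" policy (purely hypothetical types, not code from this PR): hold a queue of prefetched keys in ascending order and, on any write, drop every cached key at or after the written key, since only those can have become stale.

```rust
use std::collections::VecDeque;

// Hypothetical next-key cache: holds prefetched keys in ascending order.
struct NextKeyCache {
    keys: VecDeque<Vec<u8>>,
}

impl NextKeyCache {
    fn new(prefetched: Vec<Vec<u8>>) -> Self {
        Self { keys: prefetched.into() }
    }

    // Serve the next cached key, if any.
    fn next(&mut self) -> Option<Vec<u8>> {
        self.keys.pop_front()
    }

    // A key was inserted or removed: keep only cached keys strictly
    // before it, since anything at or after it may now be wrong.
    fn on_write(&mut self, written: &[u8]) {
        while matches!(self.keys.back(), Some(k) if k.as_slice() >= written) {
            self.keys.pop_back();
        }
    }
}

fn main() {
    let mut cache = NextKeyCache::new(vec![b"a".to_vec(), b"c".to_vec(), b"e".to_vec()]);
    cache.on_write(b"d"); // invalidates "e", keeps "a" and "c"
    assert_eq!(cache.next(), Some(b"a".to_vec()));
    assert_eq!(cache.next(), Some(b"c".to_vec()));
    assert_eq!(cache.next(), None);
    println!("ok");
}
```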

@gui1117
Contributor Author

gui1117 commented Nov 26, 2019

implementation is almost complete; still to do:

  • optimisation of next_key in the overlay.
  • storage cache in the client.
  • implement next_key for child storage (easy).

@xlc
Contributor

xlc commented Nov 27, 2019

How likely is this to be merged within the next two weeks? Asking because I am writing materials for my Substrate course, and if this lands before the course starts, I can drop the linked_map part and teach prefixed maps instead.

@gui1117 gui1117 changed the title Introduce prefixed storage without enumeration Introduce prefixed storage with enumeration Nov 27, 2019
@gui1117 gui1117 marked this pull request as ready for review November 29, 2019 17:22
@gui1117
Contributor Author

gui1117 commented Nov 29, 2019

How likely is this to be merged within the next two weeks? Asking because I am writing materials for my Substrate course, and if this lands before the course starts, I can drop the linked_map part and teach prefixed maps instead.

The PRs waiting for this are no longer waiting, but it still seems a very useful feature. About the deadline: it mostly depends on whether performance is acceptable when switching from HashMap to BTreeMap, or whether we need a better, different cache while still using HashMap.

@gavofyork
Member

it's acceptable, though we might want to revert to hashmap and add a cache at a later date if real-world usage dictates. as long as the API is fixed and we can change things as desired later, then we're good.

@gui1117
Contributor Author

gui1117 commented Dec 3, 2019

yes, understood.

hmm, then maybe it's worth making the sr-io API next_key_with_prefix(next_key: &[u8], prefix: &[u8]), so that if we have to implement a cache the runtime can provide the prefix it wants to iterate on, avoiding a big cache when it isn't needed.
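The intent can be sketched over a plain BTreeMap (the name next_key_with_prefix and its exact signature are the proposal above, not a merged API): the prefix bounds the iteration, so any cache would only ever need to hold keys under that prefix.

```rust
use std::collections::BTreeMap;
use std::ops::Bound::{Excluded, Unbounded};

// Sketch: next key strictly after `key` that still starts with `prefix`.
fn next_key_with_prefix(
    storage: &BTreeMap<Vec<u8>, Vec<u8>>,
    key: &[u8],
    prefix: &[u8],
) -> Option<Vec<u8>> {
    storage
        .range::<[u8], _>((Excluded(key), Unbounded))
        .next()
        .map(|(k, _)| k.clone())
        .filter(|k| k.starts_with(prefix))
}

fn main() {
    let mut storage = BTreeMap::new();
    storage.insert(b"map1:a".to_vec(), vec![1u8]);
    storage.insert(b"map1:b".to_vec(), vec![2u8]);
    storage.insert(b"map2:a".to_vec(), vec![3u8]);

    // Iterating under "map1:" stops before reaching "map2:a".
    assert_eq!(next_key_with_prefix(&storage, b"map1:a", b"map1:"), Some(b"map1:b".to_vec()));
    assert_eq!(next_key_with_prefix(&storage, b"map1:b", b"map1:"), None);
    println!("ok");
}
```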

EDIT: top description updated including this thought

@gui1117 gui1117 added A0-please_review Pull request needs code review. and removed A3-in_progress Pull request is in progress. No review needed at this stage. labels Dec 4, 2019
@gavofyork gavofyork added A6-seemsok and removed A0-please_review Pull request needs code review. labels Dec 5, 2019
@gavofyork
Member

Shortly after this goes in, we will want to consider adding to storage maps the possibility of prefixing the unhashed key to the value, thereby allowing enumeration over all keys from both the runtime and the UI. Right now, to do that, we only really have the option of using the twox(key)++key hasher, which isn't especially secure against unbalanced-trie griefing.

@gavofyork
Member

anyone else want to sign this off before merge?

@bkchr
Member

bkchr commented Dec 6, 2019

I will review it now.

// The key just after the one given in input, basically `key++0`.
// Note: We are sure this is the next key if:
// * size of key has no limit (i.e. we can always add 0 to the path),
// * and no keys can be inserted between `key` and `key++0` (this is ensured by sr-io).
Member


What does this mean and how is that ensured?

Contributor Author

@gui1117 gui1117 Dec 9, 2019


We don't want to iterate from the key included but from the key excluded, and the trie crate only provides a seek function.

So we compute the next potential key, by doing next_potential_key = key++0.

But this is only true if:

  • we can indeed append 0 to the key,
  • and there is no key in the trie between key and key++0.

The first condition requires that key size has no limit, or that the runtime is not supposed to write anything near this limit; the second is ensured because sr-io doesn't provide any way to write between key and key++0, even in a 16-trie.
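The computation under discussion is small enough to write out (a sketch mirroring the comment above, not new API): key++0 is the smallest byte string strictly greater than key, provided nothing can be stored between them.

```rust
// `key ++ 0`: the smallest byte string strictly greater than `key`,
// assuming keys are byte-addressed and key length is unbounded.
fn potential_next_key(key: &[u8]) -> Vec<u8> {
    let mut next = Vec::with_capacity(key.len() + 1);
    next.extend_from_slice(key);
    next.push(0);
    next
}

fn main() {
    let key = b"abc";
    let next = potential_next_key(key);
    assert_eq!(next, b"abc\0".to_vec());
    // Lexicographically, `key` < `key ++ 0`, and no byte-addressed key fits between.
    assert!(&key[..] < &next[..]);
    println!("ok");
}
```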

Maybe @cheme you can confirm this ?

But maybe it is better not to make those assumptions: seek from the key, get the next, and if the next is the current one, get the one after; or implement some new features in trie.

Contributor Author


that would result in something like this:

diff --git a/primitives/state-machine/src/trie_backend_essence.rs b/primitives/state-machine/src/trie_backend_essence.rs
index aea0193dd..1c8bafb09 100644
--- a/primitives/state-machine/src/trie_backend_essence.rs
+++ b/primitives/state-machine/src/trie_backend_essence.rs
@@ -110,23 +110,29 @@ impl<S: TrieBackendStorage<H>, H: Hasher> TrieBackendEssence<S, H> where H::Out:
                let mut iter = trie.iter()
                        .map_err(|e| format!("TrieDB iteration error: {}", e))?;
 
-               // The key just after the one given in input, basically `key++0`.
-               // Note: We are sure this is the next key if:
-               // * size of key has no limit (i.e. we can always add 0 to the path),
-               // * and no keys can be inserted between `key` and `key++0` (this is ensured by sr-io).
-               let mut potential_next_key = Vec::with_capacity(key.len() + 1);
-               potential_next_key.extend_from_slice(key);
-               potential_next_key.push(0);
-
-               iter.seek(&potential_next_key)
+               iter.seek(&key)
                        .map_err(|e| format!("TrieDB iterator seek error: {}", e))?;
 
                let next_element = iter.next();
 
                let next_key = if let Some(next_element) = next_element {
-                       let (next_key, _) = next_element
-                               .map_err(|e| format!("TrieDB iterator next error: {}", e))?;
-                       Some(next_key)
+                       let next_key = next_element
+                               .map_err(|e| format!("TrieDB iterator next error: {}", e))?.0;
+
+                       if next_key != key {
+                               Some(next_key)
+                       // If next key is same as key iter to second next element.
+                       } else {
+                               let next_element = iter.next();
+                               if let Some(next_element) = next_element {
+                                       let next_key = next_element
+                                               .map_err(|e| format!("TrieDB iterator next error: {}", e))?.0;
+
+                                       Some(next_key)
+                               } else {
+                                       None
+                               }
+                       }
                } else {
                        None
                };

Contributor


Yes, the fact that the api only allows writing key/values at byte addresses is important here (otherwise we would need to work with nibbles (half bytes, for a 16-trie) and append a 0 nibble).
I am not sure skipping the first iteration is very important here (we need to query the node anyway during the seek call).
The relevant perf gain would come from reusing the same iterator between calls (to skip the 'seek' call on every 'next'), but that is not an easy todo.

Contributor Author


Should we not rely on "the fact that the api only allows writing key/values at byte addresses", this property could still be leveraged in the future.

@shawntabrizi
Member

shawntabrizi commented Dec 9, 2019

Is it correct that after this PR is merged, linked_map could be entirely replaced with map?

@gui1117
Contributor Author

gui1117 commented Dec 9, 2019

Is it correct that after this PR is merged, linked_map is obsolete?

To make it "really" obsolete, we need to provide an easy way to make maps store their key alongside the value, as Gav says here: #4185 (comment)

@bkchr
Member

bkchr commented Dec 9, 2019

Is it correct that after this PR is merged, linked_map could be entirely replaced with map?

Also, maps are not ordered the way linked maps are.

@bkchr bkchr merged commit 6b70f12 into master Dec 9, 2019
@bkchr bkchr deleted the gui-prefixed-map-2 branch December 9, 2019 19:55
@gui1117 gui1117 mentioned this pull request Jan 2, 2020
cheme added a commit to cheme/substrate that referenced this pull request Oct 9, 2020


Development

Successfully merging this pull request may close these issues.

Extend Externalities with for_keys_with_prefix instead of just clear_prefix
