Skip to content
This repository was archived by the owner on Nov 15, 2023. It is now read-only.
Merged
Show file tree
Hide file tree
Changes from 6 commits
Commits
Show all changes
25 commits
Select commit Hold shift + click to select a range
2741554
Add Arch Linux installation instructions
cmichi Jan 14, 2019
e95140a
Enable tracing heap size
gavofyork Jan 11, 2019
d2a7935
Extract heap
cmichi Jan 12, 2019
aff276a
Replace linear allocator with buddy allocator
cmichi Jan 14, 2019
292c177
Fix test
cmichi Jan 16, 2019
1d71e15
Get rid of memcpy in to_vec()
cmichi Jan 16, 2019
b65f0e4
fixup: Style and comments
cmichi Jan 17, 2019
02a3d1e
fixup: Split Linux instructions by distribution
cmichi Jan 17, 2019
dfa62a8
fixup: Remove unnecessary types and code
cmichi Jan 17, 2019
41e4ab7
fixup: Make Pointers start from 1, remove some panics, code improvements
cmichi Jan 17, 2019
53bf828
fixup: Return 0 on errors
cmichi Jan 17, 2019
b189736
fixup: Move loop to separate function
cmichi Jan 17, 2019
18a1efc
fixup: Use FnvHashMap instead of HashMap
cmichi Jan 17, 2019
137b5bd
fixup: Fix error handling
cmichi Jan 17, 2019
741e972
fixup: Use current_size() instead of used_size()
cmichi Jan 17, 2019
93dc2e9
fixup: Fix and document allocation offset
cmichi Jan 17, 2019
d4676b0
fixup: Remove unnecessary multiplication
cmichi Jan 17, 2019
6a99426
fixup: Fix comments
cmichi Jan 17, 2019
6725a49
fixup: Remove Arch installation instructions
cmichi Jan 17, 2019
6438db3
Revert "Fix test"
cmichi Jan 17, 2019
ecd9731
fixup: Remove unused code, improve import
cmichi Jan 17, 2019
ca065b1
fixup: Proper alignment
cmichi Jan 17, 2019
58ee588
fixup: Do not use internal constant in public description
cmichi Jan 17, 2019
247d37b
fixup: Add comment regarding invariants
cmichi Jan 18, 2019
8e4deda
fixup: Move assertion to compile-time check
cmichi Jan 18, 2019
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions README.adoc
Original file line number Diff line number Diff line change
Expand Up @@ -192,6 +192,7 @@ You will also need to install the following packages:
- Linux:
[source, shell]
sudo apt install cmake pkg-config libssl-dev git clang libclang-dev
sudo pacman -S cmake pkgconf openssl git clang

- Mac:
[source, shell]
Expand Down
303 changes: 303 additions & 0 deletions core/executor/src/heap.rs
Original file line number Diff line number Diff line change
@@ -0,0 +1,303 @@
// Copyright 2017-2019 Parity Technologies (UK) Ltd.
// This file is part of Substrate.

// Substrate is free software: you can redistribute it and/or modify
// it under the terms of the GNU General Public License as published by
// the Free Software Foundation, either version 3 of the License, or
// (at your option) any later version.

// Substrate is distributed in the hope that it will be useful,
// but WITHOUT ANY WARRANTY; without even the implied warranty of
// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
// GNU General Public License for more details.

// You should have received a copy of the GNU General Public License
// along with Substrate. If not, see <http://www.gnu.org/licenses/>.

#![warn(missing_docs)]

//! This module implements a buddy allocation heap.
//! It uses a binary tree and follows the concepts outlined in
//! https://en.wikipedia.org/wiki/Buddy_memory_allocation.
use std::vec;
use std::collections::HashMap;

// The pointers need to be aligned to u8 allocations.
// By choosing a block size which is a multiple of this
// number it is ensured that a pointer is always aligned.
// This is because in buddy allocation a pointer always
// points to the start of a block.
//
// In our case the alignment for unknown-unknown-webassembly
// is 1 byte though, i.e. the pointer will always be aligned.
const BLOCK_SIZE: usize = 8192; // 2^13 Bytes

#[repr(u8)]
#[derive(PartialEq, Copy, Clone)]
enum Node {
Free,
Full,
Split,
}

pub struct Heap {
allocated_bytes: HashMap<u32, u32>,
levels: u32,
tree: vec::Vec<Node>,
total_size: u32,
}

impl Heap {

pub fn new(reserved: u32) -> Self {
let leafs = reserved / BLOCK_SIZE as u32;
let levels = Heap::get_tree_levels(leafs);
let node_count: usize = (1 << levels + 1) - 1;

Heap {
allocated_bytes: HashMap::new(),
levels,
tree: vec![Node::Free; node_count],
total_size: 0,
}
}

/// Gets requested number of bytes to allocate and returns an index offset.
pub fn allocate(&mut self, size: u32) -> u32 {
let new_total_size = self.total_size + size;
if new_total_size > self.total_size {
trace!(target: "wasm-heap", "Heap size is over {} Bytes after allocation", new_total_size);
}
self.total_size = new_total_size;

// Get the requested level from number of blocks requested
let blocks_needed = (size as f32 / BLOCK_SIZE as f32).ceil() as u32;
let levels_needed = Heap::get_tree_levels(blocks_needed);
if levels_needed > self.levels {
panic!("heap is too small: {:?} > {:?}", levels_needed, self.levels);
}

// Start at tree root and traverse down
let mut index = 0;
let mut current_level = self.levels;
'down: loop {
let buddy_exists = index & 1 == 1;

if current_level == levels_needed {
if self.tree[index] == Node::Free {
self.tree[index] = Node::Full;

if index > 0 {
let parent = self.get_parent_node_index(index);
self.update_parent_nodes(parent);
}

break 'down;
}
} else {
match self.tree[index] {
Node::Full => {
if buddy_exists {
// Check if buddy is free
index += 1;
} else {
break;
}
continue 'down;
},

Node::Free => {
// If node is free we split it and descend further down
self.tree[index] = Node::Split;
index = index * 2 + 1;
current_level -= 1;
continue 'down;
},

Node::Split => {
// Descend further
index = index * 2 + 1;
current_level -= 1;
continue 'down;
},
}
}

if buddy_exists {
// If a buddy exists it needs to be checked as well
index += 1;
continue 'down;
}

// Backtrack once we're at the bottom and haven't matched a free block yet
'up: loop {
if index == 0 {
panic!("heap is too small: tree root reached.");
}

index = self.get_parent_node_index(index);
current_level += 1;
let has_buddy = index & 1 == 1;
if has_buddy {
index += 1;
break 'up;
}
}
}

let current_level_offset = (1 << self.levels - current_level) - 1;
let level_offset = index - current_level_offset;
let block_offset = level_offset * (1 << current_level);

let ptr = BLOCK_SIZE as u32 * block_offset as u32;
self.allocated_bytes.insert(ptr, size as u32);

ptr
}

pub fn deallocate(&mut self, ptr_: u32) {
let ptr = &(ptr_ as u32);
let allocated_size = self.allocated_bytes.get(ptr).unwrap().clone() as u32;

let new_total_size = self.total_size - allocated_size;
if new_total_size < self.total_size {
trace!(target: "wasm-heap", "Heap size over {} Bytes after deallocation", new_total_size);
}
self.total_size = new_total_size;

let count_blocks = (allocated_size as f32 / BLOCK_SIZE as f32).ceil() as u32;
let block_offset = ptr / BLOCK_SIZE as u32;
self.free(block_offset, count_blocks);
self.allocated_bytes.remove(ptr).unwrap();
}

pub fn free(&mut self, block_offset: u32, count_blocks: u32) {
let requested_level = Heap::get_tree_levels(count_blocks);
let current_level_offset = (1 << self.levels - requested_level) - 1;
let level_offset = block_offset / (1 << requested_level);
let index_offset = current_level_offset + level_offset;

if index_offset > self.tree.len() as u32 - 1 {
panic!("index offset {} is > length of tree {}", index_offset, self.tree.len());
}

self.free_and_merge(index_offset as usize);

let parent = self.get_parent_node_index(index_offset as usize);
self.update_parent_nodes(parent);
}

fn get_parent_node_index(&mut self, index: usize) -> usize {
(index + 1) / 2 - 1
}

fn free_and_merge(&mut self, index: usize) {
self.tree[index] = Node::Free;

if index == 0 {
return;
}

let other_node: usize;
let has_right_buddy = (index & 1) == 1;
if has_right_buddy {
other_node = index + 1;
} else {
other_node = index - 1;
}

if self.tree[other_node] == Node::Free {
let parent = self.get_parent_node_index(index);
self.free_and_merge(parent);
}
}

fn update_parent_nodes(&mut self, index: usize) {
let left_child = index * 2 + 1;
let right_child = index * 2 + 2;

let children_free = self.tree[left_child] == Node::Free && self.tree[right_child] == Node::Free;
let children_full = self.tree[left_child] == Node::Full && self.tree[right_child] == Node::Full;
if children_free {
self.tree[index] = Node::Free;
} else if children_full {
self.tree[index] = Node::Full;
} else {
self.tree[index] = Node::Split;
}

if index == 0 {
// Tree root
return;
}

let parent = self.get_parent_node_index(index);
self.update_parent_nodes(parent);
}

fn get_tree_levels(count_blocks: u32) -> u32 {
let requested_blocks;
if count_blocks == 0 {
requested_blocks = 1;
} else {
requested_blocks = count_blocks.next_power_of_two();
}

(requested_blocks as f64).log2() as u32
}

}

#[cfg(test)]
mod tests {
use heap::BLOCK_SIZE;

#[test]
fn should_calculate_tree_size_from_heap_size() {
let heap_size = BLOCK_SIZE * 4;
let heap = super::Heap::new(heap_size as u32);

assert_eq!(heap.levels, 2);
}

#[test]
fn should_round_tree_size_to_nearest_possible() {
let heap_size = BLOCK_SIZE * 4 + 1;
let heap = super::Heap::new(heap_size as u32);

assert_eq!(heap.levels, 2);
}

#[test]
fn heap_size_should_stay_zero_in_total() {
let heap_size = BLOCK_SIZE * 4;
let mut heap = super::Heap::new(heap_size as u32);
assert_eq!(heap.total_size, 0);

let ptr = heap.allocate(42);
assert_eq!(heap.total_size, 42);

heap.deallocate(ptr);
assert_eq!(heap.total_size, 0);
}

#[test]
fn heap_size_should_stay_constant() {
let heap_size = BLOCK_SIZE * 4;
let mut heap = super::Heap::new(heap_size as u32);
for _ in 1..10 {
assert_eq!(heap.total_size, 0);

let ptr = heap.allocate(42);
assert_eq!(ptr, 0);
assert_eq!(heap.total_size, 42);

heap.deallocate(ptr);
assert_eq!(heap.total_size, 0);
}

assert_eq!(heap.total_size, 0);
}

}
1 change: 1 addition & 0 deletions core/executor/src/lib.rs
Original file line number Diff line number Diff line change
Expand Up @@ -63,6 +63,7 @@ mod wasm_executor;
#[macro_use]
mod native_executor;
mod sandbox;
mod heap;

pub mod error;
pub use wasm_executor::WasmExecutor;
Expand Down
34 changes: 3 additions & 31 deletions core/executor/src/wasm_executor.rs
Original file line number Diff line number Diff line change
Expand Up @@ -32,35 +32,7 @@ use primitives::sandbox as sandbox_primitives;
use primitives::{H256, Blake2Hasher};
use trie::ordered_trie_root;
use sandbox;


struct Heap {
end: u32,
}

impl Heap {
/// Construct new `Heap` struct.
///
/// Returns `Err` if the heap couldn't allocate required
/// number of pages.
///
/// This could mean that wasm binary specifies memory
/// limit and we are trying to allocate beyond that limit.
fn new(memory: &MemoryRef) -> Self {
Heap {
end: memory.used_size().0 as u32,
}
}

fn allocate(&mut self, size: u32) -> u32 {
let r = self.end;
self.end += size;
r
}

fn deallocate(&mut self, _offset: u32) {
}
}
use heap;

#[cfg(feature="wasm-extern-trace")]
macro_rules! debug_trace {
Expand All @@ -73,7 +45,7 @@ macro_rules! debug_trace {

struct FunctionExecutor<'e, E: Externalities<Blake2Hasher> + 'e> {
sandbox_store: sandbox::Store,
heap: Heap,
heap: heap::Heap,
memory: MemoryRef,
table: Option<TableRef>,
ext: &'e mut E,
Expand All @@ -84,7 +56,7 @@ impl<'e, E: Externalities<Blake2Hasher>> FunctionExecutor<'e, E> {
fn new(m: MemoryRef, t: Option<TableRef>, e: &'e mut E) -> Result<Self> {
Ok(FunctionExecutor {
sandbox_store: sandbox::Store::new(),
heap: Heap::new(&m),
heap: heap::Heap::new(m.used_size().0 as u32),
memory: m,
table: t,
ext: e,
Expand Down
4 changes: 1 addition & 3 deletions core/sr-io/without_std.rs
Original file line number Diff line number Diff line change
Expand Up @@ -132,9 +132,7 @@ pub fn storage(key: &[u8]) -> Option<Vec<u8>> {
if length == u32::max_value() {
None
} else {
let ret = slice::from_raw_parts(ptr, length as usize).to_vec();
ext_free(ptr);
Some(ret)
Some(<Vec<u8>>::from_raw_parts(ptr, length as usize, length as usize))
Copy link
Contributor

@pepyakin pepyakin Jan 16, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As was discussed in the issue, this actually violates formal preconditions required by Vec::from_raw_parts (i.e. the pointer is allocated not with Vec routines). Given that we agree with it, I still would like to have a comment here mentioning this fact.

And there should be another case nearby (below).

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm still against using Vec::from_raw_parts because of all the invariants. If we can prove the invariants, I'm happy.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@gavofyork What is your opinion on this?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It may be better to return some other new type as well.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As I stated in the other thread I'm against this UB dogma. I demonstrated that this code is safe. If you think otherwise please bring the argument forward, otherwise this code goes in.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I'd agree that it would work for us and I don't have anymore to add to our discussion here! But I'd still be in favor of a comment here. I hope it will save us a headache.

Also, @cmichi there is another case where we apply the same pattern: child_storage, a function below. I'd like them to be synchronized.

}
}
}
Expand Down
2 changes: 1 addition & 1 deletion node/executor/src/lib.rs
Original file line number Diff line number Diff line change
Expand Up @@ -381,7 +381,7 @@ mod tests {
},
CheckedExtrinsic {
signed: Some((alice(), 0)),
function: Call::Consensus(consensus::Call::remark(vec![0; 120000])),
function: Call::Consensus(consensus::Call::remark(vec![0; 140000])),
}
]
)
Expand Down