Skip to content

Commit dbfc511

Browse files
committed
Spell, re-use
1 parent b1d053c commit dbfc511

File tree

1 file changed

+6
-5
lines changed

1 file changed

+6
-5
lines changed

src/5-levels-of-data.md

Lines changed: 6 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -3,18 +3,19 @@
33
Not all data are created equal.
44
There are notable differences in how much you can do with data and how much effort it takes.
55
The more reusable data is, the easier it will be to use it as a developer, researcher or other type of data user.
6+
Re-useability is about being able to transform, sort, query, serialize, modify, render and audit data without requiring too much work.
67

78
_This list is inspired by Tim Berners-Lee's [5-star open data](https://5stardata.info/en/)_.
89

910
## Level 0: proprietary data
1011

1112
If you don't give others the _rights_ to read, use or modify your data, it's reusability is zero.
1213

13-
That's why it's important to have a _licence_ that allow others to use your data.
14+
That's why it's important to have a _license_ that allow others to use your data.
1415
A good choice for a permissive option is the [Open Database License](https://opendatacommons.org/licenses/odbl/summary/).
1516
Creative Commons licenses are also good options to clearly communicate _if_, and if so then _how_, your data is permitted to be re-used.
1617

17-
It's also important to use _open formats_ (such as `CSV`, `JSON` or `PNG`), intead of _proprietary formats_ (tied to specific vendors, such as `PSD` or `RAR`).
18+
It's also important to use _open formats_ (such as `CSV`, `JSON` or `PNG`), instead of _proprietary formats_ (tied to specific vendors, such as `PSD` or `RAR`).
1819

1920

2021
## Level 1: unstructured data
@@ -51,15 +52,15 @@ If we want predictability, we need to make it _type-safe_.
5152

5253
## Level 3: type-safe data
5354

54-
_Examples: SQL + DB SCHEMA, JSON + JSON schema, XSD + XML, RDF + SHACL, In-memory data in type-safe programming langauges_
55+
_Examples: SQL + DB SCHEMA, JSON + JSON schema, XSD + XML, RDF + SHACL, In-memory data in type-safe programming languages_
5556

5657
Type-safe data means that every value of the data has an explicit datatype.
5758
It is _strongly typed_ and has a clear _schema_ that describes which properties you can expect in a Resource.
5859
This means that someone re-using type-safe data can know for certain that it conforms to a specification, a set of rules.
5960
The shape of the data is _predictable_.
6061
This predictability means that developers can safely re-use it in their system without worrying about missing fields or datatype errors.
6162

62-
Lots of software has _internal_ type safety, especially if you use type-safe programming langauges like Typescript, Kotlin or Rust.
63+
Lots of software has _internal_ type safety, especially if you use type-safe programming languages like Typescript, Kotlin or Rust.
6364
However, when the data _leaves the system_, a lot of type related data is lost.
6465
Even if this schema related information is described, the schema itself is often not machine-readable.
6566
The best way to have type-safe data, is to describe the schema in a machine-readable format.
@@ -79,7 +80,7 @@ In Atomic Data, the Properties themselves (the links in the keys in JSON-AD) des
7980

8081
## Level 4: browsable data
8182

82-
_Examples: Atomic Data, propertly hosted RDF_
83+
_Examples: Atomic Data, properly hosted RDF_
8384

8485
If your data is _connected_ to other pieces of machine-readable dat, is becomes browsable, similar to how websites link to each other.
8586
This effectively creates a _web of data_, and allows for a whole new way to think about the internet.

0 commit comments

Comments
 (0)