x86-bootloader

build an x86 bootloader

The point of this project is to learn about computation by developing an OS bootloader from scratch. The first question we face when developing on bare metal is which computer architecture to use. A computer architecture, (or instruction set), is a mapping from binary machine codes to well-defined hardware operations. During execution, the CPU reads machine codes in sequence and executes the corresponding operations to manipulate memory and its internal state.

The main two instruction set categories are x86 and ARM, each having their own advantages. The x86 architecture (rather, family of architectures) was introduced by Intel in 1978 for the Intel 8086 microprocessor. The 8086 had an instruction size of 16 bits, which greatly limited the range of memory it could reach by comparison to today. Thus in 1985 Intel released a 32-bit version of the x86 architecture for the i386 microprocessor. This enabled the CPU to reach a far greater range of memory, around 4 GB. Finally, a 64-bit version of x86 was released in 1999 which included new modes and further built-in support for memory management. We will focus on the 32-bit version of the x86 architecure to keep things simple.

ARM is an instruction set which has been conventionally used for embedded and low-power applications. It has recently seen amazing success as the architecture behind Apple's buil-in-house M1 microprocessor. We choose x86 rather than ARM in this project for several reasons:

x86 is "classic" and worth studying to understand its influence and design patterns
it is widely used and will continue to be widely used for a long time
it is probably better to learn than ARM as a first architecture as it's older and simpler(?)

The instruction set determines both the family of processors we may write for and the assembly language we will write in. An assembly language is a mapping from commands made of simple mnemonics to machine codes for some computer architecture. The assembler is a program which transforms assembly code into the corresponding machine code for some architecture.¹ We will use the NASM x86 32-bit assembly language to write our assembly code for a 32-bit, x86 processor. We make use of the QEMU software package for emulating this processor and its peripherals.

We will start with the boot process in assembly and begin working up from there. Everything that we will need to learn will be taught along the way, with various examples and well-commented code that can be ran using the Makefile. Eventually we'll be compiling C programs to assembly and building our kernel code base tha way.

But first, let's start at the beginning.

The boot process

We take the BIOS as a given. The BIOS is a set of software routines stored on a chip which is loaded into memory and initialized when the computer is switched on. The BIOS detects and gives basic control of the essential devices: keyboard, screen and hard disks.

The BIOS's job is to boot the computer by reading the boot sector into memory. The boot sector is a portion of memory (in 512-byte chunks) starting at the very first physical location on the hard disk. The BIOS knows it has reached the end of the boot sector when it encounters the 2-byte magic number 0xaa55.

In memory, these bytes will be written '55 aa' as the x86 is a little-endian system, meaning that multi-byte values are written with the least-significant preceding the most-significant bytes. This magic number will occur at the end of a 512-byte block.

While booting, the BIOS loops through each physical device and finds the one which contains the boot sector. The data in the boot sector is then loaded into memory as the operating system's ~~machine code~~ bootloader (?). We are the ones who write this bootloader and operating system.

The simplest example of machine code would be a block of 512 bytes which:

ends with the magic number
begins with a loop containing a very simple jump instruction
is padded the rest of the way with zero-valued bytes².

We can write the following machine code, using a hex editor like Hex Fiend, to the file minimal_example.bin:

eb fe 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
...
00 00 00 00 00 00 00 00 00 00 00 00 00 00 55 aa

The file is padded with zero values to contain 512 bytes in all.

Next, we must emulate the x86 64-bit processor into which this machine code is booted and executed. We can do this using a hardware emulator like QEMU:

qemu-system-x86_64 miminal.bin

The result is that it should launch the boot process and then hang as there is no operating system code to be executed³.

Assembly language

An x86 assembler such as nasm can be used to convert assembly language into bytecode for an x86 processor. Assembly language is meant to provide a human-readable way to write machine code, with short mnemonics for many CPU instructions in the x86 instruction set. boot_sector.asm is an assembly code file for the minimal boot sector we created in the last section. We can assemble the machine code with the following command:

nasm boot_sector.asm -f bin -o boot_sector.bin

It is worth noting that at this point our processor boots into 16-bit real mode, which means that its instructions have a length of 16 bits and addresses refer to the corresponding real locations in physical memory. Real mode stands in opposition to protected mode, which prevents a userspace process from accessing kernel memory.

When programming assembly, we can make use of interrupts. These are a mechanism that have the CPU temporarily halt what it is doing and run a BIOS routine. Each interrupt is represented by an index in the interrupt vector, a table initially set up by the BIOS at physical address 0x0 in memory. The entries in this table are pointers to interrupt service routines (ISRs), or sequences of instructions in memory that perform the function of a specific interrupt.

The CPU has four registers for quick storage of two-byte values:

ax, with high-byte ah and low-byte al,
bx, with high-byte bh and low-byte bl,
cx, ...
dx, ...

A value can be assigned to a register using the mov operation:

mov ax, 1234 ; store the decimal number 1234 in ax

Let us now write a boot sector that prints "hello world" to the screen. We need the interrupt code for a BIOS routine that prints a character to the screen. It turns out we need the BIOS interrupt 0x10, with ah set to 0x0e to indicate teletype mode. This prints to screen the ASCII character stored in byte al and advances the cursor by one character. The assembly code for this is in hello_simple.asm.

We know that the BIOS loads our 512-byte boot sector into some place in memory and then, after initializing, leaves the CPU to jump to the start of the boot sector and begin executing. During initialization, the BIOS stored the interrupt vector at the very start of memory (0x0) and stored ISR code in various places as well. To avoid collision with any of these, the BIOS loads the boot sector machine code to a reserved block at address 0x7c00.

This figure represents the typical layout of lower memory after boot:


        |                                  |
        | free                             |
0x100000|----------------------------------|
        | BIOS (256 kB)                    |
 0xC0000|----------------------------------|
        | video memory (128 kB)            |
 0xA0000|----------------------------------|
        | extended BIOS data area (639 kB) |
 0x9fc00|----------------------------------|
        | free (638 kB)                    |
  0x7e00|----------------------------------|
        | loaded boot sector (512 bytes)   |
  0x7c00|----------------------------------|
        |                                  |
   0x500|----------------------------------|
        | BIOS data area (256 bytes)       |
   0x400|----------------------------------|
        | interrupt vector table (1 kB)    |
     0x0====================================

An exercise in locating data with these offsets is contained in the assembly code find_the_byte.asm.

The stack

The CPU also has an important functionality called the stack. The stack is a way to easily store and retrieve values beyond what can be stored in the registers without worrying about those values' actual locations in memory. The stack is defined by two values, bp and sp. bp is the memory location which serves as the base of the stack. sp is the location of the item on top of the stack, which is initially bp. When a word (i.e. two bytes in 16-bit real mode) is added to the stack, those bytes are written at location sp and sp is decremented to point to the next space down in memory. Thus the stack begins at bp and extends downward in memory according to the number of words it contains. For this reason it is important to allocate bp, the base of the stack, far enough above memory regions used for other code that the stack will never grow to the point those regions are overwritten. The assembly program stack_example.asm demonstrates usage of the stack to print "ABC".

Several more stack utilities:

call: stores the return address (address after current one) on the stack and then jumps to function
ret: used at the end of the function, pops top value (return address) off the stack and jumps to it
pusha: used at the beginning of a function, pushes all registers to the stack to avoid overwriting data they contained if function uses those registers
popa: used at the end of a function, pops top values from the stack back into the registers

So code with a function will look like:

some_function:
  pusha        ; Push all registers to the stack.
  mov bx, 10   ; Do some stuff with registers.
  add bx, 20
  mov ah, 0x0e ; Setup BIOS teletype output.
  int 0x10     ; Print character.
  popa         ; Pop values from stack back into registers.
  ret          ; Return to address right after function call that got us here.

Include files

The statement

%include "my_file.asm"

will be completely replaced by the contents of the file.

A more sophisticated "hello world" program that ties these things together is hello_advanced.asm, which makes use of labels, the stack, and the print_string routine as well. print_hex.asm makes use of the assembly operations and and shr to convert a register value to a hex string and print to screen. print_hex_example.asm demonstrates usage.

Reading the disk

So that we can reach addresses higher than 65535 on a 16-bit CPU, we introduce segments that provide a programmable data offset to the assembler, like the usage of the [org 0x7c00]. Segments provide an offset equal to the value they contain times 16 (which is easy to compute through a leftwise bit-shift). For example, setting a segment to 0x7c0 and then getting an address using that segment will offset the address by 0x7c00. It turns out that by default data addresses are offset by the ds segment. However we can explicitly use another segment, like the es segment, like so: mov al, [es:my_label]. See a demonstration of segment usage in segments_example.asm.

Spinning disk drives

Locations on the disks are addressed by three physical coordinates:

head, which describes which platter and side are to be read from
cylinder, which describes which ring-shaped track of data is to be read from
sector, a 512-byte segment of the track

Reading the disk with BIOS

Though various disk types use different data bus technologies, BIOS abstracts this away and provides several routines to interact with common disk devices.

One such routing is BIOS interrupt 0x13. This requires setting ah to 0x02 and several other registers to tell the CPU where to read the data from and to. See the following:

mov ah, 0x02 ; BIOS read sector function.
mov dl, 0    ; Read drive 0 (i.e. first floppy drive).
mov ch, 3    ; Select cylinder 3.
mov dh, 1    ; Select 2nd side of floppy (this has zero-based indexing)
mov cl, 4    ; Select the 4th sector on the track (this has one-based indexing)
mov al, 5    ; Read 5 sectors (512-byte chunks) from the start point.
 
; Set address we want to move data to by writing to `es:bx`. Let's say 0xa1234.
mov bx, 0xa000
mov es, bx
mov bx, 0x1234
 
int 0x13     ; Perform the actual read.

We can evaluate the success of this operation by looking in the carry flag (CF) of the flags register. This flag is set to signal a general fault. Furthermore, the number of sectors actually read is stored in al after the interrupt. We can check the carry flag using the jump operator jc, which only jumps if the carry flag was set:

int 0x13
 
; Check carry flag.
jc disk_error
 
; Check that 5 (the expected number) of sectors were read.
cmp al, 5
jne disk_error
 
disk_error:
  mov bx, ERROR_MSG
  call print_string
  jmp $
 
ERROR_MSG:
  db 'disk error', 0

We put this all together in the function disk_load.asm, which reads the first n sectors following the boot sector from drive dh and where n equals dl.

32-bit protected mode

32-bit protected mode is different from 16-bit real mode:

registers are extended to 32-bits e.g.: mov ebx, 0274fe8fe
two more general purpose registers fs, gs
32-bit offsets
- offset can reference up to 4 GB of memory (0xffffffff)
memory segmentation
- protect kernel from user applications
- virtual memory with page swapping
more sophisticated interrupt handling

Printing to screen without BIOS

There is a display device which can be configured into text mode, wherein what is displayed on the screen is a visual representation of a specific range of memory. Specifically, each character cell of the display device is represented by two bytes in memory: the first byte is the ASCII code of the character to display and the second encodes attributes such as colors or blinking. This memory begins at the offset 0xb8000 and is laid out sequentially with 80 characters per row. print_string_pm.asm gives an example of printing a string a string by writing directly to video memory.

The global descriptor table

The global descriptor table is an important data structure in memory which is used for the protected-mode memory addressing system. In protected mode, a logical address is the combination of a segment register and an offset (i.e. [es:0x0f] for the 15th byte 15 of the segment starting at the address stored in es). And the way a logical address is mapped to a physical address is different as well. Whereas:

in 16-bit real mode, the physical address was obtained by multiplying the segment register by 16 and adding the offset,
in 32-bit protected mode, the physical address is obtained by indexing into the global descriptor table with the segment register and adding the offset described by that entry.

An entry in the global descriptor table occupies 8 bytes and looks like this:

 31        24 23   22   21   20   19             16 15  14  13 12  11   8 7          0
+------------+---+-----+---+-----+-----------------+---+------+---+------+------------+ byte
| base 31:24 | G | D/B | L | AVL | seg limit 19:16 | P | DPL  | S | type | base 23:16 |  4
+------------+---+-----+---+-----+-----------------+---+------+---+------+------------+

 31                                   16 15                                    0
+------------------------------------------+------------------------------------------+ byte
|            base address 15:00            |            segment limit 15:00           |  0
+------------------------------------------+------------------------------------------+

base/limit:
    LIMIT (2 bytes, 4 bits) segment limit
    BASE  (1 byte, 2 bytes) segment base address

1st flags:
    P     (1 bit)           segment present (use 1, indicates segment is present in memory)
    DPL   (2 bits)          descriptor privilege level (00 is highest privilege)
    S     (1 bit)           descriptor type (0=system, 1=code or data)

type flags:
    TYPE  (4 bits)          segment type
        code        (1 bit) 1 (this is a code segment) or 0

    [if code segment]
        conforming  (1 bit) 0 (not conforming [protects memory]) or 1 (conforming)
        readable    (1 bit) 1 (readable) or 0 (execute-only)
    [if data segment]
        expand down (1 bit) 1 (allow segment to expand down) or 0
        writeable   (1 bit) 1 (writeable) or 0 (read-only)

        accessed    (1 bit) 0 (set to 1 once accessed)

2nd flags:
    G     (1 bit)           granularity (left-shift limit by 3)
    D/B   (1 bit)           default operation size (0=16-bit segment, 1=32-bit segment)
    L     (1 bit)           64-bit code segment (IA-32e mode only)
    AVL   (1 bit)           available for user by system software

segment base address: this is the offset that this entry describes.
default operation size: whether segment is to be interpreted as 32-bit or 16-bit code.
descriptor privilege level: 0 is the highest privilege.
descriptor type: 1 for code or data segment, 0 is used for traps
type:
- highest bit: code: 1 if code segment
- next bit: conforming: 0 (not conforming) means code in a segment with lower privilege may not call code in this segment, which is key to memory protection.
- next bit: readable: 1 means readable, 0 means execute-only. Readable allows us to read constants defined in the code.
- lowest bit: accessed: used for debugging and virtual memory techniques since the CPU sets this bit to 1 when it accesses the segment.
granularity: if set to 1, limit is left-shifted by 3 (multiplied by 4 K).
64-bit code segment: set to 0, unused on 32-bit processor.
AVL: can be used for our purposes (e.g. debugging)

We point the CPU to the GDT through a simple 6-byte structure called the GDT descriptor:

first two byte: GDT size
next four bytes: GDT address

The preparation of a global descriptor table is shown in gdt.asm.

Switching into 32-bit mode

The actual switch from 16-bit real to 32-bit protected mode involves the following steps:

Clear interrupts: cli. This is important to avoid attempting to use the IVT that BIOS set up at the start of memory.
Inform the CPU about the GDT: lgdt [gdt_descriptor].

Make the actual switch over by setting the first bit of the CPU control register cr0:

mov eax, cr0 ; We cannot set a single bit in `cr0` directly, so we store in `eax` and
or eax, 0x1  ; leave the top seven bits untouched but set lowest bit to 1, then
mov cr0, eax ; put the result back into `cr0` with the updated lowest bit.

Clear the instruction pipeline by issuing a far jump⁴ to the code segment, which we labelled earlier while creating the GDT. Because this is a far jump, the cs register is set to the offset of the segment being jumped to. We pass in the offset of the code segment in the GDT and jump to a well-defined label at the start of our 32-bit, protected mode code:
```
  jmp CODE_SEG:start_protected_mode

[bits 32]

start_protected_mode:
  ...                 ; Here we begin initialization in protected mode.
```
Use the [bits 32] directive to tell our assembler that, from this point onwards, it should encode in 32-bit intsructions.

We combine these steps into a reusable switchover routine in switch_to_pm.asm. Note that the first thing we do for initialization in protected mode is reset our segment registers to the data segment in our GDT and set up the stack.

We combine everything we have learned thus far into a boot sector that demonstrates the switch from 16-bit real mode into 32-bit protected mode: boot_to_pm.asm.

C compilation

We will be writing our kernel in C and compiling it to machine code. At boot, our boot sector will be loaded into memory and executed to read in the kernel code. In preparation for writing the kernel, we will study C compilation, the process by which C source code is converted to assembly code.

Consider the following C code in the file basic_example.c:

// Define an empty function that returns an integer
int my_function() {
    return 0xbaba;
}

We can compile the code to the file basic_example.o using gcc:

x86_64-elf-gcc -m32 -ffreestanding -c basic_example.c -o basic_example.o

and then disassemble it using objdump:

objdump -d basic_example.o

The output of objdump is the following:

basic_example.o:    file format elf32-i386


Disassembly of section .text:

00000000 <my_function>:
       0: 55                            pushl   %ebp
       1: 89 e5                         movl    %esp, %ebp
       3: b8 ba ba 00 00                movl    $47802, %eax
       8: 5d                            popl    %ebp
       9: c3                            retl

We can make that a bit friendlier by translating to NASM and commenting:

basic_example.o:    32-bit i386

Machine code:               Assembly:          
00000000 <_my_function>:    my_function:
0: 55                         push bp         ; Preserve current frame pointer in case we call
                                              ; another function inside this one. That way we can
                                              ; reset the frame pointer to its value before calling
                                              ; this function at the end of our function code.
1: 89 e5                      mov bp, sp      ; Create new pointer to current stack top, so that we
                                              ; can use the stack for our own purposes during this
                                              ; function.
3: b8 ba ba 00 00             mov eax, 0xbaba ; Move return value into A register, which is the
                                              ; register expected to hold the return value when  
                                              ; flow of control exits this function.
8: 5d                         pop bp          ; Replace stack pointer with what it was before the
                                              ; call to this function. That way we return to the
                                              ; context we had before this function was called.
9: c3                         ret             ; Pop return address off the stack and jump back to
                                              ; the address from which this routine was called.

To implement functions on top of assembly routines, we introduce the concept of a frame. For noew it suffices to know that a frame is a stack pointer associated to a call to a function in context. When a function is called, the start address of its caller's frame is pushed onto the stack. Upon starting, the function sets up its own stack frame at the top of the stack of its caller. Once the function returns, the top value on the stack is popped to the ebp register. This is the equivalent of saying the new current frame has a stack back at address ebp, the caller's frame.

The object files output by the compiler contain the raw machine code plus some annotation. This annotation associates label to their relative addresses and is there to be used by the linker. During compilation, the linker stitches together the machine code from the object files into one executable binary. When writing this binary, it substitutes labels for absolute addresses using the object file annotations.

To use the linker to create an executable using the object file above, run:

x86_64-elf-ld -melf_i386 -o basic_example.bin -Ttext 0x0 --oformat binary basic_example.o

-melf_i386 sets emulation to elf_i386, the original 32-bit x86 architecture.
-o basic_example.bin specifies the output file as basic_example.bin,
-Ttext 0x0 says to offset all addresses in the binary relative to 0x0. This option is important when writing our kernel later, as the kernel is loaded to some known location in memory and we must offset addresses relative to it.
--oformat binary specifies the output format as binary (leaving no metadata in the machine code).
basic_example.o is an input argument, the file to be linked.

Now let us dissassemble this binary using a NASM dissassembler:

ndisasm -b 32 basic_example.bin

The result is:

00000000  55                push ebp
00000001  89E5              mov ebp,esp
00000003  B8BABA0000        mov eax,0xbaba
00000008  5D                pop ebp
00000009  C3                ret

This is essentially the same as our "nice" version of the dissassembly output from objdump. The left-hand column shows the file offsets of the instructions. The middle columns shows the machine code as one or multiple bytes. The right column shows the NASM assembly instruction GCC generated.

Now we can write some C code declaring a local variable and compile, link and disassemble it as before. We write the following to local_var_example.c:

// Declare a local variable.
int my_function() {
    int my_var = 0xbaba;
    return my_var;
}

And then run:

x86_64-elf-gcc -m32 -ffreestanding -c local_var_example.c -o local_var_example.o
x86_64-elf-ld -melf_i386 -o local_var_example.bin -Ttext 0x0 --oformat binary local_var_example.o
ndisasm -b 32 local_var_example.bin

We get the following output:

00000000  55                push ebp
00000001  89E5              mov ebp,esp
00000003  83EC10            sub esp,byte +0x10
00000006  C745FCBABA0000    mov dword [ebp-0x4],0xbaba
0000000D  8B45FC            mov eax,[ebp-0x4]
00000010  C9                leave
00000011  C3                ret

Let's walk through this line by line:

00: First we push ebp to the stack, storing the frame pointer of the caller's stack frame.
01: Next, we set the stack base to the current top of the stack, in order to use the stack for the current function.
03: reserves 16 (0x10) bytes on the stack for local use. This is enough for one variable of whatever size it may be, though we will only use the first four bytes to store the 32-bit int.
06: store the double-word value 0xbaba at the address ebp - 4, filling up the 4 bytes at the base of this function's stack. Here we use a CPU shortcut called effective address manipulation, where the CPU calculates the specified offset of 0x4 on-the-fly from the value in ebp (which is only known at runtime). dword states explicitly that we are storing a double word (4 bytes) on the stack, 0x0000baba (not a single word, 0xbaba or four words 0x000000000baba).
0D: load the value 0xbaba (at the address ebp - 4) into the A register.
10: leave is equivalent to:
```
mov esp, ebp    ; Put the stack back to how it was before calling this function. This is the
pop ebp         ; reciprocal of the first two lines of this function.
```
- Line 1 of leave says the new top of the stack is ebp, or the value at the bottom of the stack from of the function that is exiting. It should be fine to overwrite any memory there because anything above it is leftover values on the stack
11: pop return address off the stack and jump (back) to it.

Throughout all of this, what we think of as my_var is tracked by the compiler as ebp - 0x4, i.e., the first four bytes of the stack. The compiler stores the return value my_var in A register knowing this is interpreted as my_var's return value.

Calling functions

Now see the following code featuring one function which calls another, from calling_example.c:

void callee_function(int my_arg) {
    return my_arg;
}

void caller_function() {
    callee_function(0xdede);
}

Compilation/linking and dissasembly gives:

00000000  55                push ebp          ; callee start
00000001  89E5              mov ebp,esp
00000003  8B4508            mov eax,[ebp+0x8]
00000006  5D                pop ebp
00000007  C3                ret               ; callee end
00000008  55                push ebp          ; caller start
00000009  89E5              mov ebp,esp
0000000B  68DEDE0000        push dword 0xdede
00000010  E8EBFFFFFF        call 0x0
00000015  83C404            add esp,byte +0x4
00000018  90                nop
00000019  C9                leave
0000001A  C3                ret               ; caller end

It's useful to start with the caller function.

08 and 09 are the familiar function prolog.
0B pushes double-word 0xdede to the stack as an argument to the function about to be called.
10: we call the function starting at 0x0, the callee.
00 and 01 are the familiar function prolog.
03 copies the four bytes from ebp+0x8:ebp+0x4 into the A register. Here we are actually reaching up into the callee's stack frame to get the argument's value. The callee's address occupies bytes 0x4:0x0 on the stack and the argument occupies 0x8:0x4. By convention in C, when calling arguments are pushed to the stack in reverse order, so the first argument is on the top. The A register is where the int return value for this function is supposed to be stored.
06 return stack base to the stack frame of the caller by grabbing it off the stack.
07 pop caller address off the stack and jump back to it to return flow of control.

Local variables in the stack

A variable is simply a reference to an allocated memory address, where sufficient space has been reserved to accomodate a particular data type.

Consider the following C code:

def my_function() {
    int a = 3;
    int b = 4;
    int total = a + b;
}

How would this look in assembly? First we encounter the function definition, so we start a new assembly routine with the basic function prolog. This sets up a "local" stack for the context of this function and takes note of the caller's stack so the caller's context can be restored upon return.

def my_function() -> MY_FUNCTION:
{                 ->   push ebp     ; Function prolog...
                       mov ebp, esp ; Let's say sp happened to be `0x40` before this function call.

The local stack is at this point just a flat base, though the top of the caller's stack is in fact just below it. The top of the caller's stack will include the parameters the caller passed as arguments to this function.


     (this function's stack)

0x30====0x31== . . . ==0x3f====0x40 <- bp, sp
|                              |
|       (caller's stack)       |

Next, we declare an integer a whose value is 3.

int a = 3; -> sub esp, 0x10 ; Allocate 16 bytes on the stack for int a.
              push dword [ebp-0x4], 3 ; Load 3 as a double-word at the base of the stack.

The stack now contains the 32-bit value a in the four bytes at its base.


0x20-- . . . --0x2c----0x2d----0x2e----0x2f----0x30 <- sp
|              |  0x03 |  0x00 |  0x00 |  0x00 |
0x30== . . . ==0x3c====0x3d====0x3e====0x3f====0x40 <- bp

We declare an integer b whose value is 4.

int b = 4; -> sub esp, 0x10            ; Allocate 16 bytes on the stack for int b.
              push dword [ebp-0x14], 4 ; Load 4 as a double-word at second "slot" in the stack.

The stack now also contains the 32-bit value b in the lowest four bytes of its second slot.


0x10-- . . . --0x1c----0x1d----0x1e----0x1f----0x20 <- sp
|              |  0x04 |  0x00 |  0x00 |  0x00 |
0x20-- . . . --0x2c----0x2d----0x2e----0x2f----0x30
|              |  0x03 |  0x00 |  0x00 |  0x00 |
0x30== . . . ==0x3c====0x3d====0x3e====0x3f====0x40 <- bp

Next, we declare an integer total whose value is a and b added together.

int total = a + b; -> mov eax, [ebp-0x4]  ; Move `a` into A register to perform sum.
                      add eax, [ebp-0x14] ; Add `b` to `a`, with result in A register.
                      sub esp, 0x10       ; Allocate 16-bytes on stack for int `total`.
                      mov [ebp-0x24], eax ; Store result of sum as dword in `total`'s slot in stack.

The stack now contains all three variables.

0x00-- . . . --0x0c----0x0d----0x0e----0x0f----0x10 <- sp
|              |  0x07 |  0x00 |  0x00 |  0x00 |
0x10-- . . . --0x1c----0x1d----0x1e----0x1f----0x20
|              |  0x04 |  0x00 |  0x00 |  0x00 |
0x20-- . . . --0x2c----0x2d----0x2e----0x2f----0x30
|              |  0x03 |  0x00 |  0x00 |  0x00 |
0x30-- . . . --0x3c----0x3d----0x3e----0x3f----0x40 <- bp

Lastly we store the return value in the A register, tear down the stack, and return to the caller.

return total; -> mov eax, [ebp-0x24] ; Set `a` as return value by writing to A register.
}             -> leave               ; Teardown local stack and re-setup caller's.
                 ret                 ; Jump back to the code for this function's caller.

Pointers, addresses and data

Note how, in C, memory is managed for us by the stack; that is, we don't have to worry about the addresses of a and b when allocating space and writing to those addresses. But suppose, in the course of our C programming, we want to store a value at an explicitly provided location in memory? Or how about if we want to read the value at a specific location in memory? For this, we introduce the concept of a pointer. A pointer is a special type of variable which holds a reference to the storage address for an object of some specified type. We can write to location 0xb8000 directly by creating a pointer to the address 0xb8000 called, say, ptr, and issuing a command to "write to the address referred to by ptr". The "address referred to by ptr" is represented by *ptr, where * is an operator that can be used on a pointer variable to directly access the value to which it refers. This operation is called dereferencing and * is called the dereference operator.

For instance, knowing the first character of video memory maps onto the address 0xb8000, we could print an 'X' to line 1, col 1 in the following way:

char video_address = 0xb8000 ; Declare a pointer to an integer stored at address 0xb8000.
*video_address = 'X'         ; Assign ASCII value 'X' to the integer at address 0xb8000.

The concept of dereferencing should feel familiar because the syntax [ax] from assembly code essentially dereferences a register; that is, it refers to the value stored at the address in ax. In this context, ax is being used as a pointer rather than as a data container in itself. On a 32-bit operating system, all pointers (no matter what type they point to) will be 32-bits. This is because 32-bit numbers are used to specify addresses.

In contrast to dereferencing one can use the address-of operator, &. This operator may be applied to any variable and will return the address of that variable in memory. The * and & operators are related to each other such that for variables x and y, the following equations are always true:

*(&x) == x
&(*y) == y

Representing strings

We can now see a good way to represent strings in memory. We can represent the start of the string with a pointer its first letter stored in memory: type char*. This declaration creates a variable holding the address in memory of a char, or unsigned 8-bit value (often taken to be an ASCII character code). The string then expected to be stored as a series of char values, arranged contiguously in memory beginning at the start address.

Let's see how a compiler treats a string variable in C (string_example.c):

void my_function() {
    char* my_string = "Hello";
}

When we declare the variable my_string of type char*, the compiler understands that the value which follows will be a string of char, that is a sequence of bytes to be interpreted as ASCII characters. So it will then look up the ASCII codes for the characters provided:

'H': 0x48
'e': 0x65
'l': 0x6c
'l': 0x6c
'o': 0x6f

It will call a function (starting its own new stack on top of the stack), allocate space on its stack to store this string, and store the characters in order that space. Let's write out what this might look like in assembly, and then compare it with the disassembled binary from the C compiler.

push ebp                       ; Store stack base of previous context.
mov ebp, esp                   ; Set new stack base to top of previous context's stack.
sub esp, 0x10                  ; Allocate 16 bytes on the stack for our variable.
mov dword [ebp-0x4], MY_STRING ; Store the start address to our string as a 32-bt int (double word)
                               ; in the first four bytes of the stack.
leave                          ; Teardown local stack (mov esp, ebp; pop ebp).
ret                            ; jmp back to where this function was called.

MY_STRING:                     : Assembler converts this symbol into offset of "H" byte.
  db "Hello", 0x0              ; Null-terminated string of contiguous bytes.

Here is what the compiled binary looks like, dissassembled:

00000000  55                push ebp
00000001  89E5              mov ebp,esp
00000003  83EC10            sub esp,byte +0x10
00000006  C745FC10000000    mov dword [ebp-0x4],0x10
0000000D  90                nop
0000000E  C9                leave
0000000F  C3                ret
00000010  48                dec eax
00000011  656C              gs insb
00000013  6C                insb
00000014  6F                outsd
00000015  00                db 0x00

The first few lines make sense, though we notice that MY_STRING has been replaced with the absolute offset 0x10. Now, at offset 0x10 in this machine code, things get funky: the CPU sends off a seemingly random series of commands.

The reason is for this is that the disassembler cannot distinguish between code and data when interpreting the machine code it encounters. If we instead look at bytes 0x10-0x15 as data (ASCII values rather than assembly commands), we see that these values are the ASCII characters for my_string, terminated by the null byte 0x0. The offset 0x10 stored at the base of the stack is our char* my_string: its assembly label is MY_STRING, and it points to the first character of my_string in memory.

A simple kernel in C

We now have all we need to boot and execute the simplest of kernels in C:

Write and compile kernel code.
Write and assemble boot sector code.
Create a kernel image that concatenates boot sector with compiled kernel code.
Load our kernel code into memory.
Switch to 32-bit protected mode.
Begin executing our kernel code from start of where we loaded it into memory.

Writing the code

The kernel code itself is very straightforward: its only job is to indicate that it has been successfully loaded and executed. We can keep things simple for now, and simply print an X to the first position on the console. Note that (char*) is a casting operator which typecasts the following value to the type pointer-to-char.

void main() {
    // The first position of video memory (row 0, col 0) is at memory address 0xb8000.
    // We store a pointer to this byte. `(char*) ` is necessary before 0xb8000 to typecast
    // it from a 32-bit int to a pointer-to-char.
    char* video_memory = (char*) 0xb8000;

    // We assign the value 'X' to the byte in memory pointed to by `video_memory`.
    // Note: char, byte and uint8 all contain 8 bits of memory, they just mean different things.
    *video_memory = 'X';
}

Now we need to compile this to binary. It is important to know at which position this code will reside in memory after it has been loaded in, because the linker must add that offset while resolving these addresses. We explicitly tell the linker that this code will start at position in 0x1000 in memory using the -Ttext 0x1000 option.

x86_64-elf-gcc -m32 -ffreestanding -c kernel.c -o kernel.o
x86_64-elf-ld -melf_i386 -o kernel.bin -Ttext 0x1000 --oformat binary kernel.o

Creating the kernel image

Now it comes time to write a boot sector that loads our kernel code into memory. Recall that we have BIOS routines (available in 16-bit mode) which will read an arbitrary number of sectors from the given drive, while in 32-bit mode we have no disk reading capability as of yet. This means our boot sector should load the kernel before switching into 32-bit protected mode. To simplify the problem of where to read the kernel sectors from, by convention we place them in the sectors right after the boot sector on the boot drive. This way the BIOS will load the boot sector and in it, we can use BIOS routines to read in the remaining kernel code. Our boot drive is be created by

cat boot_sect.bin kernel.bin > os-image

os-image is the operating system image: it is just the concatenation of the boot sector and kernel machine code. load_kernel.asm shows a boot sector that will boostrap the kernel from the disk containing our kernel image.

The entry point to the kernel

Before recklessly running this image, we must attend to one more important detail. To begin executing our kernel, we jumped to the start of the block where we loaded it in memory. But how do we know that address 0x0 in the kernel code corresponds to the function main()? The C compiler, in principle, may have placed the machine code for main() further down in the binary, especially if the source code file contained other functions preceding main(). We need a robust way to ensure that we enter the kernel code at the function main().

A good trick for this is to simply prepend the kernel code with a bit of assembly whose only job is to jump to the location of the symbol main. Since the symbol main is defined in the assembly compiled from kernel.c, we just need to let our assembler know that it can expect main to be defined externally. Let's put the following in enter_kernel.asm:

[bits 32]
[extern main] ; Expect main to be defined in another file.

call main     ; Enter the main function.
jmp $         ; When we return from the kernel, hang.

Now, enter_kernel.asm cannot be assembled to binary on its own, because the label main is not defined within its scope and therefore cannot be resolved. We will assemble this file (using our usual assembler) as an object file instead of a binary. The object file is in the executable and linking format (ELF), a common output format for C compilers. ELF adds annotations to the compiled assembly to allow resolving symbols across files.

nasm enter_kernel.asm -f elf -o enter_kernel.o

Now, we can use the linker to link this file to our main kernel code file itself. The linker respects the order of files as they are given on the command line; that is, the following command will ensure that kernel_entry code precedes kernel code in the binary:

x86_64-elf-ld -melf_i386 -o kernel.bin -Ttext 0x1000 --oformat binary enter_kernel.o kernel.o

Like before, we can construct our OS image by concatenating load_kernel.bin with kernel.bin. At this point we can truly say we have the beginnings of an x86 operating system.

Hardware

At this point it is good to turn our attention towards hardware. What are the real-life products we need to put this computer together? The first thing is the microprocessor itself, and the motherboard it is mounted upon. The purpose of the motherboard is to connect the CPU to memory and its peripheral devices. In particular, we want a socketed processor, meaning one which can be attached to a motherboard via a socket.

todo: flesh out this section

socketed x86 32-bit microprocessor
motherboard with 2 usb ports and 1 video output
usb disk drive with floppy
usb memory-mapped keyboard
memory-mapped text/video console

8-bit option:
commodore 64 is a classic 8-bit computer with built-in keyboard and ports for peripherals
conventionally used with commodore 1541 floppy disk drive and 5.25" floppy disk
MOS 6502 microprocessor with MOS 6502 instruction set
programmable in BASIC high-level language

should also check out the PDP-8 (https://en.wikipedia.org/wiki/PDP-8) from 1965
1987 Winkel & Prosser, The Art of Digital Design

Appendix

ASCII table

dec  hex    char                dec  hex    char        dec  hex   char        dec  hex   char
0    0x0    null                32   0x20   space       64   0x40  @           96   0x60  `
1    0x1    start of heading    33   0x21   !           65   0x41  A           97   0x61  a
2    0x2    start of text       34   0x22   "           66   0x42  B           98   0x62  b
3    0x3    end of text         35   0x23   #           67   0x43  C           99   0x63  c
4    0x4    end transmission    36   0x24   $           68   0x44  D           100  0x64  d
5    0x5    inquiry             37   0x25   %           69   0x45  E           101  0x65  e
6    0x6    acknowledge         38   0x26   &           70   0x46  F           102  0x66  f
7    0x7    bell                39   0x27   '           71   0x47  G           103  0x67  g
8    0x8    backspace           40   0x28   (           72   0x48  H           104  0x68  h
9    0x9    horizontal tab      41   0x29   )           73   0x49  I           105  0x69  i
10   0xa    line feed           42   0x2a   *           74   0x4a  J           106  0x6a  j
11   0xb    vertical tab        43   0x2b   +           75   0x4b  K           107  0x6b  k
12   0xc    for mfeed           44   0x2c   ,           76   0x4c  L           108  0x6c  l
13   0xd    carriage return     45   0x2d   -           77   0x4d  M           109  0x6d  m
14   0xe    shift out           46   0x2e   .           78   0x4e  N           110  0x6e  n
15   0xf    shift in            47   0x2f   /           79   0x4f  O           111  0x6f  o
16   0x10   data link escape    48   0x30   0           80   0x50  P           112  0x70  p
17   0x11   device control 1    49   0x31   1           81   0x51  Q           113  0x71  q
18   0x12   device control 2    50   0x32   2           82   0x52  R           114  0x72  r
19   0x13   device control 3    51   0x33   3           83   0x53  S           115  0x73  s
20   0x14   device control 4    52   0x34   4           84   0x54  T           116  0x74  t
21   0x15   negative ack        53   0x35   5           85   0x55  U           117  0x75  u
22   0x16   synchronous idle    54   0x36   6           86   0x56  V           118  0x76  v
23   0x17   end of Tx block     55   0x37   7           87   0x57  W           119  0x77  w
24   0x18   cancel              56   0x38   8           88   0x58  X           120  0x78  x
25   0x19   end of medium       57   0x39   9           89   0x59  Y           121  0x79  y
26   0x1a   substitute          58   0x3a   :           90   0x5a  Z           122  0x7a  z
27   0x1b   escape              59   0x3b   ;           91   0x5b  [           123  0x7b  {
28   0x1c   file separator      60   0x3c   <           92   0x5c  \           124  0x7c  |
29   0x1d   group separator     61   0x3d   =           93   0x5d  ]           125  0x7d  }
30   0x1e   record separator    62   0x3e   >           94   0x5e  ^           126  0x7e  ~
31   0x1f   unit separator      63   0x3f   ?           95   0x5d  _           127  0x7f  del

Registers and their usage

From https://www.eecg.utoronto.ca/~amza/www.mindsec.com/files/x86regs.html.


    32 bits: EAX   EBX   ECX   EDX
    16 bits:  AX    BX    CX    DX
     8 bits:  AH AL BH BL CH CL DH DL

    EAX,AX,AH,AL: Called the **a**ccumulator register. 
                  Used for I/O port access, arithmetic, interrupt calls, etc...

    EBX,BX,BH,BL: Called the **b**ase register.
                  Used as a base pointer for memory access, gets some interrupt return values.

    ECX,CX,CH,CL: Called the **c**ounter register
                  Used as a loop counter and for shifts gets some interrupt values.

    EDX,DX,DH,DL: Called the **d**ata register
                  Used for I/O port access, arithmetic, some interrupt calls.

Makefile tricks

In recipe:

$^ is substituted with the target's dependency files $< is substituted with the first dependency $@ is substituted with the target file ${OBJ} is substituted for the variable OBJ, if any $* is substituted for match to % in generic targets
Good to include a clean target which gets rid of all generated files.
wildcard: automatically expands to a list of existing files that match pattern(s):
```
  C_SOURCES = $(wildcard kernel/*.c drivers/*.c)
```
OBJ = ${C_SOURCES:.c=.o}: convert all files ending with .c to end with .o; useful for creating list of object files to build.

Generic target for building anyfile.o from anyfile.c:

  %.o: %.c
      gcc --ffreestanding -c $< -o $@

Organization of the code

example: anything which is example or demonstration code
boot: anything related to booting and the boot sector
kernel: all kernel-related code which is not device-driver specific
drivers: any hardware specific driver code

Terminology

routine: assembly code, compiled to machine code, labeled by address, parametrized by registers.
function: C code: offset by compiler annotation, parametrized by contents of the stack.

Dependencies

MacOSX: host operating system, runs QEmu and editors.
Hex Fiend: hex editor, used to write and read raw binaries.
QEmu: x86 emulator, emulates a 32-bit x86 processor.
nasm: x86 assembler, assembles bytecode for an x86 processor.
Make: compilation tool, automates build process.
x86_64-elf-gcc, x86_64-elf-ld (brew install i386-elf-binutils i386-elf-gcc): binary utilities and GCC compiler for x86, cross-compiled for M1 Mac.

Acknowledgement

This document follows the presentation in Nick Blundell's "Writing a Simple Operating System -- from Scratch" quite closely. In fact, some explanations are paraphrased or copied from it and most of the code files are sourced from that book directly. I have combined it with my own comments and perspectives into a document in one voice, but the credit for this very well-thought out learning path belongs entirely to Dr. Blundell.

We take the assembler as a given. ↩
The CPU interprets zero-valued bytes as no-ops and thus knows to keep reading past them. If these bytes remain uninitialized, the CPU will attempt to execute them and either get itself into a bad state and reboot, or stumble upon a BIOS function that reformats the disk. ↩
If you change the magic numbers at the end of the file and try again, the BIOS should crash saying "no bootable device". ↩
To issue a far jump, as opposed to a near (standard) jump, we additionally provide the target segment: jmp <segment>:<address offset>. ↩

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
boot		boot
example		example
kernel		kernel
.DS_Store		.DS_Store
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
os-dev.pdf		os-dev.pdf
x86-architecture.md		x86-architecture.md

Folders and files

Latest commit

History

Repository files navigation

x86-bootloader

The boot process

Assembly language

The stack

Include files

Reading the disk

Spinning disk drives

Reading the disk with BIOS

32-bit protected mode

Printing to screen without BIOS

The global descriptor table

Switching into 32-bit mode

C compilation

Calling functions

Local variables in the stack

Pointers, addresses and data

Representing strings

A simple kernel in C

Writing the code

Creating the kernel image

The entry point to the kernel

Hardware

Appendix

ASCII table

Registers and their usage

Makefile tricks

Organization of the code

Terminology

Dependencies

Acknowledgement

Footnotes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages