Welcome to AROS-A Random Operating System

This is the accompanying documentation for an old project of mine where in I had written a basic x86 OS to get my hands dirty before jumping into my actual project of writing a distributed OS for small clusters.

AROS’s functionality

AROS as it stands today is a minimalist OS which can boot up and execute an associated file which is attached with the OS during boot up. In terms of I/O AROS supports keyboard input and output to a VGA style display. The code comprises with specific sections of x86 assembly which might not be compatible with all Intel and AMD based Laptops and PC’s.

Disclaimer

I have not physically booted the OS on a computer(as yest) and have tested it on a virtual machine and am not responsible if anything goes wrong while utilizing this code or any modifications of it xD.

Booting

Between a person physically turning on a computer to the system loading up the OS and booting from it, a number of intermediate tasks are performed before execution control is passed to the OS. Our focus is limited till after the BIOS passes execution control t the booloader. For our bootloader, we are going to be using GRUB(GRand unified Bootloader) about which you can learn more about on GRUB’s Homepage. An interesting side note is that modern day x86/x86_64 processors have multiple modes of operations

16bit real mode
16bit protected mode
32bit protected mode
64bit long mode (present in x86_64 systems)

When the BIOS runs, the processor is configured to run in the 16bit real mode and GRUB reconfigures the CPU to run in the 32bit protected mode, allowing us to access upto 4GB of memory.

Coming back to the topic of using GRUB as a bootloader for our OS, GRUB is a multiboot compliant bootloader which requires us to perform certain additional steps in order for GRUB to recognize our kernel. Digging into the documentation for multiboot (OSDev’s page on it is a pretty good starting point) we can see three important parameters:

Magic number
Flags
Checksum

The magic number is an arbitrary constant chosen by the multiboot specification which for multiboot complient kernels is chosen to be 0x1BADB002. The Flags parameter determines the modules which must be loaded along with the kernel which we will be setting to 0. More information regarding the function of the flags parameter and bitwise information can be found in the documentation for the multiboot standard. Finally, the checksum is a constant such that the sum of the above fields is 0. This is merely a validation to ensure that none of the data is corrupted. These three fields are required while the rest of the fields present in the multiboot header can be ignored as they directly deal with the other modules to be loaded and thus governed by the Flag field. As we are setting this field to be 0, we can safely ignore the presence of the other fields. Now that we know the fields which must be present, we also must know where these fields must be placed in memory. The documentation defines that these constants must be loaded into memory with offsets of 4bytes in the same format as I have described and must be within the first 8192bytes of the Operating System image. Apart from this, we need to know the memory address from which these parameters must be loaded. As GRUB utilizes memory below 1MB, we have to set these constants to be after the 1MB mark.

Assembly code

Okay, now that we know where what needs to be in place for GRUB to recognize our kernel, we need to write code for the same. In theory, we could use C to implement this, but I’m going to perform this using assembly with the following snippet of code,

;set.asm
section         .text
        align   4
        dd      0x1BADB002
        dd      0x00
        dd      - (0x1BADB002)
        
global loader
extern vmain
loader:
        cli
        call vmain
        hlt

Explanation of the code

The first part of the assembly code loads the constants into memory and is pretty staight forward. Once the header is loaded onto memory, we are in position to call our C code which is the main kernel. In order to do so, we must first inform our assembler regarding the existance of the kernel’s main function, the entry point, vmain which the line extern vmain accomplishes. Before calling vmain i.e pass execution controll to it, we need to clear the interrupts which the instruction cli does. The instruction hlt halts the CPU if ever the kernel stopped execution and passed execution controll back to this piece of assembly code. align 4 ensures that our header is 4byte (32 bit) aligned. Before we understand the function of the line section .text we must take a look at memory models and memory segments. In general, there are three major and important memory segments namely

Code Segment
Data Segment
Stack Segment

Code Segment

The code segment contains the actual code that is to be executed in memory. This is a fixed segment and the start of this section is indicated to the linker using .text attribute.

Data Segment

The data segment contains the data such as constants, variables etc. which is required for the code to function. This section is indicated to the linker using the .data and the .bss attributes. The .bss section is used to reserve memory which variables that are to be declared later in the program and is a static field which is initialized with 0’s in the memory.

Stack Segment

As the name implies, the stack segment is used for the program stack and must be setup so that we can run non trivial C programms. We will be looking at setting up the stack segment in the following sections.

Linker script

While we have the header ready, we have not yet defined the memory address where the header must be placed. This is done by the linker and we must define the linker script according to the specification. From the previous section, we know that we must place the .text section at the 1MB mark which translates to 0x100000. For now, we’ll set the .data and the .bss section also to begin at this address. So translating this to a linker script, we get the following linker script:

OUTPUT_FORMAT(elf32-i386)
ENTRY(loader)
SECTIONS
 {
   . = 1M;
   .text : { *(.text) }
   .data : { *(.data) }
   .bss  : { *(.bss)  }
 }

Setting up the Stack Segment

Before we speak about the stack segment, we must understand

Memory Models in 32bit modes
Segment registers

When operating in 32bit mode, we utilize a memory model known as flat memory model in which any 32bit register can be utilized for memory addressing. This is in contrast to the Multi-Segmnent memory model which was primaryly used in the 16bit era and the iconic 8086 microprocessor. Short digression into the 8086 microprocessor, there are primarily two sets of registers

General Purpose Registers
Segment Registers

General Purpose Registers

The list of 16bit GPR’s are as follows

AX: Accumulator register
BX: Base register to be be used in conjunction with the segment register(default is DS)
CX: Counter register
DX: Data register
SP: Stack Pointer which points to the top of the stack
BP: Base Pointer which points to the base of the stack
SI: Source Index which is to be used in conjunction with the segment register(default is DS)
DI: Destination Index which is to be used in conjunction with the segment register(default is DS)

The 32bit versions of these registers can be accessed in the 32bit protected mode by adding an E prefix (Ex:AX would be accessed as EAX)

Note: These GPRs are specific to certain instructions, for example CX is implicitly used with loop instructions

Segment Registers

Segment registers were extensively used in the 16bit era to hold the base address for each of the memory segments. This method allowed the 8086 to address upto 1MB of memory using 20bit address bus. In order to calculate the address of a variable, the content of the segment register was left shifted by 4bits and the contents of the corresponding offset address are summed to determine the physical address. A memory model which utilizes these segment registers is known as Multi-Segmented memory model which is in stark contrast to the flat memory model where no such partitioning exists.

Initializing Stack Segment

The stack in the x86 architecture is a downward growing stack which means that the stack pointer is decremented everytime a PUSH instruction is encountered and is incremented when the POP instruction is incremented. Therefore, the base address of the stack must be set in such a way that no memory conflicts occur and data is not overwritten. For our simple OS, we will be leaving the stack memory allocation job to the C compiler(gcc) and trusting the linker. One change that we must do to the linker script is to introduce a command that will combine all the .bss sections.

.bss:
{
  *(COMMON)
  *(.bss)
}

Display

With all the previous setup, we can now focus on writing code which allows us to interact with the software and allow it to display the result. At the very least, we require access to a display in order to print messages which will be helpful in debugging and keeping track of the tasks being performed. For this, we will be writing code for an old VGA-esq output with 8bit color depth. For now we will be writing the code to be white text on a black background, and can easily be extended to incorporate the supported color gamut.

In order to understand how we interact with I/O devices we must understand the concept of memory mapped I/O, a concept in which external peripherals are mapped to the memory address space. Therefore, we must know the address for a display. Fortunately for us, the framebuffer’s address is standardized (which you can learn more about here) and is at 0x000B8000. Since each entry is 16bits long, the address must be incremented by 16

From the documentation we can see the frame format for every charecter which is described as follows

[15-8 ]: ASCII code
[7-4 ]: Forground color code
[3-0 ]: Background color code

For implementing this in a consize and easy to use manner, we will be defining a C structure with the requisit fields as shown in the below code snippet


typedef struct cf
{
  unsigned char ch;
  unsigned char col;
}__attribute__((packed)) cf;

In the above code snippet, the field ch stores the ASCII value of the charecter to be displayed and the field col holds information for the background and foreground color. The final element of the struct __attribute__((packed)) is a compiler directive/flag which tells the compiler not to add padding to the struct ensuring that the bit fields are correctly populated.

Now with this struct, we can write our own putch() function which will allow us to write text to the screen.

static unsigned short int horpos;
static unsigned short int verpos;

const short int VGA_HOR = 80;
const short int VGA_VER = 25;

void putch(char ch)
{
	cf* fb = (cf*)0xB8000;

	cf data;
	data.col = 0xF0;

	if (verpos == VGA_VER - 1)
		verpos = 0;

	if (horpos == VGA_HOR - 1)
	{
		horpos = 0;
		verpos++;
	}
	data.ch = ch;

	if (ch == '\n')
	{
		horpos = 0;
		verpos++;
		return;
	}

	fb[verpos * VGA_HOR + horpos++] = data;
}

The above C code is not in it’s most compact form and written in a manner in which the logic is more or less explicitly stated.

Hello world Test

Okay now that we have written so much code, we are in a position to get our OS to boot up and print a “Hello World” message as a verification of it’s functionality. To do so, we need to setup our toolchain. For compiling our OS, I recommend the use of a 32bit PC running Linux. I personally use Ubuntu 16.04.6 for development on a 32bit PC as setting up a cross compiler, linker and GRUB for i386 on an AMD 64bit platform is a bit of a hassle. Now for assembling our assembly loader file, we will be using NASM and for compiling our C code, we will be using gcc. GCC will also be used for linking our files.

The flow will be the following

Using NASM to assemble our setup assembly file to into a .o file
Using GCC to compile our C code into a .o file
Using GCC-linker module to link the .o files giving us an executable
Using GRUB-mkrescue to convert our executable into an ISO file along with the GRUB bootloader

NASM

The command to be executed is as follows:

` nasm -f elf set.asm -o set.o `

Breakdown of the command:

nasm is used to invoke our assembler

-f elf is used to define the output file to be in the ELF(Executable Linkable Format) file format

set.asm is our input file

-o set.o is used to define the output file name and extension

GCC

The command to be executed is as follows:

gcc -c kernel.c -o kernel.o -ffreestanding

Breakdown of the command

gcc is used to invoke our compiler

-c flag is used to tell our compiler to compile the code, but not to link it

kernel.c is our input C file

-o kernel.o is used to define the output file name and extension

-ffreestanding is used to define that the C code is going to be executed in a free standing environment without any standard libraries which the compiler would otherwise expect to be present and also that the entrypoint is not the main function.

Linker

The command to be executed is as follows:

gcc set.o kernel.o -T linker.ld -o aros -nostdlib -nodefaultlibs -lgcc

This command takes in the preious;y compiled .o files and gives us our final executable kernel which must be wrapped up along with our bootloader and converted into an .iso image. The flags/options are self explainatory.

GRUB Bootloader

Now we need to create a bunch of folders and configuratiion files before we can execute the command necessary to convert the entire tree into a bootable ISO image. The folder tree is as follows

iso
|
|->boot
|   |->grub
|   |   |-> grub.cfg
|   |-> aros

Copy paste the executable file generated in the previous step inside the boot folder. The grub.cfg file is a configuration file which describes the kernel name, type and location in the above folder tree. It’s contents are as follows

menuentry "AROS" {
        multiboot /boot/aros
}

NOTE: DONOT CHANGE THE OPENING BRACKET LOCATION!

With this as the folder setup, we can now execute the last command which is going to convert the folder tree and give us a bootable ISO image.

grub-mkrescue iso --output = AROS.iso