xv6 is a re-implementation of Dennis Ritchie's and Ken Thompson's Unix Version 6 (v6). xv6 loosely follows the structure and style of v6, but is implemented for a modern x86-based multiprocessor using ANSI C.
xv6 is inspired by John Lions's Commentary on UNIX 6th Edition (Peer to Peer Communications; ISBN: 1-57398-013-7; 1st edition (June 14, 2000)). See also https://pdos.csail.mit.edu/6.828/, which provides pointers to on-line resources for v6.
xv6 borrows code from the following sources: JOS (asm.h, elf.h, mmu.h, bootasm.S, ide.c, console.c, and others) Plan 9 (entryother.S, mp.h, mp.c, lapic.c) FreeBSD (ioapic.c) NetBSD (console.c)
The following people have made contributions: Russ Cox (context switching, locking), Cliff Frey (MP), Xiao Yu (MP), Nickolai Zeldovich, and Austin Clements.
We are also grateful for the bug reports and patches contributed by Silas Boyd-Wickizer, Anton Burtsev, Cody Cutler, Mike CAT, Tej Chajed, eyalz800, Nelson Elhage, Saar Ettinger, Alice Ferrazzi, Nathaniel Filardo, Peter Froehlich, Yakir Goaron,Shivam Handa, Bryan Henry, Jim Huang, Alexander Kapshuk, Anders Kaseorg, kehao95, Wolfgang Keller, Eddie Kohler, Austin Liew, Imbar Marinescu, Yandong Mao, Matan Shabtay, Hitoshi Mitake, Carmi Merimovich, Mark Morrissey, mtasm, Joel Nider, Greg Price, Ayan Shafqat, Eldar Sehayek, Yongming Shen, Cam Tenny, tyfkda, Rafael Ubal, Warren Toomey, Stephen Tu, Pablo Ventura, Xi Wang, Keiichi Watanabe, Nicolas Wolovick, wxdao, Grant Wu, Jindong Zhang, Icenowy Zheng, and Zou Chang Wei.
The code in the files that constitute xv6 is Copyright 2006-2018 Frans Kaashoek, Robert Morris, and Russ Cox.
We switched our focus to xv6 on RISC-V; see the mit-pdos/xv6-riscv.git repository on github.com.
To build xv6 on an x86 ELF machine (like Linux or FreeBSD), run "make". On non-x86 or non-ELF machines (like OS X, even on x86), you will need to install a cross-compiler gcc suite capable of producing x86 ELF binaries (see https://pdos.csail.mit.edu/6.828/). Then run "make TOOLPREFIX=i386-jos-elf-". Now install the QEMU PC simulator and run "make qemu".
The goal of this project is to modify the xv6 OS to support memory sharing between processes, limited to processes that are in a parent-child relationship. The parent process should report to the operating system the memory that is shared, and all immediate children of that process can receive access from the operating system in their address space to that very memory located in the parent process. In addition to these changes, several user programs that use these system changes are implemented. This new functionality does not disrupt the current operation of the system. Processes can be started and finished normally, and that all occupied memory is properly freed.
Two new system calls are implemented: share_data
and get_data
. Both system calls work with a new shared structure that has the following attributes:
- String name, maximum 10 characters
- Pointer to start of shared space
- Size of shared space
This system call is called from the parent process to register the shared structure. When declaring a shared structure, specify a unique string name for the structure
(if the given string is longer than 10 characters, take the first 10), the address where the structure is located, and the size of the
structure. proc
structure is modified so that it stores up to 10 shared structures. Unused shared structures are marked with size 0 in the newly created process.
In case of successful writing of a new shared structure, this system call returns its serial number within the process (0-9). Possible errors for this system call are:
- -1: bad parameter passed.
- -2: there already exists a shared structure with the given name.
- -3: there are already 10 shared structures in the current process.
This system call is called from the child process to access the shared structure previously declared by the parent using the share_data
system call.
The first parameter is the string name of the structure we want to access, and the second parameter is a pointer (passed by reference) that points to the shared space
after the system call is executed. The child process already has a prepared shared structure from which it just reads the value for the addr
pointer.
In case of successful execution of this system call, the return value is 0. Possible errors for this system call are:
- -1: bad parameter passed.
- -2: there is no shared structure with the given name.
Three primary changes in the system are made: fork()
, exec()
and process shutdown.
This system call ensures that the newly created process inherits all shared structures from its parent. Since the memory image is identical in the child, there is no need to
change anything within the structures themselves or the side structures of the child. Until it does exec()
, the child process cannot access the parent's shared objects,
but only has access to its own copy. In addition, a pointer to the parent's page directory is stored in the child's proc
structure using a new attribute specifically
intended for this.
The regular process size in virtual memory is limited to 1GB, and starting from 1GB shared objects are being mapped. This system call is modified so that the newly started process in its virtual space, starting from 1GB, maps all the pages listed in the shared structures inherited from the parent. These mappings point to the same frames as in the parent directory. After the mapping is done, the shared structures are being modified, so that the virtual addresses are now valid taking into account the new mapping.
It is necessary that both the parent and all child processes can be successfully completed, as well as that all occupied memory is freed when they are shut down.
Total of three user programs are implemented - one parent (dalle
), which starts the other two (coMMa
and liSa
) as part of its work.
This user program runs an interactive system for analyzing text files. It receives the path to the file as an argument, or if there is none, the default value is "../home/README". The program prepares the following structures to share with their children:
- Path to the file we are processing -
char *
- The counter to which sentence we reached within the file -
int
- The longest word in the current sentence -
char *
- Size of the longest word in the entire text up to the current sentence (including it) -
int
- The shortest word in the current sentence -
char *
- Size of the shortest word in the entire text up to the current sentence (with it) -
int
- The longest word in the entire text up to the current sentence (including it) -
char *
- Command indicator -
int
- The shortest word in the entire text up to the current sentence (including it) -
char *
After creating nine shared structures for this data, two new processes are started as children:coMMa
andliSa
. After starting the children,dalle
just waits for the completion of both processes and then terminates itself.
This user program is tasked with accepting commands from the user in a loop. Possible commands that the user can enter are:
- latest - prints which sentence we are processing in order, as well as the longest and shortest word in that sentence
global extrema
- prints the largest and smallest word found so far and their sizepause
- pauses text processing.resume
- resumes text processing.end
- notifies the process that it should shut down, and shuts down itself. Theglobal extrema
andlatest
commands directly read from shared structures in order to write the specified data. Thepause
,resume
, andend
commands send signals to theliSa
user program using a shared command indicator.
This user program is tasked with processing a text file. Upon opening the file, it finds the shortest and longest word in the current sentence (as well as globally)
and places them in the appropriate shared structure, then increments the shared counter of the current sentence and calls sleep(150)
to simulate data analysis.
After processing each sentence, this process should check if coMMa
specified a command. In case a pause command is specified,
sleep(1)
is executed, and then the program check for the command again. Only when resume
or end
command is specified, the program
continues to work or ends.