CSS 430 for 2016 Summer
Final Project: File System
Due Date: See the syllabus
In this project, you will build a Unix-like file system on the ThreadOS. Through the use of the file system, user programs will now be able to access persistent data on disk by way of stream-oriented files rather than the more painful direct access to disk blocks with rawread() and rawrite().
Given the size of this project, you are encouraged to form a team of four or fewer students. In working with your partner(s), the workload should divided equally across all members. This project requires much more time to be completed than the previous four assignments. Schedule and pace your design and coding work accordingly. If you are very confident of working independently you may do so.
If you're using the files provided to you via the Linux Lab: Use ThreadOS' original files, (i.e., the original classes) except for Kernel.java. For Kernel.java, use the one you got in the assignment 4, (i.e., Kernel_org.java). For simplicity, you should just re-download all readable files from the ThreadOS directory to get started.
If you're using the IntelliJ starter project then all the files you need are in that project (and please disregard the above paragraph)
The file system should provide user threads with the system calls that will allow them to format, to open, to read from , to write to, to update the seek pointer of, to close, to delete, and to get the size of their files.
For simplicity, the file system being created will consiste of a single level. The "/" root directory is predefined by the file system and permanently available for user threads to store their files. No other directories are provided by the system and created by users dynamically.
Each user thread needs to keep track of all files it has opened. For this purpose, it should maintain a table of those open files in its TCB. This table is called a user file descriptor table. It has 32 entries. Whenever a thread opens a file, it must allocate to this file a new table entry, termed a file descriptor. Each file descriptor includes the file access mode and the reference to the corresponding file (structure) table entry. The file access mode indicates "read only", "write only", "read/write", or "append". The file (structure) table is a system-maintained table shared among all user threads, each entry of which maintains the seek pointer and the inode number of a file. Depending on the access mode, the seek pointer is set to the first or the tail of the file, and keeps track of a next position to read from and to write to the file. It is entirely possible for one thread to open the same file many times, thus having several entries in the corresponding TCB's user file descriptor table. Although each of these user file descriptor table entries refer to a different file (structure) table entry with its own seek pointer, all of them eventually points to the same inode.
The file system you will implement must provide the following eight system calls.
Unless specified otherwise, each of the above system calls returns -1 when detecting an error.
The first disk block, block 0, is called the superblock. It is used to describe
Starting from the blocks after the superblock will be the inode blocks. Each inode describes one file. Our inode is a simplified version of the Unix inode. It includes 12 pointers of the index block. The first 11 of these pointers point to direct blocks. The last pointer points to an indirect block. In addition, each inode must include (1) the length of the corresponding file, (2) the number of file (structure) table entries that point to this inode, and (3) the flag to indicate if it is unused (= 0), used(= 1), or in some other status (= 2, 3, 4, ...). 16 inodes can be stored in one block.
You will need a constructor that retrieves an existing inode from the disk into the memory. Given an inode number, termed inumber, this constructor reads the corresponding disk block, locates the corresponding inode information in that block, and initializes a new inode with this information.
The system must avoid any inode inconsistency among different user threads. There are two solutions to maintain the inode consistency:
The "/" root directory maintains each file in a different directory entry that contains its file name (maximum 30 characters; 60 bytes in Java) and the corresponding inode number. The directory receives the maximum number of inodes to be created, (i.e., thus the max. number of files to be created) and keeps track of which inode numbers are in use. Since the directory itself is considered as a file, its contents are maintained by an inode, specifically inode 0. This can be located in the first 32 bytes of the disk block 1.
Upon booting ThreadOS, the file system instantiates the Directory class as the root directory through its constructor, reads the file from the disk that can be found through the inode 0 at 32 bytes of the disk block 1, and initializes the Directory instnace with the file contents. Prior to shutdown, the file system must write back the Directory information onto the disk. The methods bytes2directory( ) and directory2bytes will initialize the Directory instance with a byte array read from the disk and converts the Directory instance into a byte array that will be thereafter written back to the disk.
The file system maintains the file (structure) table shared among all user threads. When a user thread opens a file, it follows the sequence listed below:
The file (structure) table entry should be:
The file (structure) table is defined as follows:
Each user thread maintains a user file descriptor table in its own TCB. Every time it opens a file, it allocates a new entry table including a reference to the corresponding file (structure) table entry. Whenever a thread spawns a new child thread, it passes a copy of its TCB to this child which thus has a copy of its parent's user file descriptor table. This in turn means the both the parent and the child refer to the same file (structure) table entries and eventually share the same files.
Note that the file descriptors 0, 1, and 2 are reserved for the standard input, output, and error, and thus those entires must remain null.
Work for this project is divided up into three sections. The goal here is to help y'all start work on this early (i.e., to divide the work up into chunks and do each chunk in good time, rather than trying to do the entire project a day or two before it's due)(NOTE: Trying to do the entire project 1 or 2 days before it's due usually results in poor code that earns a low grade or even outright failiure to complete the work). Each section is refered to as a 'phase', and each phase has it's own deadline (please see the Syllabus and/or Canvas for the due dates)
6.1: Planning Phase
For this phase you will find a group to work with (PLEASE don't try this on your own!) and figure out what you need to do for this massive project.
What to turn in for this phase: Each group will hand in a single document containing the work for this phase. Each group's document must contain:
6.2: Initial Work
For this phase you will complete some of the work for the project. You will finish the project during the Completing The Project phase.
What To Turn In For This Phase:
Each group should upload a single copy of their work so far, and an
updated list of work items.
The work should include whatever .JAVA files you've worked on. Everything
you hand in should both compile and execute. While it doesn't
have to accomplish anything in particular at this stage it's good to
include 'test code' that verifies that your individual pieces are
working.
The updated list of work items should list all the
work items for the project, should clearly indicate which items have
been completed, and should list the name(s) of the people assigned to
each item. It's expected that this will have changed since the
prior phase (which is why you're being asked to turn in an updated
version)
What you hand in during this phase does not necessarily have to be the same as the list of work items that you handed in during the previous phase. As long as you've gotten a 'reasonable amount' of work done there will be no point penalty.
One of the goals for this phase is for you to realize how much work this project is, and to schedule more time to complete this project before the final due date. In other words, one goal for this phase is for you to (potentially) be overwhelmed and not finish everything on your list for this phase so that you'll make changes to your life and then successfully finish the final project on time!
VERY IMPORTANT NOTE: NO flexibility will be provided for the next phase. If you don't hand anything in (or you hand in junk code that does poorly) there will be no extra chances to revise your work, and no adjustments made to the due date of the Project Completion phase.
6.3: Project Completion
Before you read anything else about this phase, please re-read the VERY IMPORTANT NOTE at the end of the prior phase.
At the end of this phase each group will hand in a fully complete implementation of the file system for ThreadOS. Each group should upload a single copy for the entire group.
The technical goal for this phase is to finish the design and implementation of the file system in ThreadOS as specified above. Run Test5.java which is found in the ThreadOS directory. This test program will test the functionality of every feature and system call listed above and check the data consistency of your file system. Some of the things that are tested include:
gradeguide5.txt is the grade guide for the final project.
Visit the Canvas page about general assignment information in order to double-check your working environment and turn-in procedure.
This website could answer your questions. Please click here before emailing the professor :-).