Study Groups

The Disk Layout Problem

Neudauer, Nancy Ann (2001) The Disk Layout Problem. Canadian Industrial Problem Solving Workshops > 5th IPSW [Seattle 18/5/2001 - 22/5/2001].

Full text available as:

PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Abstract/Summary

Imagine that we keep a daily log of the files that our computer reads from its hard disk. For most computer users the logs of one day compared to the next may be very similar. For example, opening up a commonly used program may require access to the same files in the same order every time that event occurs. We shall call such a sequence of files a trace.

The essence of the problem therefore is: given a set of traces that are expected to be representative of common use, we must rearrange the files on the disk so that the performance is optimized.

Programs called disk defragmenters use these simple principles to rearrange data records on a disk so that each file is contiguous, with no holes or few holes between data records. Some more sophisticated disk defragmenters also try to place related files near each other, usually based on simple static structure rather than a dynamic analysis of the accesses. We are interested in more dynamic defragmentation procedures.

We first consider a 1D model of the disk. We then look at the results from an investigation of the 2D disk model followed by a discussion of caching strategies. Finally we list some of the complications that may need to be addressed in order to make the models more realistic.

Item Type:Study Group Report
Study Group:Canadian Industrial Problem Solving Workshops > 5th IPSW [Seattle 18/5/2001 - 22/5/2001]
Company Name:Microsoft Research
Industrial Sector:None/Other
Information and communication technology
Additional Contributors:Corbett, Brian and Dresden, Gregory and Paulhus, Marc
ID Code:167
Deposited By:Michele Taroni
Deposited On:10 October 2008

Problem Statement

We can organize data on a personal computer's hard drive according to many different data strategies resulting in different performances due to disk latencies, consisting of both rotational latency and seek time. Rotational latency is a physical characteristic of the disk and motor, so we focus on the problem of storing data in a manner that optimizes the seek time of the data. The optimization of this problem will result in better performance for users.

Archive Staff Only: edit this record